THREE-CLASS OVERLAPPED SPEECH DETECTION USING A CONVOLUTIONAL RECURRENT NEURAL NETWORK

Bibliographic Details
Published in: INTERSPEECH (22nd : 2021 : Brno; Online) 22nd Annual Conference of the International Speech Communication Association (INTERSPEECH 2021) ; Volume 4 of 6
Main author: Jung, Jee-Weon (Author)
Other authors: Heo, Hee-Soo (Author), Kwon, Youngki (Author), Chung, Joon Son (Author), Lee, Bong-Jin (Author)
Pages: 22
Format: UnknownFormat
Language: English
Published: 2021
Title Year Author
A HANDS-ON COMPARISON OF DNNS FOR DIALOG SEPARATION USING TRANSFER LEARNING FROM MUSIC SOURCE SEPARATION 2021 Strauss, Martin
EMPIRICAL ANALYSIS OF GENERALIZED ITERATIVE SPEECH SEPARATION NETWORKS 2021 Luo, Yi
TECANET: TEMPORAL-CONTEXTUAL ATTENTION NETWORK FOR ENVIRONMENT-AWARE SPEECH DEREVERBERATION 2021 Wang, Helin
RESIDUAL ECHO AND NOISE CANCELLATION WITH FEATURE ATTENTION MODULE AND MULTI-DOMAIN LOSS FUNCTION 2021 Gu, Jianjun
SHOULD WE ALWAYS SEPARATE?: SWITCHING BETWEEN ENHANCED AND OBSERVED SIGNALS FOR OVERLAPPING SPEECH RECOGNITION 2021 Sato, Hiroshi
MIMO SELF-ATTENTIVE RNN BEAMFORMER FOR MULTI-SPEAKER SPEECH SEPARATION 2021 Li, Xiyun
PRESENTATION MATTERS: EVALUATING SPEAKER IDENTIFICATION TASKS 2021 O'Brien, Benjamin
DR-VECTORS: DECISION RESIDUAL NETWORKS AND AN IMPROVED LOSS FOR SPEAKER RECOGNITION 2021 Pelecanos, Jason
GRAPH-BASED LABEL PROPAGATION FOR SEMI-SUPERVISED SPEAKER IDENTIFICATION 2021 Chen, Long
ADAPTIVE MARGIN CIRCLE LOSS FOR SPEAKER VERIFICATION 2021 Xiao, Runqiu
A GENERATIVE MODEL FOR DURATION-DEPENDENT SCORE CALIBRATION 2021 Cumani, Sandro
ADVERSARIAL DISENTANGLEMENT OF SPEAKER REPRESENTATION FOR ATTRIBUTE-DRIVEN PRIVACY PRESERVATION 2021 Noe, Paul-Gauthier
CODED SPEECH ENHANCEMENT USING NEURAL NETWORK-BASED VECTOR-QUANTIZED RESIDUAL FEATURES 2021 Cheon, Youngju
N-MTTL SI MODEL: NON-INTRUSIVE MULTI-TASK TRANSFER LEARNING-BASED SPEECH INTELLIGIBILITY PREDICTION MODEL WITH SCENERY CLASSIFICATION 2021 Marcinek, Lubos
END-TO-END OPTIMIZED MULTI-STAGE VECTOR QUANTIZATION OF SPECTRAL ENVELOPES FOR SPEECH AND AUDIO CODING 2021 Vali, Mohammad Hassan
RESTORING DEGRADED SPEECH VIA A MODIFIED DIFFUSION MODEL 2021 Zhang, Jianwei
INCORPORATING EMBEDDING VECTORS FROM A HUMAN MEAN-OPINION SCORE PREDICTION MODEL FOR MONAURAL SPEECH ENHANCEMENT 2021 Nayem, Khandokar Md.
METRICGAN+: AN IMPROVED VERSION OF METRICGAN FOR SPEECH ENHANCEMENT 2021 Fu, Szu-Wei
PILOT: INTRODUCING TRANSFORMERS FOR PROBABILISTIC SOUND EVENT LOCALIZATION 2021 Schymura, Christopher
RELIABLE INTENSITY VECTOR SELECTION FOR MULTI-SOURCE DIRECTION-OF-ARRIVAL ESTIMATION USING A SINGLE ACOUSTIC VECTOR SENSOR 2021 Geng, Jianhua