LARGE-SCALE PRE-TRAINING OF END-TO-END MULTI-TALKER ASR FOR MEETING TRANSCRIPTION WITH SINGLE DISTANT MICROPHONE

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:INTERSPEECH (22. : 2021 : Brünn; Online) 22nd Annual Conference of the International Speech Communication Association (INTERSPEECH 2021) ; Volume 2 of 6
1. Verfasser: Kanda, Naoyuki (VerfasserIn)
Weitere Verfasser: Ye, Guoli (VerfasserIn), Wu, Yu (VerfasserIn), Gaur, Yashesh (VerfasserIn), Wang, Xiaofei (VerfasserIn), Meng, Zhong (VerfasserIn), Chen, Zhuo (VerfasserIn), Yoshioka, Takuya (VerfasserIn)
Pages:22
Format: UnknownFormat
Sprache:eng
Veröffentlicht: 2021
Schlagworte:
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Titel Jahr Verfasser
SPEECH BASED DEPRESSION SEVERITY LEVEL CLASSIFICATION USING A MULTI- STAGE DILATED CNN-LSTM MODEL 2021 Seneviratne, Nadee
ACOUSTIC ECHO CANCELLATION USING DEEP COMPLEX NEURAL NETWORK WITH NONLINEAR MAGNITUDE COMPRESSION AND PHASE INFORMATION 2021 Peng, Renhua
LOW-DELAY SPEECH ENHANCEMENT USING PERCEPTUALLY MOTIVATED TARGET AND LOSS 2021 Zhang, Xu
DEVICE PLAYBACK AUGMENTATION WITH ECHO CANCELLATION FOR KEYWORD SPOTTING 2021 Lopatka, Kuba
END-TO-END LANGUAGE DIARIZATION FOR BILINGUAL CODE-SWITCHING SPEECH 2021 Liu, Hexin
EXPLORING WAV2VEC 2.0 ON SPEAKER VERIFICATION AND LANGUAGE IDENTIFICATION 2021 Fan, Zhiyun
IMPROVING CUSTOMIZATION OF NEURAL TRANSDUCERS BY MITIGATING ACOUSTIC MISMATCH OF SYNTHESIZED AUDIO 2021 Kurata, Gakuto
CORRECTING AUTOMATED AND MANUAL SPEECH TRANSCRIPTION ERRORS USING WARPED LANGUAGE MODELS 2021 Namazifar, Mahdi
FAST TEXT-ONLY DOMAIN ADAPTATION OF RNN-TRANSDUCER PREDICTION NETWORK 2021 Pylkkonen, Janne
ON SAMPLING-BASED TRAINING CRITERIA FOR NEURAL LANGUAGE MODELING 2021 Gao, Yingbo
MODELING DIALECTAL VARIATION FOR SWISS GERMAN AUTOMATIC SPEECH RECOGNITION 2021 Khosravani, Abbas
EQUIVALENCE OF SEGMENTAL AND NEURAL TRANSDUCER MODELING: A PROOF OF CONCEPT 2021 Zhou, Wei
LOW RESOURCE ASR: THE SURPRISING EFFECTIVENESS OF HIGH RESOURCE TRANSLITERATION 2021 Khare, Shreya
LISTEN WITH INTENT: IMPROVING SPEECH RECOGNITION WITH AUDIO-TO-INTENT FRONT-END 2021 Ray, Swayambhu Nath
EXPLORING TARGETED UNIVERSAL ADVERSARIAL PERTURBATIONS TO END-TO-END ASR MODELS 2021 Lu, Zhiyun
A DEEP LEARNING METHOD TO MULTI-CHANNEL ACTIVE NOISE CONTROL 2021 Zhang, Hao
CANCELLATION OF LOCAL COMPETING SPEAKER WITH NEAR-FIELD LOCALIZATION FOR DISTRIBUTED AD-HOC SENSOR NETWORK 2021 Zarazaga, Pablo Perez
EXPLAINING DEEP LEARNING MODELS FOR SPEECH ENHANCEMENT 2021 Sivasankaran, Sunit
LIRA: LEARNING VISUAL SPEECH REPRESENTATIONS FROM AUDIO THROUGH SELF-SUPERVISION 2021 Ma, Pingchuan
TALK, DON'T WRITE: A STUDY OF DIRECT SPEECH-BASED IMAGE RETRIEVAL 2021 Sanabria, Ramon
Alle Artikel auflisten