AUTOMATIC SPEECH RECOGNITION OF DISORDERED SPEECH: PERSONALIZED MODELS OUTPERFORMING HUMAN LISTENERS ON SHORT PHRASES

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:INTERSPEECH (22. : 2021 : Brünn; Online) 22nd Annual Conference of the International Speech Communication Association (INTERSPEECH 2021) ; Volume 4 of 6
1. Verfasser: Green, Jordan R. (VerfasserIn)
Weitere Verfasser: Macdonald, Robert L. (VerfasserIn), Jiang, Pan-Pan (VerfasserIn), Cattiau, Julie (VerfasserIn), Heywood, Rus (VerfasserIn), Cave, Richard (VerfasserIn), Seaver, Katie (VerfasserIn), Ladewig, Marilyn A. (VerfasserIn), Tobin, Jimmy (VerfasserIn), Brenner, Michael P. (VerfasserIn), Nelson, Philip C. (VerfasserIn), Tomanek, Katrin (VerfasserIn)
Pages:22
Format: UnknownFormat
Sprache:eng
Veröffentlicht: 2021
Schlagworte:
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Titel Jahr Verfasser
ONLINE SPEAKER DIARIZATION EQUIPPED WITH DISCRIMINATIVE MODELING AND GUIDED INFERENCE 2021 Wan, Xucheng
ONLINE STREAMING END-TO-END NEURAL DIARIZATION HANDLING OVERLAPPING SPEECH AND FLEXIBLE NUMBERS OF SPEAKERS 2021 Xue, Yawen
SEMI-SUPERVISED TRAINING WITH PSEUDO-LABELING FOR END-TO-END NEURAL DIARIZATION 2021 Takashima, Yuki
ECAPA-TDNN EMBEDDINGS FOR SPEAKER DIARIZATION 2021 Dawalatabad, Nauman
TARGET-SPEAKER VOICE ACTIVITY DETECTION WITH IMPROVED I-VECTOR ESTIMATION FOR UNKNOWN NUMBER OF SPEAKER 2021 He, Maokui
LEAP SUBMISSION FOR THE THIRD DIHARD DIARIZATION CHALLENGE 2021 Singh, Prachi
INVESTIGATION OF SPATIAL-ACOUSTIC FEATURES FOR OVERLAPPING SPEECH DETECTION IN MULTIPARTY MEETINGS 2021 Zhang, Shiliang
ROBUST END-TO-END SPEAKER DIARIZATION WITH CONFORMER AND ADDITIVE MARGIN PENALTY 2021 Leung, Tsun-Yat
AUTOMATIC ERROR CORRECTION FOR SPEAKER EMBEDDING LEARNING WITH NOISY LABELS 2021 Tong, Fuchuan
FAIR VOICE BIOMETRICS: IMPACT OF DEMOGRAPHIC IMBALANCE ON GROUP FAIRNESS IN SPEAKER RECOGNITION 2021 Fenu, Gianni
AUDIO RETRIEVAL WITH NATURAL LANGUAGE QUERIES 2021 Oncescu, Andreea-Maria
ORCA-SLANG: AN AUTOMATIC MULTI-STAGE SEMI-SUPERVISED DEEP LEARNING FRAMEWORK FOR LARGE-SCALE KILLER WHALE CALL TYPE IDENTIFICATION 2021 Bergler, Christian
VOICE PRIVACY THROUGH X-VECTOR AND CYCLEGAN-BASED ANONYMIZATION 2021 Prajapati, Gauri P.
A TWO-STAGE APPROACH TO SPEECH BANDWIDTH EXTENSION 2021 Lin, Ju
NU-WAVE: A DIFFUSION PROBABILISTIC MODEL FOR NEURAL AUDIO UPSAMPLING 2021 Lee, Junhyeok
X-NET: A JOINT SCALE DOWN AND SCALE UP METHOD FOR VOICE CALL 2021 Wen, Liang
FUSION-NET: TIME-FREQUENCY INFORMATION FUSION Y-NETWORK FOR SPEECH ENHANCEMENT 2021 Nareddula, Santhan Kumar Reddy
A SPECTRO-TEMPORAL GLIMPSING INDEX (STGI) FOR SPEECH INTELLIGIBILITY PREDICTION 2021 Edraki, Amin
TEMPORAL CONVOLUTIONAL NETWORK WITH FREQUENCY DIMENSION ADAPTIVE ATTENTION FOR SPEECH ENHANCEMENT 2021 Zhang, Oiquan
IMPROVING PERCEPTUAL QUALITY BY PHONE-FORTIFIED PERCEPTUAL LOSS USING WASSERSTEIN DISTANCE FOR SPEECH ENHANCEMENT 2021 Hsieh, Tsun-An
Alle Artikel auflisten