AN OPEN-SOURCE VOICE TYPE CLASSIFIER FOR CHILD-CENTERED DAYLONG RECORDINGS

Bibliographic details
Published in: INTERSPEECH (21st : 2020 : Online) Cognitive intelligence for speech processing ; Volume 5 of 7
Main author: Lavechin, Marvin (author)
Other authors: Bousbib, Ruben (author), Bredin, Herve (author), Dupoux, Emmanuel (author), Cristia, Alejandrina (author)
Pages: 5
Language: English
Published: 2020
Title Year Author
DYNAMIC SOFT WINDOWING AND LANGUAGE DEPENDENT STYLE TOKEN FOR CODE-SWITCHING END-TO-END SPEECH SYNTHESIS 2020 Fu, Ruibo
PHONOLOGICAL FEATURES FOR 0-SHOT MULTILINGUAL SPEECH SYNTHESIS 2020 Staib, Marlene
ON IMPROVING CODE MIXED SPEECH SYNTHESIS WITH MIXLINGUAL GRAPHEME-TO-PHONEME MODEL 2020 Bansal, Shubham
UNSUPERVISED TRAINING OF SIAMESE NETWORKS FOR SPEAKER VERIFICATION 2020 Khan, Umair
AN END-TO-END MISPRONUNCIATION DETECTION SYSTEM FOR L2 ENGLISH SPEECH LEVERAGING NOVEL ANTI-PHONE MODELING 2020 Yan, Bi-Cheng
RECOGNIZE MISPRONUNCIATIONS TO IMPROVE NON-NATIVE ACOUSTIC MODELING THROUGH A PHONE DECODER BUILT FROM ONE EDIT DISTANCE FINITE STATE AUTOMATON 2020 Chu, Wei
DIARIZATION PARTIAL AUC OPTIMISATION USING RECURRENT NEURAL NETWORKS FOR MUSIC DETECTION WITH LIMITED TRAINING 2020 Gimeno, Pablo
AN OPEN-SOURCE VOICE TYPE CLASSIFIER FOR CHILD-CENTERED DAYLONG RECORDINGS 2020 Lavechin, Marvin
MULTI-TALKER ASR FOR UNKNOWN NUMBER OF SOURCES: JOINT TRAINING OF SOURCE COUNTING, SEPARATION AND ASR 2020 Neumann, Thilo Von
PREDICTING COLLABORATIVE TASK PERFORMANCE USING GRAPH INTERLOCUTOR ACOUSTIC NETWORK IN SMALL GROUP INTERACTION 2020 Zhong, Shun-Chang
EFFECTS OF COMMUNICATION CHANNELS AND ACTOR'S GENDER ON EMOTION IDENTIFICATION BY NATIVE MANDARIN SPEAKERS 2020 Lin, Yi
DETECTION OF VOICING AND PLACE OF ARTICULATION OF FRICATIVES WITH DEEP LEARNING IN A VIRTUAL SPEECH AND LANGUAGE THERAPY TUTOR 2020 Anjos, Ivo
ENHANCING MONOTONICITY FOR ROBUST AUTOREGRESSIVE TRANSFORMER TTS 2020 Liang, Xiangyu
LEARNING JOINT ARTICULATORY-ACOUSTIC REPRESENTATIONS WITH NORMALIZING FLOWS 2020 Saha, Pramit
HOW DOES LABEL NOISE AFFECT THE QUALITY OF SPEAKER EMBEDDINGS? 2020 Pham, Minh
DEEP SPEAKER EMBEDDING WITH LONG SHORT TERM CENTROID LEARNING FOR TEXT-INDEPENDENT SPEAKER VERIFICATION 2020 Peng, Junyi
NEURAL DISCRIMINANT ANALYSIS FOR DEEP SPEAKER EMBEDDING 2020 Li, Lantian
REAL-TIME SINGLE-CHANNEL DEEP NEURAL NETWORK-BASED SPEECH ENHANCEMENT ON EDGE DEVICES 2020 Shankar, Nikhil
IMPROVED SPEECH ENHANCEMENT USING A TIME-DOMAIN GAN WITH MASK LEARNING 2020 Lin, Ju
EFFICIENT LOW-LATENCY SPEECH ENHANCEMENT WITH MOBILE AUDIO STREAMING NETWORKS 2020 Romaniuk, Michal