MASSIVELY MULTILINGUAL ASR: 50 LANGUAGES, 1 MODEL, 1 BILLION PARAMETERS

Bibliographic details
Published in: INTERSPEECH (21st : 2020 : Online) Cognitive intelligence for speech processing ; Volume 7 of 7
Main author: Pratap, Vineel (Author)
Other authors: Sriram, Anuroop (Author), Tomasello, Paden (Author), Hannun, Awni (Author), Liptchinsky, Vitaliy (Author), Synnaeve, Gabriel (Author), Collobert, Ronan (Author)
Pages: 7
Language: English
Published: 2020
Title Year Author
DOMAIN ADAPTATION USING CLASS SIMILARITY FOR ROBUST SPEECH RECOGNITION 2020 Zhu, Han
INCREMENTAL MACHINE SPEECH CHAIN TOWARDS ENABLING LISTENING WHILE SPEAKING IN REAL TIME 2020 Novitasari, Sashi
COPYCAT: MANY-TO-MANY FINE-GRAINED PROSODY TRANSFER FOR NEURAL TEXT-TO-SPEECH 2020 Karlapati, Sri
SPEAKING SPEED CONTROL OF END-TO-END SPEECH SYNTHESIS USING SENTENCE-LEVEL CONDITIONING 2020 Bae, Jae-Sung
IMPROVING THE PROSODY OF RNN-BASED ENGLISH TEXT-TO-SPEECH SYNTHESIS BY INCORPORATING A BERT MODEL 2020 Kenter, Tom
PROSODY LEARNING MECHANISM FOR SPEECH SYNTHESIS SYSTEM WITHOUT TEXT LENGTH LIMIT 2020 Zeng, Zhen
MULTI-REFERENCE NEURAL TTS STYLIZATION WITH ADVERSARIAL CYCLE CONSISTENCY 2020 Whitehill, Matt
CROSS-LINGUISTIC INTERACTION BETWEEN PHONOLOGICAL CATEGORIZATION AND ORTHOGRAPHY PREDICTS PROSODIC EFFECTS IN THE ACQUISITION OF PORTUGUESE LIQUIDS BY L1-MANDARIN LEARNERS 2020 Zhou, Chao
HIFI-GAN: HIGH-FIDELITY DENOISING AND DEREVERBERATION BASED ON SPEECH DEEP FEATURES IN ADVERSARIAL NETWORKS 2020 Su, Jiaqi
SQUEEZE FOR SNEEZE: COMPACT NEURAL NETWORKS FOR COLD AND FLU RECOGNITION 2020 Albes, Merlin
DOMAIN ADAPTATION FOR ENHANCING SPEECH-BASED DEPRESSION DETECTION IN NATURAL ENVIRONMENTAL CONDITIONS USING DILATED CNNS 2020 Huang, Zhaocheng
TONGUE AND LIP MOTION PATTERNS IN ALARYNGEAL SPEECH 2020 Teplansky, Kristin J.
IMPROVING REPLAY DETECTION SYSTEM WITH CHANNEL CONSISTENCY DENSENEXT FOR THE ASVSPOOF 2019 CHALLENGE 2020 Zhang, Chao
INVESTIGATING THE VISUAL LOMBARD EFFECT WITH GABOR BASED FEATURES 2020 Chiu, Waito
DEVELOPMENT OF A SPEECH QUALITY DATABASE UNDER UNCONTROLLED CONDITIONS 2020 Ragano, Alessandro
EVALUATING THE RELIABILITY OF ACOUSTIC SPEECH EMBEDDINGS 2020 Algayres, Robin
A PYRAMID RECURRENT NETWORK FOR PREDICTING CROWDSOURCED SPEECH-QUALITY RATINGS OF REAL-WORLD SIGNALS 2020 Dong, Xuan
EFFECT OF SPECTRAL COMPLEXITY REDUCTION AND NUMBER OF INSTRUMENTS ON MUSICAL ENJOYMENT WITH COCHLEAR IMPLANTS 2020 Brueggeman, Avamarie
DETECTING AUDIO ATTACKS ON ASR SYSTEMS WITH DROPOUT UNCERTAINTY 2020 Jayashankar, Tejas
VOICE TRANSFORMER NETWORK: SEQUENCE-TO-SEQUENCE VOICE CONVERSION USING TRANSFORMER WITH TEXT-TO-SPEECH PRETRAINING 2020 Huang, Wen-Chin