COVOST 2 AND MASSIVELY MULTILINGUAL SPEECH TRANSLATION

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:INTERSPEECH (22. : 2021 : Brünn; Online) 22nd Annual Conference of the International Speech Communication Association (INTERSPEECH 2021) ; Volume 5 of 6
1. Verfasser: Wang, Changhan (VerfasserIn)
Weitere Verfasser: Wu, Anne (VerfasserIn), Gu, Jiatao (VerfasserIn), Pino, Juan (VerfasserIn)
Pages:22
Format: UnknownFormat
Sprache:eng
Veröffentlicht: 2021
Schlagworte:
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Titel Jahr Verfasser
UNITNET-BASED HYBRID SPEECH SYNTHESIS 2021 Zhou, Xiao
A NEURAL-NETWORK-BASED APPROACH TO IDENTIFYING SPEAKERS IN NOVELS 2021 Chen, Yue
GANSPEECH: ADVERSARIAL TRAINING FOR HIGH-FIDELITY MULTI-SPEAKER SPEECH SYNTHESIS 2021 Yang, Jinhyeok
APPLYING THE INFORMATION BOTTLENECK PRINCIPLE TO PROSODIC REPRESENTATION LEARNING 2021 Zhang, Guangyan
PHONEME DURATION MODELING USING SPEECH RHYTHM-BASED SPEAKER EMBEDDINGS FOR MULTI-SPEAKER SPEECH SYNTHESIS 2021 Fujita, Kenichi
IMPROVING MULTI-SPEAKER TTS PROSODY VARIANCE WITH A RESIDUAL ENCODER AND NORMALIZING FLOWS 2021 Valles-Perez, Ivan
ALTERNATE ENDINGS: IMPROVING PROSODY FOR INCREMENTAL NEURAL TTS WITH PREDICTED FUTURE TEXT INPUT 2021 Stephenson, Brooke
CROSS-LINGUAL SPEAKER ADAPTATION USING DOMAIN ADAPTATION AND SPEAKER CONSISTENCY LOSS FOR TEXT-TO-SPEECH SYNTHESIS 2021 Xin, Detai
CROSS-LINGUAL VOICE CONVERSION WITH DISENTANGLED UNIVERSAL LINGUISTIC REPRESENTATIONS 2021 Yang, Zhenchuan
FINE-GRAINED STYLE MODELING, TRANSFER AND PREDICTION IN TEXT-TO- SPEECH SYNTHESIS VIA PHONE-LEVEL CONTENT-STYLE DISENTANGLEMENT 2021 Tan, Daxin
EXPRESSIVE TEXT-TO-SPEECH USING STYLE TAG 2021 Kim, Minchan
IMPROVING PERFORMANCE OF SEEN AND UNSEEN SPEECH STYLE TRANSFER IN END-TO-END NEURAL TTS 2021 An, Xiaochun
PERCEPTION OF SOCIAL SPEAKER CHARACTERISTICS IN SYNTHETIC SPEECH 2021 Rallabandi, Sai Sirisha
SPECTRAL AND LATENT SPEECH REPRESENTATION DISTORTION FOR TTS EVALUATION 2021 Kongthaworn, Thananchai
LITETTS: A LIGHTWEIGHT MEL-SPECTROGRAM-FREE TEXT-TO-WAVE SYNTHESIZER BASED ON GENERATIVE ADVERSARIAL NETWORKS 2021 Nguyen, Huu-Kim
DIFF-TTS: A DENOISING DIFFUSION MODEL FOR TEXT-TO-SPEECH 2021 Jeong, Myeonghun
TRANSFORMER-BASED ACOUSTIC MODELING FOR STREAMING SPEECH SYNTHESIS 2021 Wu, Chunyang
PHONETIC AND PROSODIC INFORMATION ESTIMATION FROM TEXTS FOR GENUINE JAPANESE END-TO-END TEXT-TO-SPEECH 2021 Kakegawa, Naoto
SPEED UP TRAINING WITH VARIABLE LENGTH INPUTS BY EFFICIENT BATCHING STRATEGIES 2021 Ge, Zhenhao
OPEN-SET AUDIO CLASSIFICATION WITH LIMITED TRAINING RESOURCES BASED ON AUGMENTATION ENHANCED VARIATIONAL AUTO-ENCODER GAN WITH DETECTION-CLASSIFICATION JOINT TRAINING 2021 Teh, Kah Kuan
Alle Artikel auflisten