Y-VECTOR: MULTISCALE WAVEFORM ENCODER FOR SPEAKER EMBEDDING

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:INTERSPEECH (22. : 2021 : Brünn; Online) 22nd Annual Conference of the International Speech Communication Association (INTERSPEECH 2021) ; Volume 1 of 6
1. Verfasser: Zhu, Ge (VerfasserIn)
Weitere Verfasser: Jiang, Fei (VerfasserIn), Duan, Zhiyao (VerfasserIn)
Pages:22
Format: UnknownFormat
Sprache:eng
Veröffentlicht: 2021
Schlagworte:
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Titel Jahr Verfasser
SPECAUGMENT++: A HIDDEN SPACE DATA AUGMENTATION METHOD FOR ACOUSTIC SCENE CLASSIFICATION 2021 Wang, Helin
A STUDY ON FINE-TUNING WAV2VEC2.0 MODEL FOR THE TASK OF MISPRONUNCIATION DETECTION AND DIAGNOSIS 2021 Peng, Linkai
EXPLORE WAV2VEC 2.0 FOR MISPRONUNCIATION DETECTION 2021 Xu, Xiaoshuo
ALIGNED CONTRASTIVE PREDICTIVE CODING 2021 Chorowski, Jan
SPEECH DISORDER CLASSIFICA TION USING EXTENDED FACTORIZED HIERARCHICAL VARIATIONAL AUTO-ENCODERS 2021 Oi, Jinzi
UNCERTAINTY-AWARE COVID-19 DETECTION FROM IMBALANCED SOUND DATA 2021 Xia, Tong
MODELING THE EFFECT OF MILITARY OXYGEN MASKS ON SPEECH CHARACTERISTICS 2021 Elie, Benjamin
DETECTING ENGLISH SPEECH IN THE AIR TRAFFIC CONTROL VOICE COMMUNICATION 2021 Szoke, Igor
LEXICAL ENTRAINMENT AND INTRA-SPEAKER VARIABILITY IN COOPERATIVE DIALOGUES 2021 Menshikova, Alla
A PSYCHOLOGY-DRIVEN COMPUTATIONAL ANALYSIS OF POLITICAL INTERVIEWS 2021 Cook, Darren
CROSS-MODAL LEARNING FOR AUDIO-VISUAL VIDEO PARSING 2021 Lamba, Jatin
A PARTITIONED-BLOCK FREQUENCY-DOMAIN ADAPTIVE KALMAN FILTER FOR STEREOPHONIC ACOUSTIC ECHO CANCELLATION 2021 Zhu, Rui
SRIB-LEAP SUBMISSION TO FAR-FIELD MULTI-CHANNEL SPEECH ENHANCEMENT CHALLENGE FOR VIDEO CONFERENCING 2021 Raj, R. G. Prithvi
DIFFERENTIABLE ALLOPHONE GRAPHS FOR LANGUAGE-UNIVERSAL SPEECH RECOGNITION 2021 Yan, Brian
USING LARGE SELF-SUPERVISED MODELS FOR LOW-RESOURCE SPEECH RECOGNITION 2021 Krishna, D. N.
TOWARDS ONE MODEL TO RULE ALL: MULTILINGUAL STRATEGY FOR DIALECTAL CODE-SWITCHING ARABIC ASR 2021 Chowdhury, Shammur Absar
HIERARCHICAL PHONE RECOGNITION WITH COMPOSITIONAL PHONETICS 2021 Li, Xinjian
ON MODELING GLOTTAL SOURCE INFORMATION FOR PHONATION ASSESSMENT IN PARKINSON'S DISEASE 2021 Vasquez-Correa, J. C.
IMAGE-BASED ASSESSMENT OF JAW PARAMETERS AND JAW KINEMATICS FOR ARTICULATORY SIMULATION: PRELIMINARY RESULTS 2021 Abraham, Ajish K.
STOCHASTIC PROCESS REGRESSION FOR CROSS-CULTURAL SPEECH EMOTION RECOGNITION 2021 Kumar, Mani T.
Alle Artikel auflisten