END-TO-END ASR WITH ADAPTIVE SPAN SELF-ATTENTION

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:INTERSPEECH (21. : 2020 : Online) Cognitive intelligence for speech processing ; Volume 5 of 7
1. Verfasser: Chang, Xuankai (VerfasserIn)
Weitere Verfasser: Subramanian, Aswin Shanmugam (VerfasserIn), Guo, Pengcheng (VerfasserIn), Watanabe, Shinji (VerfasserIn), Fujita, Yuya (VerfasserIn), Omachi, Motoi (VerfasserIn)
Pages:5
Format: UnknownFormat
Sprache:eng
Veröffentlicht: 2020
Schlagworte:
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Titel Jahr Verfasser
SURFBOARD: AUDIO FEATURE EXTRACTION FOR MODERN MACHINE LEARNING 2020 Lenain, Raphael
MULTI-LINGUAL MULTI-SPEAKER TEXT-TO-SPEECH SYNTHESIS FOR VOICE CLONING WITH ONLINE SPEAKER ENROLLMENT 2020 Liu, Zhaoyu
WHISPER ACTIVITY DETECTION USING CNN-LSTM BASED ATTENTION POOLING NETWORK TRAINED FOR A SPEAKER IDENTIFICATION TASK 2020 Naini, Abinay Reddy
EFFICIENT NEURAL SPEECH SYNTHESIS FOR LOW-RESOURCE LANGUAGES THROUGH MULTILINGUAL MODELING 2020 Korte, Marcel De
META-LEARNING FOR SHORT UTTERANCE SPEAKER RECOGNITION WITH IMBALANCE LENGTH PAIRS 2020 Kye, Seong Min
SEGMENT-LEVEL EFFECTS OF GENDER, NATIONALITY AND EMOTION INFORMATION ON TEXT-INDEPENDENT SPEAKER VERIFICATION 2020 Li, Kai
AUTOMATIC SCORING AT MULTI-GRANULARITY FOR L2 PRONUNCIATION 2020 Lin, Binghuai
AN EFFECTIVE END-TO-END MODELING APPROACH FOR MISPRONOUNCIATION DETECTION 2020 Lo, Tien-Hong
UNSUPERVISED FEATURE ADAPTATION USING ADVERSARIAL MULTI-TASK TRAINING FOR AUTOMATIC EVALUATION OF CHILDREN'S SPEECH 2020 Duan, Richeng
CONTEXT-AWARE GOODNESS OF PRONUNCIATION FOR COMPUTER-ASSISTED PRONUNCIATION TRAINING 2020 Shi, Jiatong
ATTENTIVE CONVOLUTIONAL RECURRENT NEURAL NETWORK USING PHONEME- LEVEL ACOUSTIC REPRESENTATION FOR RARE SOUND EVENT DETECTION 2020 Upadhyav, Shreva G.
VERY SHORT-TERM CONFLICT INTENSITY ESTIMATION USING FISHER VECTORS 2020 Gosztolya, Gabor
SPEAKER DISCRIMINATION IN HUMANS AND MACHINES: EFFECTS OF SPEAKING STYLE VARIABILITY 2020 Afshan, Amber
EFFECTS OF COMMUNICATION CHANNELS AND ACTOR'S GENDER ON EMOTION IDENTIFICATION BY NATIVE MANDARIN SPEAKERS 2020 Lin, Yi
HIDER-FINDER-COMBINER: AN ADVERSARIAL ARCHITECTURE FOR GENERAL SPEECH SIGNAL MODIFICATION 2020 Webber, Jacob J.
SPEAKER REPRESENTATION LEARNING USING GLOBAL CONTEXT GUIDED CHANNEL AND TIME-FREQUENCY TRANSFORMATIONS 2020 Xia, Wei
DEEP SPEECH INPAINTING OF TIME-FREQUENCY MASKS 2020 Kegler, Mikolaj
ON LOSS FUNCTIONS AND RECURRENCY TRAINING FOR GAN-BASED SPEECH ENHANCEMENT SYSTEMS 2020 Zhang, Zhuohuang
THE METHOD OF RANDOM DIRECTIONS OPTIMIZATION FOR STEREO AUDIO SOURCE SEPARATION 2020 Golokolenko, Oleg
GEV BEAMFORMING SUPPORTED BY DOA-BASED MASKS GENERATED ON PAIRS OF MICROPHONES 2020 Grondin, Francois
Alle Artikel auflisten