RECOAPY: DATA RECORDING, PRE-PROCESSING AND PHONETIC TRANSCRIPTION FOR END-TO-END SPEECH-BASED APPLICATIONS

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:INTERSPEECH (21. : 2020 : Online) Cognitive intelligence for speech processing ; Volume 1 of 7
1. Verfasser: Stan, Adriana (VerfasserIn)
Pages:1
Format: UnknownFormat
Sprache:eng
Veröffentlicht: 2020
Schlagworte:
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Titel Jahr Verfasser
ASAPP-ASR: MULTISTREAM CNN AND SELF-ATTENTIVE SRU FOR SOTA SPEECH RECOGNITION 2020 Pan, Jing
COMPRESSING LSTM NETWORKS WITH HIERARCHICAL COARSE-GRAIN SPARSITY 2020 Kadetotad, Deepak
BLSTM-DRIVEN STREAM FUSION FOR AUTOMATIC SPEECH RECOGNITION: NOVEL METHODS AND A MULTI-SIZE WINDOW FUSION EXAMBPLE 2020 Lohrenz, Timo
DIFFERENTIAL BEAMFORMING FOR UNIFORM CIRCULAR ARRAY WITH DIRECTIONAL MICROPHONES 2020 Huang, Weilong
COMPUTATIONALLY EFFICIENT AND VERSATILE FRAMEWORK FOR JOINT OPTIMIZATION OF BLIND SPEECH SEPARATION AND DEREVERBERATION 2020 Ikeshita, Tomohiro Nakatani. Rintaro
CONGRUENT AUDIOVISUAL SPEECH ENHANCES CORTICAL ENVELOPE TRACKING DURING AUDITORY SELECTIVE ATTENTION 2020 Fu, Zhen
AUTOMATIC ANALYSIS OF SPEECH PROSODY IN DUTCH 2020 Hu, Na
LEARNING VOICE REPRESENTATION USING KNOWLEDGE DISTILLATION FOR AUTOMATIC VOICE CASTING 2020 Gresse, Adrien
NONLINEAR ISA WITH AUXILIARY VARIABLES FOR LEARNING SPEECH REPRESENTATIONS 2020 Setlur, Amrith
WG-WAVENET: REAL-TIME HIGH-FIDELITY SPEECH SYNTHESIS WITHOUT GPU 2020 Hsu, Po-Chun
SPEAKER CONDITIONAL WAVERNN: TOWARDS UNIVERSAL NEURAL VOCODER FOR UNSEEN SPEAKER AND RECORDING CONDITIONS 2020 Paul, Dipjyoti
DATA AUGMENTATION USING PROSODY AND FALSE STARTS TO RECOGNIZE NON-NATIVE CHILDREN'S SPEECH 2020 Kathania, Hemant
NON-NATIVE CHILDREN'S AUTOMATIC SPEECH RECOGNITION: THE INTERSPEECH 2020 SHARED TASK ALTA SYSTEM 2020 Knill, Kate M.
SPOT THE CONVERSATION: SPEAKER DIARISATION IN THE WILD 2020 Chung, Joon Son
SIMULATING REALISTICALLY-SPATIALISED SIMULTANEOUS SPEECH USING VIDEO- DRIVEN SPEAKER DETECTION AND THE CHIME-5 DATASET 2020 Deadman, Jack
TOWARD SILENT PARALINGUISTICS: SPEECH-TO-EMG --- RETRIEVING ARTICULATORY MUSCLE ACTIVITY FROM SPEECH 2020 Botelho, Catarina
USING SPEAKER-ALIGNED GRAPH MEMORY BLOCK IN MULTIMODALLY ATTENTIVE EMOTION RECOGNITION NETWORK 2020 Li, Jeng-Lin
CLOVACALL: KOREAN GOAL-ORIENTED DIALOG SPEECH CORPUS FOR AUTOMATIC SPEECH RECOGNITION OF CONTACT CENTERS 2020 Ha, Jung-Woo
DIPCO --- DINNER PARTY CORPUS 2020 Segbroeck, Maarten Van
ON THE USAGE OF MULTI-FEATURE INTEGRATION FOR SPEAKER VERIFICATION AND LANGUAGE IDENTIFICA TION 2020 Li, Zheng
Alle Artikel auflisten