DUAL ATTENTION IN TIME AND FREQUENCY DOMAIN FOR VOICE ACTIVITY DETECTION

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:INTERSPEECH (21. : 2020 : Online) Cognitive intelligence for speech processing ; Volume 6 of 7
1. Verfasser: Lee, Joohvung (VerfasserIn)
Weitere Verfasser: Jung, Youngmoon (VerfasserIn), Kim, Hoirin (VerfasserIn)
Pages:6
Format: UnknownFormat
Sprache:eng
Veröffentlicht: 2020
Schlagworte:
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Titel Jahr Verfasser
MASK CTC: NON-AUTOREGRESSIVE END-TO-END ASR WITH CTC AND MASK PREDICT 2020 Higuchi, Yosuke
CONTEXTUALIZING ASR LATTICE RESCORING WITH HYBRID POINTER NETWORK LANGUAGE MODEL 2020 Liu, Da-Rong
A NOISE ROBUST TECHNIQUE FOR DETECTING VOWELS IN SPEECH SIGNALS 2020 Kumar, Avinash
DISCOVERING ARTICULATORY SPEECH TARGETS FROM SYNTHESIZED RANDOM BABBLE 2020 Rasilo, Heikki
CSL-EMG_ARRAY: AN OPEN ACCESS CORPUS FOR EMG-TO-SPEECH CONVERSION 2020 Diener, Lorenz
JOINTLY FINE-TUNING "BERT-LIKE" SELF SUPERVISED MODELS TO IMPROVE MULTIMODAL SPEECH EMOTION RECOGNITION 2020 Siriwardhana, Shamane
SPEECH-XLNET: UNSUPERVISED ACOUSTIC MODEL PRETRAINING FOR SELF-ATENTION NETWORKS 2020 Song, Xingchen
UNDERSTANDING SELF-ATTENTION OF SELF-SUPERVISED AUDIO TRANSFORMERS 2020 Yang, Shu-Wen
SEQUENCE-LEVEL SELF-LEARNING WITH MULTIPLE HYPOTHESES 2020 Kumatani, Kenichi
A CONVOLUTIONAL DEEP MARKOV MODEL FOR UNSUPERVISED SPEECH REPRESENTATION LEARNING 2020 Khurana, Sameer
ON PARAMETER ADAPTATION IN SOFTMAX-BASED CROSS-ENTROPY LOSS FOR IMPROVED CONVERGENCE SPEED AND ACCURACY IN DNN-BASED SPEAKER RECOGNIZATION 2020 Rybicka, Magdalena
ENSEMBLE APPROACHES FOR UNCERTAINTY IN SPOKEN LANGUAGE ASSESSMENT 2020 Wu, Xixin
PROTOTYPICAL Q NETWORKS FOR AUTOMATIC CONVERSATIONAL DIAGNOSIS AND FEW-SHOT NEW DISEASE ADAPTION 2020 Luo, Hongyin
DISCRIMINATIVE TRANSFER LEARNING FOR OPTIMIZING ASR AND SEMANTIC LABELING IN TASK-ORIENTED SPOKEN DIALOG 2020 Qian, Yao
DATASETS AND BENCHMARKS FOR TASK-ORIENTED LOG DIALOGUE RANKING TASK 2020 Xu, Xinnuo
VIRTUAL ACOUSTIC CHANNEL EXPANSION BASED ON NEURAL NETWORKS FOR WEIGHTED PREDICTION ERROR-BASED SPEECH DEREVERBERATION 2020 Yang, Joon-Young
FROM SPEAKER VERIFICATION TO MULTISPEAKER SPEECH SYNTHESIS, DEEP TRANSFER WITH FEEDBACK CONSTRAINT 2020 Cai, Zexin
NON-AUTOREGRESSIVE END-TO-END TTS WITH COARSE-TO-FINE DECODING 2020 Wang, Tao
NATURALNESS ENHANCEMENT WITH LINGUISTIC INFORMATION IN END-TO-END TTS USING UNSUPERVISED PARALLEL ENCODING 2020 Peiro-Lilja, Alex
END-TO-END TEXT-TO-SPEECH SYNTHESIS WITH UNALIGNED MULTIPLE LANGUAGE UNITS BASED ON ATTENTION 2020 Aso, Masashi
Alle Artikel auflisten