AUDIO-VISUAL MULTI-SPEAKER TRACKING BASED ON THE GLMB FRAMEWORK

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:INTERSPEECH (21. : 2020 : Online) Cognitive intelligence for speech processing ; Volume 5 of 7
1. Verfasser: Lin, Shoufeng (VerfasserIn)
Weitere Verfasser: Qian, Xinyuan (VerfasserIn)
Pages:5
Format: UnknownFormat
Sprache:eng
Veröffentlicht: 2020
Schlagworte:
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Titel Jahr Verfasser
ATTENTION AND ENCODER-DECODER BASED MODELS FOR TRANSFORMING ARTICULATORY MOVEMENTS AT DIFFERENT SPEAKING RATES 2020 Singh, Abhavjeet
TOWARDS NATURAL BILINGUAL AND CODE-SWITCHED SPEECH SYNTHESIS BASED ON MIX OF MONOLINGUAL RECORDINGS AND CROSS-LINGUAL VOICE 2020 Zhao, Shengkui
CROSS-LINGUAL TEXT-TO-SPEECH SYNTHESIS VIA DOMAIN ADAPTATION AND PERCEPTUAL SIMILARITY REGRESSION IN SPEAKER SPACE 2020 Xin, Detai
TONE LEARNING IN LOW-RESOURCE BILINGUAL TTS 2020 Liu, Ruolan
GENERIC INDIC TEXT-TO-SPEECH SYNTHESISERS WITH RAPID ADAPTATION IN AN END-TO-END FRAMEWORK 2020 Prakash, Anusha
ADVERSARIAL DOMAIN ADAPTATION FOR SPEAKER VERIFICATION USING PARTIALLY SHARED NETWORK 2020 Chen, Zhengyang
AUTOMATIC DETECTION OF ACCENT AND LEXICAL PRONUNCIATION ERRORS IN SPONTANEOUS NON-NATIVE ENGLISH SPEECH 2020 Knill, Konstantinos Kyriakopoulos. Kate M.
IDENTIFY SPEAKERS IN COCKTAIL PARTIES WITH END-TO-END ATTENTION 2020 Zhu, Junzhe
TOWARDS SILENT PARALINGUISTICS: DERIVING SPEAKING MODE AND SPEAKER ID FROM ELCTROMYOGRAPHIC SIGNALS 2020 Diener, Lorenz
UNSUPERVISED LEARNING FOR SEQUENCE-TO-SEQUENCE TEXT-TO-SPEECH FOR LOW-RESOURCE LANGUAGES 2020 Zhang, Haitong
CONDITIONAL SPOKEN DIGIT GENERATION WITH STYLEGAN 2020 Palkama, Kasperi
INCREMENTAL TEXT TO SPEECH FOR NEURAL SEQUENCE-TO-SEQUENCE MODELS USING REINFORCEMENT LEARNING 2020 Mohan, Devang S. Ram
WAV2SPK: A SIMPLE DNN ARCHITECTURE FOR LEARNING SPEAKER EMBEDDINGS FROM WAVEFORMS 2020 Lin, Weiwei
A COMPARATIVE RE-ASSESSMENT OF FEATURE EXTRACTORS FOR DEEP SPEAKER EMBEDDINGS 2020 Liu, Xuechen
COSINE-DISTANCE VIRTUAL ADVERSARIAL TRAINING FOR SEMI-SUPERVISED SPEAKER-DISCRIMINATIVE ACOUSTIC EMBEDDINGS 2020 Kreyssig, Florian L.
LEARNING SPEAKER EMBEDDING FROM TEXT-TO-SPEECH 2020 Cho, Jaejin
MATCHBOXNET: 1D TIME-CHANNEL SEPARABLE CONVOLUTIONAL NEURAL NETWORK ARCHITECTURE FOR SPEECH COMMANDS RECOGNITION .................. FOR AUTOMATIC SPEECH RECOGNITION .............. sese IH aee 0 7B 2020 Majumdar, Somshubra
LISTEN ATTENTIVELY, AND SPELL ONCE: WHOLE SENTENCE GENERATION VIA A NON-AUTOREGRESSIVE ARCHITECTURE FOR LOW-LATENCY SPEECH 2020 Bai, Ye
LAUGHTER SYNTHESIS: COMBINING SEQ2SEQ MODELING WITH TRANSFER LEARNING 2020 Tits, Noe
DEEP EMBEDDING LEARNING FOR TEXT-DEPENDENT SPEAKER VERIFICATION 2020 Zhang, Peng
Alle Artikel auflisten