IMPROVING MULTI-SCALE AGGREGATION USING FEATURE PYRAMID MODULE FOR ROBUST SPEAKER VERIFICATION OF VARIABLE-DURATION UTTERANCES
|
2020 |
Jung, Youngmoon |
SUM-PRODUCT NETWORKS FOR ROBUST AUTOMATIC SPEAKER IDENTIFICATION
|
2020 |
Nicolson, Aaron |
SPEAKER RE-IDENTIFICATION WITH SPEAKER DEPENDENT SPEECH ENHANCEMENT
|
2020 |
Shi, Yanpei |
SIAMESE X-VECTOR RECONSTRUCTION FOR DOMAIN ADAPTED SPEAKER RECOGNITION
|
2020 |
Rozenberg, Shai |
MODELING ASR AMBIGUITY FOR NEURAL DIALOGUE STATE TRACKING
|
2020 |
Pal, Vaishali |
STYLE ATTUNED PRE-TRAINING AND PARAMETER EFFICIENT FINE-TUNING FOR SPOKEN LANGUAGE UNDERSTANDING
|
2020 |
Cao, Jin |
DEEP F-MEASURE MAXIMIZATION FOR END-TO-END SPEECH UNDERSTANDING
|
2020 |
Sari, Leda |
CATEGORIZATION OF WHISTLED CONSONANTS BY FRENCH SPEAKERS ?
|
2020 |
Ngoc, Anais Tran |
MANDARIN AND ENGLISH ADULTS' CUE-WEIGHTING OF LEXICAL STRESS
|
2020 |
Zeng, Zhen |
IDENTIFYING IMPORTANT TIME-FREQUENCY LOCATIONS IN CONTINUOUS SPEECH UTTERANCES
|
2020 |
Kavaki, Hassan Salami |
LIGHTWEIGHT END-TO-END SPEECH RECOGNITION FROM RAW AUDIO DATA USING SINC-CONVOLUTION
|
2020 |
Kurzinger, Ludwig |
AN ALTERNATIVE TO MFCCS FOR ASR
|
2020 |
Ghahramani, Pegah |
DESIGN CHOICES FOR X-VECTOR BASED SPEAKER ANONYMIZATION
|
2020 |
Srivastava, Brij Mohan Lal |
PERCEPTION OF CONCATENATIVE VS. NEURAL TEXT-TO-SPEECH (TTS): DIFFERENCES IN INTELLIGIBILITY IN NOISE AND LANGUAGE ATTITUDES
|
2020 |
Cohn, Michelle |
ENHANCING SEQUENCE-TO-SEQUENCE TEXT-TO-SPEECH WITH MORPHOLOGY
|
2020 |
Taylor, Jason |
UNDERSTANDING THE EFFECT OF VOICE QUALITY AND ACCENT ON TALKER SIMILARITY
|
2020 |
Das, Anurag |
HIERARCHICAL MULTI-STAGE WORD-TO-GRAPHEME NAMED ENTITY CORRECTOR FOR AUTOMATIC SPEECH RECOGNITION
|
2020 |
Garg, Abhinav |
COMBINATION OF END-TO-END AND HYBRID MODELS FOR SPEECH RECOGNITION
|
2020 |
Wong, Jeremy H. M. |
LVCSR WITH TRANSFORMER LANGUAGE MODBELS
|
2020 |
Beck, Eugen |
UNCERTAINTY-AWARE MACHINE SUPPORT FOR PAPER REVIEWING ON THE INTERSPEECH 2019 SUBMISSION CORPUS
|
2020 |
Stappen, Lukas |