WSRGLOW: A GLOW-BASED WAVEFORM GENERATIVE MODEL FOR AUDIO SUPER-RESOLUTION

Bibliographic Details
Published in: INTERSPEECH (22nd : 2021 : Brno; Online) 22nd Annual Conference of the International Speech Communication Association (INTERSPEECH 2021) ; Volume 4 of 6
Main author: Zhang, Kexun (Author)
Other authors: Ren, Yi (Author), Xu, Changliang (Author), Zhao, Zhou (Author)
Pages: 22
Format: UnknownFormat
Language: English
Published: 2021
Title Year Author
ONLINE SPEAKER DIARIZATION EQUIPPED WITH DISCRIMINATIVE MODELING AND GUIDED INFERENCE 2021 Wan, Xucheng
ONLINE STREAMING END-TO-END NEURAL DIARIZATION HANDLING OVERLAPPING SPEECH AND FLEXIBLE NUMBERS OF SPEAKERS 2021 Xue, Yawen
SEMI-SUPERVISED TRAINING WITH PSEUDO-LABELING FOR END-TO-END NEURAL DIARIZATION 2021 Takashima, Yuki
ECAPA-TDNN EMBEDDINGS FOR SPEAKER DIARIZATION 2021 Dawalatabad, Nauman
TARGET-SPEAKER VOICE ACTIVITY DETECTION WITH IMPROVED I-VECTOR ESTIMATION FOR UNKNOWN NUMBER OF SPEAKER 2021 He, Maokui
LEAP SUBMISSION FOR THE THIRD DIHARD DIARIZATION CHALLENGE 2021 Singh, Prachi
INVESTIGATION OF SPATIAL-ACOUSTIC FEATURES FOR OVERLAPPING SPEECH DETECTION IN MULTIPARTY MEETINGS 2021 Zhang, Shiliang
ROBUST END-TO-END SPEAKER DIARIZATION WITH CONFORMER AND ADDITIVE MARGIN PENALTY 2021 Leung, Tsun-Yat
AUTOMATIC ERROR CORRECTION FOR SPEAKER EMBEDDING LEARNING WITH NOISY LABELS 2021 Tong, Fuchuan
FAIR VOICE BIOMETRICS: IMPACT OF DEMOGRAPHIC IMBALANCE ON GROUP FAIRNESS IN SPEAKER RECOGNITION 2021 Fenu, Gianni
AUDIO RETRIEVAL WITH NATURAL LANGUAGE QUERIES 2021 Oncescu, Andreea-Maria
ORCA-SLANG: AN AUTOMATIC MULTI-STAGE SEMI-SUPERVISED DEEP LEARNING FRAMEWORK FOR LARGE-SCALE KILLER WHALE CALL TYPE IDENTIFICATION 2021 Bergler, Christian
VOICE PRIVACY THROUGH X-VECTOR AND CYCLEGAN-BASED ANONYMIZATION 2021 Prajapati, Gauri P.
A TWO-STAGE APPROACH TO SPEECH BANDWIDTH EXTENSION 2021 Lin, Ju
NU-WAVE: A DIFFUSION PROBABILISTIC MODEL FOR NEURAL AUDIO UPSAMPLING 2021 Lee, Junhyeok
X-NET: A JOINT SCALE DOWN AND SCALE UP METHOD FOR VOICE CALL 2021 Wen, Liang
FUSION-NET: TIME-FREQUENCY INFORMATION FUSION Y-NETWORK FOR SPEECH ENHANCEMENT 2021 Nareddula, Santhan Kumar Reddy
A SPECTRO-TEMPORAL GLIMPSING INDEX (STGI) FOR SPEECH INTELLIGIBILITY PREDICTION 2021 Edraki, Amin
TEMPORAL CONVOLUTIONAL NETWORK WITH FREQUENCY DIMENSION ADAPTIVE ATTENTION FOR SPEECH ENHANCEMENT 2021 Zhang, Qiquan
IMPROVING PERCEPTUAL QUALITY BY PHONE-FORTIFIED PERCEPTUAL LOSS USING WASSERSTEIN DISTANCE FOR SPEECH ENHANCEMENT 2021 Hsieh, Tsun-An