APNet2: High-Quality and High-Efficiency Neural Vocoder with Direct Prediction of Amplitude and Phase Spectra

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:National Conference on Man-Machine Speech Communication (18. : 2023 : Suzhou) Man-machine speech communication
1. Verfasser: Du, Hui-Peng (VerfasserIn)
Weitere Verfasser: Lu, Ye-Xin (VerfasserIn), Ai, Yang (VerfasserIn), Ling, Zhen-Hua (VerfasserIn)
Format: UnknownFormat
Sprache:eng
Veröffentlicht: 2024
Schlagworte:
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Titel Jahr Verfasser
A Lightweight Music Source Separation Model with Graph Convolution Network 2024 Zhu, Mengying
APNet2: High-Quality and High-Efficiency Neural Vocoder with Direct Prediction of Amplitude and Phase Spectra 2024 Du, Hui-Peng
A Study on Domain Adaptation for Audio-Visual Speech Enhancement 2024 Wang, Chenxi
Semi-End-to-End Nested Named Entity Recognition from Speech 2024 Zhang, Min
Within- and Between-Class Sample Interpolation Based Supervised Metric Learning for Speaker Verification 2024 Zhang, Jian-Tao
End-to-End Streaming Customizable Keyword Spotting Based on Text-Adaptive Neural Search 2024 Yang, Baochen
The Production of Successive Addition Boundary Tone in Mandarin Preschoolers 2024 Li, Aijun
Emotional Support Dialog System Through Recursive Interactions Among Large Language Models 2024 Chen, Keqi
Accent-VITS: Accent Transfer for End-to-End TTS 2024 Ma, Linhan
Adaptive Deep Graph Convolutional Network for Dialogical Speech Emotion Recognition 2024 Liu, Jiaxing
Joint Training or Not: An Exploration of Pre-trained Speech Models in Audio-Visual Speaker Diarization 2024 Zhao, Huan
Zero-Shot Singing Voice Conversion Based on Timbre Space Modeling and Excitation Signal Control 2024 Jiang, Yuan
The DKU-MSXF Diarization System for the VoxCeleb Speaker Recognition Challenge 2023 2024 Cheng, Ming
Chinese EFL Learners’ Auditory and Visustion Intonations: The Effect of Lexical Stress 2024 Xu, Qiunan
Leveraging Synthetic Speech for CIF-Based Customized Keyword Spotting 2024 Liu, Shuiyun
Data Augmentation by Finite Element Analysis for Enhanced Machine Anomalous Sound Detection 2024 Zhang, Zhixian
Joint Speech and Noise Estimation Using SNR-Adaptive Target Learning for Deep-Learning-Based Speech Enhancement 2024 Li, Xiaoran
Improving Speech Perceptual Quality and Intelligibility Through Sub-band Temporal Envelope Characteristics 2024 Wu, Ruilin
Multi-branch Network with Cross-Domain Feature Fusion for Anomalous Sound Detection 2024 Fang, Wenjie
CAM-GUI: A Conversational Assistant on Mobile GUI 2024 Zhu, Zichen
Alle Artikel auflisten