APNet2: High-Quality and High-Efficiency Neural Vocoder with Direct Prediction of Amplitude and Phase Spectra

Bibliographic Details
Published in:	National Conference on Man-Machine Speech Communication (18th : 2023 : Suzhou) Man-machine speech communication
Main author:	Du, Hui-Peng (Author)
Other authors:	Lu, Ye-Xin (Author), Ai, Yang (Author), Ling, Zhen-Hua (Author)
Format:	UnknownFormat
Language:	English
Published:	2024

Title	Year	Author
A Lightweight Music Source Separation Model with Graph Convolution Network	2024	Zhu, Mengying
APNet2: High-Quality and High-Efficiency Neural Vocoder with Direct Prediction of Amplitude and Phase Spectra	2024	Du, Hui-Peng
A Study on Domain Adaptation for Audio-Visual Speech Enhancement	2024	Wang, Chenxi
Semi-End-to-End Nested Named Entity Recognition from Speech	2024	Zhang, Min
Within- and Between-Class Sample Interpolation Based Supervised Metric Learning for Speaker Verification	2024	Zhang, Jian-Tao
End-to-End Streaming Customizable Keyword Spotting Based on Text-Adaptive Neural Search	2024	Yang, Baochen
The Production of Successive Addition Boundary Tone in Mandarin Preschoolers	2024	Li, Aijun
Emotional Support Dialog System Through Recursive Interactions Among Large Language Models	2024	Chen, Keqi
Accent-VITS: Accent Transfer for End-to-End TTS	2024	Ma, Linhan
Adaptive Deep Graph Convolutional Network for Dialogical Speech Emotion Recognition	2024	Liu, Jiaxing
Joint Training or Not: An Exploration of Pre-trained Speech Models in Audio-Visual Speaker Diarization	2024	Zhao, Huan
Zero-Shot Singing Voice Conversion Based on Timbre Space Modeling and Excitation Signal Control	2024	Jiang, Yuan
The DKU-MSXF Diarization System for the VoxCeleb Speaker Recognition Challenge 2023	2024	Cheng, Ming
Chinese EFL Learners’ Auditory and Visual Perception of English Statement and Question Intonations: The Effect of Lexical Stress	2024	Xu, Qiunan
Leveraging Synthetic Speech for CIF-Based Customized Keyword Spotting	2024	Liu, Shuiyun
Data Augmentation by Finite Element Analysis for Enhanced Machine Anomalous Sound Detection	2024	Zhang, Zhixian
Joint Speech and Noise Estimation Using SNR-Adaptive Target Learning for Deep-Learning-Based Speech Enhancement	2024	Li, Xiaoran
Improving Speech Perceptual Quality and Intelligibility Through Sub-band Temporal Envelope Characteristics	2024	Wu, Ruilin
Multi-branch Network with Cross-Domain Feature Fusion for Anomalous Sound Detection	2024	Fang, Wenjie
CAM-GUI: A Conversational Assistant on Mobile GUI	2024	Zhu, Zichen