Multi-branch Network with Cross-Domain Feature Fusion for Anomalous Sound Detection

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:National Conference on Man-Machine Speech Communication (18. : 2023 : Suzhou) Man-machine speech communication
1. Verfasser: Fang, Wenjie (VerfasserIn)
Weitere Verfasser: Fan, Xin (VerfasserIn), Hu, Ying (VerfasserIn)
Format: UnknownFormat
Sprache:eng
Veröffentlicht: 2024
Schlagworte:
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Titel Jahr Verfasser
Ultra-Low Complexity Residue Echo and Noise Suppression Based on Recurrent Neural Network 2024 Zhou, Jianquan
A Framework Combining Separate and Joint Training for Neural Vocoder-Based Monaural Speech Enhancement 2024 Pan, Qiaoyi
Iterative Noisy-Target Approach: Speech Enhancement Without Clean Speech 2024 Zhang, Yifan
A Comparative Study of Pre-trained Audio and Speech Models for Heart Sound Detection 2024 Duan, Yuxin
A Pilot Study on the Prosodic Factors Influencing Voice Attractiveness of AI Speech 2024 Wang, Yihui
Joint Time-Domain and Frequency-Domain Progressive Learning for Single-Channel Speech Enhancement and Recognition 2024 Zou, Gongzhen
A Fast Sampling Method in Diffusion-Based Dance Generation Models 2024 Guo, Puyuan
Task-Adaptive Generative Adversarial Network Based Speech Dereverberation for Robust Speech Recognition 2024 Liu, Ji
Real-Time Automotive Engine Sound Simulation with Deep Neural Network 2024 Li, Hao
A Packet Loss Concealment Method Based on the Demucs Network Structure 2024 Li, Wenwen
A Lightweight Music Source Separation Model with Graph Convolution Network 2024 Zhu, Mengying
APNet2: High-Quality and High-Efficiency Neural Vocoder with Direct Prediction of Amplitude and Phase Spectra 2024 Du, Hui-Peng
A Study on Domain Adaptation for Audio-Visual Speech Enhancement 2024 Wang, Chenxi
Semi-End-to-End Nested Named Entity Recognition from Speech 2024 Zhang, Min
Within- and Between-Class Sample Interpolation Based Supervised Metric Learning for Speaker Verification 2024 Zhang, Jian-Tao
End-to-End Streaming Customizable Keyword Spotting Based on Text-Adaptive Neural Search 2024 Yang, Baochen
The Production of Successive Addition Boundary Tone in Mandarin Preschoolers 2024 Li, Aijun
Emotional Support Dialog System Through Recursive Interactions Among Large Language Models 2024 Chen, Keqi
Accent-VITS: Accent Transfer for End-to-End TTS 2024 Ma, Linhan
Adaptive Deep Graph Convolutional Network for Dialogical Speech Emotion Recognition 2024 Liu, Jiaxing
Alle Artikel auflisten