Machine learning for speaker recognition

"In the last ten years, many methods have been developed and deployed for real-world biometric applications and multimedia information systems. Machine learning has been playing a crucial role in these applications where the model parameters could be learned and the system performance could be...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
1. Verfasser: Mak, M. W. (VerfasserIn)
Weitere Verfasser: Chien, Jen-tzung (VerfasserIn)
Format: UnknownFormat
Sprache:eng
Veröffentlicht: Cambridge, New York, NY, Melbourne, New Delhi, Singapore Cambridge University Press 2021
Schlagworte:
Online Zugang:Inhaltsverzeichnis
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:"In the last ten years, many methods have been developed and deployed for real-world biometric applications and multimedia information systems. Machine learning has been playing a crucial role in these applications where the model parameters could be learned and the system performance could be optimized. As for speaker recognition, researchers and engineers have been attempting to tackle the most di cult challenges: noise robustness and domain mismatch. These e orts have now been fruitful, leading to commercial products starting to emerge, e.g., voice authentication for e-banking and speaker identication in smart speakers. Research in speaker recognition has traditionally been focused on signal processing (for extracting the most relevant and robust features) and machine learning (for classifying the features). Recently, we have witnessed the shift in the focus from signal processing to machine learning. In particular, many studies have shown that model adaptation can address both robustness and domain mismatch. As for robust feature extraction, recent studies also demonstrate that deep learning and feature learning can be a great alternative to traditional signal processing algorithms. This book has two perspectives: Machine Learning and Speaker Recognition. The machine learning perspective gives readers insights on what make stateof-the-art systems perform so well. The speaker recognition perspective enables readers to apply machine learning techniques to address practical issues (e.g., robustness under adverse acoustic environments and domain mismatch) when deploying speaker recognition systems. The theories and practices of speaker recognition are tightly connected in the book"--
Beschreibung:Literaturverzeichnis: Seite 289-306
Beschreibung:xviii, 309 Seiten
Illustrationen, Diagramme
ISBN:9781108428125
978-1-108-42812-5
9781108552332