Robust automatic speech recognition and modeling of auditory discrimination experiments with auditory spectro-temporal features

Dissertation, Carl von Ossietzky Universität Oldenburg, 2015

Gespeichert in:

Bibliographische Detailangaben
1. Verfasser:	Schädler, Marc René (VerfasserIn)
Körperschaft:	Carl von Ossietzky Universität Oldenburg (Grad-verleihende Institution)
Weitere Verfasser:	Kollmeier, Birger (AkademischeR BetreuerIn), Hermansky, Hynek (AkademischeR BetreuerIn)
Format:	UnknownFormat
Sprache:	eng
Veröffentlicht:	Oldenburg BIS-Verlag der Carl von Ossietzky Universität Oldenburg 2016
Schlagworte:	Automatic speech recognition > Hochschulschrift > Automatische Spracherkennung > Sprachsignal > Sprachverarbeitung
Online Zugang:	Inhaltsverzeichnis
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

Beschreibung
Zusammenfassung:	Dissertation, Carl von Ossietzky Universität Oldenburg, 2015 Automatic speech recognition (ASR) systems still do not perform as well as human listeners under realistic conditions. The unmatched ability of humans to understand speech in most difficult acoustic conditions originates from the superior properties of their auditory system. The aim of this thesis is to improve the recognition performance of ASR systems in difficult acoustic conditions by carefully integrating auditory signal processing strategies. To this end, the physiologically inspired extraction of spectro-temporal modulation patterns was successfully integrated into the front-end of a standard ASR system. Further, the joint spectro-temporal processing could be separated into independent temporal and spectral processes. To investigate the reason for the remaining "man-machine-gap" in recognition performance, a range of critical auditory discrimination tasks were performed using ASR systems. The comparison with empirical data showed that the separate spectro-temporal modulation front-end provides a suitable auditory model and revealed the importance of across-frequency processing in speech recognition. <engl.>
Beschreibung:	ix, 176 Seiten Illustrationen, Diagramme
ISBN:	9783814223339 978-3-8142-2333-9