Online Speaker Diarization Using Optimized SE-ResNet Architecture

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:TSD (26. : 2023 : Pilsen) Text, speech, and dialogue
1. Verfasser: Kynych, Frantisek (VerfasserIn)
Weitere Verfasser: Zdansky, Jindrich (VerfasserIn), Cerva, Petr (VerfasserIn), Mateju, Lukas (VerfasserIn)
Format: UnknownFormat
Sprache:eng
Veröffentlicht: 2023
Schlagworte:
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Titel Jahr Verfasser
Resolving Hungarian Anaphora with ChatGPT 2023 Vadász, Noémi
Advancing Hungarian Text Processing with HuSpaCy: Efficient and Accurate NLP Pipelines 2023 Orosz, György
Impact of Including Pathological Speech in Pre-training on Pathology Detection 2023 Weise, Tobias
Online Speaker Diarization Using Optimized SE-ResNet Architecture 2023 Kynych, Frantisek
VITS: Quality Vs. Speed Analysis 2023 Matoušek, Jindřich
Voice Cloning for Voice Disorders: Impact of Phonetic Content 2023 Wadoux, Lily
Towards End-to-End Speech-to-Text Summarization 2023 Monteiro, Raul
Automatic Pronunciation Assessment of Non-native English Based on Phonological Analysis 2023 Rios-Urrego, C. D.
Japanese How-to Tip Machine Reading Comprehension by Multi-task Learning Based on Generative Model 2023 Wang, Xiaotian
The Unbearable Lightness of Morph Classification 2023 John, Vojtěch
Mono- and Multilingual GPT-3 Models for Hungarian 2023 Yang, Zijian Győző
HATS: An Open Data Set Integrating Human Perception Applied to the Evaluation of Automatic Speech Recognition Metrics 2023 Bañeras-Roux, Thibault
Developing State-of-the-Art End-to-End ASR for Norwegian 2023 Nouza, Jan
Unsupervised Learning for Automatic Speech Recognition in Air Traffic Control Environment 2023 Formoe, Lars
Transfer Learning of Transformer-Based Speech Recognition Models from Czech to Slovak 2023 Lehečka, Jan
Language Generalization Using Active Learning in the Context of Parkinson’s Disease Classification 2023 Moreno-Acevedo, S. A.
ParaDiom – A Parallel Corpus of Idiomatic Texts 2023 Donaj, Gregor
Morphological Tagging and Lemmatization of Spoken Corpora of Czech 2023 Jelínek, Tomáš
When Whisper Meets TTS: Domain Adaptation Using only Synthetic Speech Data 2023 Vásquez-Correa, Juan Camilo
An Online Diarization Approach for Streaming Applications Based on Tree-Clustering and Bayesian Resegmentation 2023 Martín-Doñas, Juan M.
Alle Artikel auflisten