Online Speaker Diarization Using Optimized SE-ResNet Architecture

Gespeichert in:

Bibliographische Detailangaben
Veröffentlicht in:	TSD (26. : 2023 : Pilsen) Text, speech, and dialogue
1. Verfasser:	Kynych, Frantisek (VerfasserIn)
Weitere Verfasser:	Zdansky, Jindrich (VerfasserIn), Cerva, Petr (VerfasserIn), Mateju, Lukas (VerfasserIn)
Format:	UnknownFormat
Sprache:	eng
Veröffentlicht:	2023
Schlagworte:
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

Bestellen

Titel	Jahr	Verfasser
Resolving Hungarian Anaphora with ChatGPT	2023	Vadász, Noémi
Advancing Hungarian Text Processing with HuSpaCy: Efficient and Accurate NLP Pipelines	2023	Orosz, György
Impact of Including Pathological Speech in Pre-training on Pathology Detection	2023	Weise, Tobias
Online Speaker Diarization Using Optimized SE-ResNet Architecture	2023	Kynych, Frantisek
VITS: Quality Vs. Speed Analysis	2023	Matoušek, Jindřich
Voice Cloning for Voice Disorders: Impact of Phonetic Content	2023	Wadoux, Lily
Towards End-to-End Speech-to-Text Summarization	2023	Monteiro, Raul
Automatic Pronunciation Assessment of Non-native English Based on Phonological Analysis	2023	Rios-Urrego, C. D.
Japanese How-to Tip Machine Reading Comprehension by Multi-task Learning Based on Generative Model	2023	Wang, Xiaotian
The Unbearable Lightness of Morph Classification	2023	John, Vojtěch
Mono- and Multilingual GPT-3 Models for Hungarian	2023	Yang, Zijian Győző
HATS: An Open Data Set Integrating Human Perception Applied to the Evaluation of Automatic Speech Recognition Metrics	2023	Bañeras-Roux, Thibault
Developing State-of-the-Art End-to-End ASR for Norwegian	2023	Nouza, Jan
Unsupervised Learning for Automatic Speech Recognition in Air Traffic Control Environment	2023	Formoe, Lars
Transfer Learning of Transformer-Based Speech Recognition Models from Czech to Slovak	2023	Lehečka, Jan
Language Generalization Using Active Learning in the Context of Parkinson’s Disease Classification	2023	Moreno-Acevedo, S. A.
ParaDiom – A Parallel Corpus of Idiomatic Texts	2023	Donaj, Gregor
Morphological Tagging and Lemmatization of Spoken Corpora of Czech	2023	Jelínek, Tomáš
When Whisper Meets TTS: Domain Adaptation Using only Synthetic Speech Data	2023	Vásquez-Correa, Juan Camilo
An Online Diarization Approach for Streaming Applications Based on Tree-Clustering and Bayesian Resegmentation	2023	Martín-Doñas, Juan M.

Alle Artikel auflisten