Speech and computer 24th International Conference, SPECOM 2022, Gurugram, India, November 14-16, 2022, proceedings

Thematic Diversity of Everyday Russian Discourse: a Case Study Based on the ORD corpus.- Neural Embedding Extractors for Text-Independent Speaker Verification.- Deep Speaker Embeddings based Online Diarization.- Overlapped Speech Detection Using AM-FM based Time-Frequency Representations.- Significa...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Körperschaft:	SPECOM (VerfasserIn)
Weitere Verfasser:	Agrawal, Shyam S. (HerausgeberIn), Karpov, Aleksej (HerausgeberIn), Prasanna, S. R. Mahadeva (HerausgeberIn), Samudravijaya, K. (HerausgeberIn)
Format:	UnknownFormat
Sprache:	eng
Veröffentlicht:	Cham Springer 2022
Schriftenreihe:	Lecture notes in computer science 13721. Lecture notes in artificial intelligence
Schlagworte:	Angewandte Informatik > Artificial intelligence > Bildverarbeitung > COMPUTERS / Artificial Intelligence > COMPUTERS / Computer Vision & Pattern Recognition > COMPUTERS / Data Processing / General > COMPUTERS / Online Services / General > Computer networking & communications > Computer vision > Computerhardware > Information technology: general issues > Künstliche Intelligenz > Konferenzschrift > Computerlinguistik > Zeichensprache > Multimodales System
Online Zugang:	Cover Inhaltsverzeichnis
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

Bestellen

Beschreibung
Zusammenfassung:	Thematic Diversity of Everyday Russian Discourse: a Case Study Based on the ORD corpus.- Neural Embedding Extractors for Text-Independent Speaker Verification.- Deep Speaker Embeddings based Online Diarization.- Overlapped Speech Detection Using AM-FM based Time-Frequency Representations.- Significance of Dimensionality Reduction in CNN-based Vowel Classification from Imagined Speech using Electroencephalogram Signals.- Study of Speech Recognition System Based on Transformer and Connectionist Temporal Classification Models for Low Resource Language.- An Initial Study on Birdsong Re-synthesis using Neural Vocoders.- Speech Music Overlap Detection using Spectral Peak Evolutions.- Influence of Accented Speech in Automatic Speech Recognition: A Case Study on Assamese L1 Speakers Speaking Code Switched Hindi-English.- ClusterVote: Automatic Summarization Dataset Construction with Document Clusters.- Comparing Unsupervised Detection Algorithms for Audio Adversarial Examples.- Celtic English Continuum in Pitch Patterns of Spontane-ous Talk: Evidence of Long-Term Contacts .- Coherence Based Automatic Essay Scoring Using Sentence Embedding and Recurrent Neural Networks.- Analysis of Automatic Evaluation Metric on Low-Resourced Language: BERTScore Vs BLEU Score.- DyCoDa: A Multi-Modal Data Collection of Multi-User Remote Survival Game Recordings.- On the Use of Ensemble X-Vector Embeddings for Improved Sleepiness Detection.- Multiresolution Decomposition Analysis via Wavelet Transforms for Audio Deepfake Detection .- Automatic Rhythm and Speech Rate Analysis of Mising Spontaneous Speech.- An Electroglottographic Method for Assessing the Emotional State of the Speaker.- Significance of Distance on Pop Noise for Voice Liveness Detection .- CRIM's Speech Recognition System for OpenASR21 Evaluation with Conformer and Voice Activity Detector Embeddings.- Joint Changes in First and Second Formants of /a/, /i/, /u/ Vowels in Babble Noise - a New Statistical Approach.- Comparing NLP Solutions for the Disambiguation of French Heterophonic Homographs for End-to-End TTS Systems.- Detection of Speech Related Disorders by Pre-Trained Embedding Models Extracted Biomarkers.- Multi-Label Dysfluency Classification.- Harnessing Uncertainty - Multi-Label Dysfluency Classification with Uncertain Labels.- Continuous Wavelet Transform for Severity-Level Classification of Dysarthria.- Significance of Energy Features for Severity Classification of Dysarthria.- Sailor and Hemant A. Patil An Analytic Study on Clustering-based Pseudo-Labels for Self-Supervised Deep Speaker Verification.- Investigation of Transfer Learning for End-to-End Russian Speech Recognition.- Prosodic Features of Verbal Irony in Russian and French: Universal vs. Language-Specific.- Categorization of Threatening Speech Acts.- Assessment of Speech Quality During Speech Rehabilitation Based on the Solution of the Classification Problem.- Multi-level Fusion of Fisher Vector Encoded BERT and wav2vec 2.0 Embeddings for Native Language Identification.- Fake Speech Detection using OpenSMILE Features.- Nonverbal Constituents of Argumentative Discourse: Gesture and Prosody Interaction.- Classifying Mahout and Social Interactions of Asian Elephants based on Trumpet Calls.- Recognition of the Emotional State of Children with Down Syndrome by Video, Audio and Text Modalities: Human and Automatic.- Fake Speech Detection using Modulation Spectrogram.- Self-Configuring Genetic Programming Feature Generation in Affec This book constitutes the proceedings of the 24th International Conference on Speech and Computer, SPECOM 2022, held as a hybrid event in Gurugram, India, in November 2021.The 51 full and 9 short papers presented in this volume were carefully reviewed and selected from 99 submissions. The papers present current research in the area of computer speech processing including audio signal processing, automatic speech recognition, speaker recognition, computational paralinguistics, speech synthesis, sign language and multimodal processing, and speech and language resources
Beschreibung:	Konferenz wurde laut Vorwort in hybridem Format, online und vor Ort, abgehalten Interessenniveau: 06, Professional and scholarly: For an expert adult audience, including academic research. (06)
Beschreibung:	xvi, 720 Seiten Illustrationen
ISBN:	9783031209796 978-3-031-20979-6