255 Articles

Toward Effective Aircraft Call Sign Detection Using Fuzzy String-Matching between ASR and ADS-B Data

Online access

Mohammed Saïd Kasttet, Abdelouahid Lyhyaoui, Douae Zbakh, Adil Aramja and Abderazzek Kachkari

Recently, artificial intelligence and data science have witnessed dramatic progress and rapid growth, especially Automatic Speech Recognition (ASR) technology based on Hidden Markov Models (HMMs) and Deep Neural Networks (DNNs). Consequently, new end-to-...

Journal: Aerospace Format: Electronic

Table of contents: Vol: 11 Num: 0 Part: 1 Year: 2024

Analyzing Multi-Mode Fatigue Information from Speech and Gaze Data from Air Traffic Controllers

Online access

Lin Xu, Shanxiu Ma, Zhiyuan Shen, Shiyu Huang and Ying Nan

In order to determine the fatigue state of air traffic controllers from air talk, an algorithm is proposed for discriminating the fatigue state of controllers based on applying multi-speech feature fusion to voice data using a Fuzzy Support Vector Machin...

Journal: Aerospace Format: Electronic

Table of contents: Vol: 11 Num: 0 Part: 1 Year: 2024

Understanding Self-Supervised Learning of Speech Representation via Invariance and Redundancy Reduction

Online access

Yusuf Brima, Ulf Krumnack, Simone Pika and Gunther Heidemann

Self-supervised learning (SSL) has emerged as a promising paradigm for learning flexible speech representations from unlabeled data. By designing pretext tasks that exploit statistical regularities, SSL models can capture useful representations that are ...

Journal: Information Format: Electronic

Table of contents: Vol: 15 Num: 0 Part: 2 Year: 2024

Optimizing Speech Emotion Recognition with Deep Learning and Grey Wolf Optimization: A Multi-Dataset Approach

Online access

Suryakant Tyagi and Sándor Szénási

Machine learning and speech emotion recognition are rapidly evolving fields, significantly impacting human-centered computing. Machine learning enables computers to learn from data and make predictions, while speech emotion recognition allows computers t...

Journal: Algorithms Format: Electronic

Table of contents: Vol: 17 Num: 0 Part: 3 Year: 2024

Predicting Individual Well-Being in Teamwork Contexts Based on Speech Features

Online access

Tobias Zeulner, Gerhard Johann Hagerer, Moritz Müller, Ignacio Vazquez and Peter A. Gloor

Current methods for assessing individual well-being in team collaboration at the workplace often rely on manually collected surveys. This limits continuous real-world data collection and proactive measures to improve team member workplace satisfaction. W...

Journal: Information Format: Electronic

Table of contents: Vol: 15 Num: 0 Part: 4 Year: 2024

Customization of the ASR System for ATC Speech with Improved Fusion

Online access

Jiahao Fan and Weijun Pan

In recent years, automatic speech recognition (ASR) technology has improved significantly. However, the training process for an ASR model is complex, involving large amounts of data and a large number of algorithms. The task of training a new model for a...

Journal: Aerospace Format: Electronic

Table of contents: Vol: 11 Num: 0 Part: 3 Year: 2024

A Light-Weight Autoregressive CNN-Based Frame Level Transducer Decoder for End-to-End ASR

Online access

Hyeon-Kyu Noh and Hong-June Park

A convolutional neural network (CNN) transducer decoder was proposed to reduce the decoding time of an end-to-end automatic speech recognition (ASR) system while maintaining accuracy. The CNN of 177 k parameters and a kernel size of 6 generates the proba...

Journal: Applied Sciences Format: Electronic

Table of contents: Vol: 14 Num: 0 Part: 3 Year: 2024

Acoustic Characteristics of Greek Vowels Produced by Adult Heritage Speakers of Albanian

Online access

Georgios P. Georgiou and Aretousa Giannakou

Investigating heritage language (HL)-contact effects on the dominant language has received limited attention despite its importance in understanding the dynamic interplay between linguistic systems in situations of bilingualism. This study compares the a...

Journal: Acoustics Format: Electronic

Table of contents: Vol: 6 Num: 0 Part: 1 Year: 2024

A Systematic Evaluation of Recurrent Neural Network Models for Edge Intelligence and Human Activity Recognition Applications

Online access

Varsha S. Lalapura, Veerender Reddy Bhimavarapu, J. Amudha and Hariram Selvamurugan Satheesh

The Recurrent Neural Networks (RNNs) are an essential class of supervised learning algorithms. Complex tasks like speech recognition, machine translation, sentiment classification, weather prediction, etc., are now performed by well-trained RNNs. Local o...

Journal: Algorithms Format: Electronic

Table of contents: Vol: 17 Num: 0 Part: 3 Year: 2024

Dementia Detection from Speech: What If Language Models Are Not the Answer?

Online access

Mondher Bouazizi, Chuheng Zheng, Siyuan Yang and Tomoaki Ohtsuki

A growing focus among scientists has been on researching the techniques of automatic detection of dementia that can be applied to the speech samples of individuals with dementia. Leveraging the rapid advancements in Deep Learning (DL) and Natural Languag...

Journal: Information Format: Electronic

Table of contents: Vol: 15 Num: 0 Part: 1 Year: 2024
