58 Artículos

Toward Effective Aircraft Call Sign Detection Using Fuzzy String-Matching between ASR and ADS-B Data

Acceso

en línea

Mohammed Saïd Kasttet, Abdelouahid Lyhyaoui, Douae Zbakh, Adil Aramja and Abderazzek Kachkari

Recently, artificial intelligence and data science have witnessed dramatic progress and rapid growth, especially Automatic Speech Recognition (ASR) technology based on Hidden Markov Models (HMMs) and Deep Neural Networks (DNNs). Consequently, new end-to-... ver más

Revista: Aerospace Formato: Electrónico

Tabla de contenido: Vol: 11 Num: 0 Par: 1 Año: 2024

Customization of the ASR System for ATC Speech with Improved Fusion

Acceso

en línea

Jiahao Fan and Weijun Pan

In recent years, automatic speech recognition (ASR) technology has improved significantly. However, the training process for an ASR model is complex, involving large amounts of data and a large number of algorithms. The task of training a new model for a... ver más

Revista: Aerospace Formato: Electrónico

Tabla de contenido: Vol: 11 Num: 0 Par: 3 Año: 2024

Dementia Detection from Speech: What If Language Models Are Not the Answer?

Acceso

en línea

Mondher Bouazizi, Chuheng Zheng, Siyuan Yang and Tomoaki Ohtsuki

A growing focus among scientists has been on researching the techniques of automatic detection of dementia that can be applied to the speech samples of individuals with dementia. Leveraging the rapid advancements in Deep Learning (DL) and Natural Languag... ver más

Revista: Information Formato: Electrónico

Tabla de contenido: Vol: 15 Num: 0 Par: 1 Año: 2024

Multilingual Speech Recognition for Turkic Languages

Acceso

en línea

Saida Mussakhojayeva, Kaisar Dauletbek, Rustem Yeshpanov and Huseyin Atakan Varol

The primary aim of this study was to contribute to the development of multilingual automatic speech recognition for lower-resourced Turkic languages. Ten languages?Azerbaijani, Bashkir, Chuvash, Kazakh, Kyrgyz, Sakha, Tatar, Turkish, Uyghur, and Uzbek?we... ver más

Revista: Information Formato: Electrónico

Tabla de contenido: Vol: 14 Num: 0 Par: 2 Año: 2023

The Development of a Kazakh Speech Recognition Model Using a Convolutional Neural Network with Fixed Character Level Filters

Acceso

en línea

Nurgali Kadyrbek, Madina Mansurova, Adai Shomanov and Gaukhar Makharova

This study is devoted to the transcription of human speech in the Kazakh language in dynamically changing conditions. It discusses key aspects related to the phonetic structure of the Kazakh language, technical considerations in collecting the transcribe... ver más

Revista: Big Data and Cognitive Computing Formato: Electrónico

Tabla de contenido: Vol: 7 Num: 0 Par: 3 Año: 2023

Semi-Supervised Learning for Robust Emotional Speech Synthesis with Limited Data

Acceso

en línea

Jialin Zhang, Mairidan Wushouer, Gulanbaier Tuerhong and Hanfang Wang

Emotional speech synthesis is an important branch of human?computer interaction technology that aims to generate emotionally expressive and comprehensible speech based on the input text. With the rapid development of speech synthesis technology based on ... ver más

Revista: Applied Sciences Formato: Electrónico

Tabla de contenido: Vol: 13 Num: 0 Par: 9 Año: 2023

High-Quality Data from Crowdsourcing towards the Creation of a Mexican Anti-Immigrant Speech Corpus

Acceso

en línea

Alejandro Molina-Villegas, Thomas Cattin, Karina Gazca-Hernandez and Edwin Aldana-Bobadilla

Currently, a significant portion of published research on online hate speech relies on existing textual corpora. However, when examining a specific context, there is a lack of preexisting datasets that include the particularities associated with various ... ver más

Revista: Applied Sciences Formato: Electrónico

Tabla de contenido: Vol: 13 Num: 0 Par: 14 Año: 2023

An Automatic Speaker Clustering Pipeline for the Air Traffic Communication Domain

Acceso

en línea

Driss Khalil, Amrutha Prasad, Petr Motlicek, Juan Zuluaga-Gomez, Iuliia Nigmatulina, Srikanth Madikeri and Christof Schuepbach

In air traffic management (ATM), voice communications are critical for ensuring the safe and efficient operation of aircraft. The pertinent voice communications?air traffic controller (ATCo) and pilot?are usually transmitted in a single channel, which po... ver más

Revista: Aerospace Formato: Electrónico

Tabla de contenido: Vol: 10 Num: 0 Par: 10 Año: 2023

Effects of Language Ontology on Transatlantic Automatic Speech Understanding Research Collaboration in the Air Traffic Management Domain

Acceso

en línea

Shuo Chen, Hartmut Helmke, Robert M. Tarakan, Oliver Ohneiser, Hunter Kopald and Matthias Kleinert

As researchers around the globe develop applications for the use of Automatic Speech Recognition and Understanding (ASRU) in the Air Traffic Management (ATM) domain, Air Traffic Control (ATC) language ontologies will play a critical role in enabling rese... ver más

Revista: Aerospace Formato: Electrónico

Tabla de contenido: Vol: 10 Num: 0 Par: 6 Año: 2023

Lessons Learned in Transcribing 5000 h of Air Traffic Control Communications for Robust Automatic Speech Understanding

Acceso

en línea

Juan Zuluaga-Gomez, Iuliia Nigmatulina, Amrutha Prasad, Petr Motlicek, Driss Khalil, Srikanth Madikeri, Allan Tart, Igor Szoke, Vincent Lenders, Mickael Rigault and Khalid Choukri

Voice communication between air traffic controllers (ATCos) and pilots is critical for ensuring safe and efficient air traffic control (ATC). The handling of these voice communications requires high levels of awareness from ATCos and can be tedious and e... ver más

Revista: Aerospace Formato: Electrónico

Tabla de contenido: Vol: 10 Num: 0 Par: 10 Año: 2023

« Anterior Página: 1 de 4 Siguiente »