23 Artículos

Domain Adaptation Speech-to-Text for Low-Resource European Portuguese Using Deep Learning

Acceso

en línea

Eduardo Medeiros, Leonel Corado, Luís Rato, Paulo Quaresma and Pedro Salgueiro

Automatic speech recognition (ASR), commonly known as speech-to-text, is the process of transcribing audio recordings into text, i.e., transforming speech into the respective sequence of words. This paper presents a deep learning ASR system optimization ... ver más

Revista: Future Internet Formato: Electrónico

Tabla de contenido: Vol: 15 Num: 0 Par: 5 Año: 2023

MLLP-VRAIN Spanish ASR Systems for the Albayzín-RTVE 2020 Speech-to-Text Challenge: Extension

Acceso

en línea

Pau Baquero-Arnal, Javier Jorge, Adrià Giménez, Javier Iranzo-Sánchez, Alejandro Pérez, Gonçal Vicent Garcés Díaz-Munío, Joan Albert Silvestre-Cerdà, Jorge Civera, Albert Sanchis and Alfons Juan

This work has direct application in live automatic captioning of audiovisual material, which is fundamental in accessibility.

Revista: Applied Sciences Formato: Electrónico

Tabla de contenido: Vol: 12 Num: 0 Par: 2 Año: 2022

Arabic Automatic Speech Recognition: A Systematic Literature Review

Acceso

en línea

Amira Dhouib, Achraf Othman, Oussama El Ghoul, Mohamed Koutheair Khribi and Aisha Al Sinani

Automatic Speech Recognition (ASR), also known as Speech-To-Text (STT) or computer speech recognition, has been an active field of research recently. This study aims to chart this field by performing a Systematic Literature Review (SLR) to give insight i... ver más

Revista: Applied Sciences Formato: Electrónico

Tabla de contenido: Vol: 12 Num: 0 Par: 17 Año: 2022

A Comparison of Hybrid and End-to-End ASR Systems for the IberSpeech-RTVE 2020 Speech-to-Text Transcription Challenge

Acceso

en línea

Juan M. Perero-Codosero, Fernando M. Espinoza-Cuadros and Luis A. Hernández-Gómez

This paper describes a comparison between hybrid and end-to-end Automatic Speech Recognition (ASR) systems, which were evaluated on the IberSpeech-RTVE 2020 Speech-to-Text Transcription Challenge. Deep Neural Networks (DNNs) are becoming the most promisi... ver más

Revista: Applied Sciences Formato: Electrónico

Tabla de contenido: Vol: 12 Num: 0 Par: 2 Año: 2022

Developing a Speech Recognition System for Recognizing Tonal Speech Signals Using a Convolutional Neural Network

Acceso

en línea

Sakshi Dua, Sethuraman Sambath Kumar, Yasser Albagory, Rajakumar Ramalingam, Ankur Dumka, Rajesh Singh, Mamoon Rashid, Anita Gehlot, Sultan S. Alshamrani and Ahmed Saeed AlGhamdi

Deep learning-based machine learning models have shown significant results in speech recognition and numerous vision-related tasks. The performance of the present speech-to-text model relies upon the hyperparameters used in this research work. In this re... ver más

Revista: Applied Sciences Formato: Electrónico

Tabla de contenido: Vol: 12 Num: 0 Par: 12 Año: 2022

Assistive Mobile Application for Visually Impaired People

Acceso

en línea

Sriraksha Nayak,Chandrakala C B Pág. pp. 52 - 69

According to the World Health Organization estimation, globally the number of people with some visual impairment is estimated to be 285 million, of whom 39 million are blind. The inability to use features such as sending and reading of email, sched... ver más

Revista: International Journal of Interactive Mobile Technologies (iJIM) Formato: Electrónico

Tabla de contenido: Vol: 14 Num: 16 Par: 0 Año: 2020

Smart Home Voice Assistants: A Literature Survey of User Privacy and Security Vulnerabilities

Acceso

en línea

Khairunisa Sharif,Bastian Tenbergen Pág. 15 - 30

Intelligent voice assistants are internet-connected devices, which listen to their environment and react to spoken user commands in order to retrieve information from the internet, control appliances in the household, or notify the user of incoming messa... ver más

Revista: Complex Systems Informatics and Modeling Quarterly Formato: Electrónico

Tabla de contenido: Vol: PP Num: 24 Par: 0 Año: 2020

Inter-Sentence Segmentation of YouTube Subtitles Using Long-Short Term Memory (LSTM)

Acceso

en línea

Hye-Jeong Song, Hong-Ki Kim, Jong-Dae Kim, Chan-Young Park and Yu-Seop Kim

Recently, with the development of Speech to Text, which converts voice to text, and machine translation, technologies for simultaneously translating the captions of video into other languages have been developed. Using this, YouTube, a video-sharing site... ver más

Revista: Applied Sciences Formato: Electrónico

Tabla de contenido: Vol: 9 Num: 0 Par: 7 Año: 2019

Enhanced Automatic Speech Recognition System Based on Enhancing Power-Normalized Cepstral Coefficients

Acceso

en línea

Mohamed Tamazin, Ahmed Gouda and Mohamed Khedr

Many new consumer applications are based on the use of automatic speech recognition (ASR) systems, such as voice command interfaces, speech-to-text applications, and data entry processes. Although ASR systems have remarkably improved in recent decades, t... ver más

Revista: Applied Sciences Formato: Electrónico

Tabla de contenido: Vol: 9 Num: 0 Par: 10 Año: 2019

Speech Synthesis in the Translation Revision Process: Evidence from Error Analysis, Questionnaire, and Eye-Tracking

Acceso

en línea

Dragos Ciobanu, Valentina Ragni and Alina Secara

Translation revision is a relevant topic for translator training and research. Recent technological developments justify increased focus on embedding speech technologies?speech synthesis (text-to-speech) and speech recognition (speech-to-text)?into revis... ver más

Revista: Informatics Formato: Electrónico

Tabla de contenido: Vol: 6 Num: 0 Par: 4 Año: 2019

« Anterior Página: 1 de 2 Siguiente »