Eduardo Medeiros, Leonel Corado, Luís Rato, Paulo Quaresma and Pedro Salgueiro
Automatic speech recognition (ASR), commonly known as speech-to-text, is the process of transcribing audio recordings into text, i.e., transforming speech into the respective sequence of words. This paper presents a deep learning ASR system optimization ...
|
|
|
|
|
|
|
Pau Baquero-Arnal, Javier Jorge, Adrià Giménez, Javier Iranzo-Sánchez, Alejandro Pérez, Gonçal Vicent Garcés Díaz-Munío, Joan Albert Silvestre-Cerdà, Jorge Civera, Albert Sanchis and Alfons Juan
This work has direct application in live automatic captioning of audiovisual material, which is fundamental in accessibility.
|
|
|
|
|
|
|
Amira Dhouib, Achraf Othman, Oussama El Ghoul, Mohamed Koutheair Khribi and Aisha Al Sinani
Automatic Speech Recognition (ASR), also known as Speech-To-Text (STT) or computer speech recognition, has been an active field of research recently. This study aims to chart this field by performing a Systematic Literature Review (SLR) to give insight i...
|
|
|
|
|
|
|
Juan M. Perero-Codosero, Fernando M. Espinoza-Cuadros and Luis A. Hernández-Gómez
This paper describes a comparison between hybrid and end-to-end Automatic Speech Recognition (ASR) systems, which were evaluated on the IberSpeech-RTVE 2020 Speech-to-Text Transcription Challenge. Deep Neural Networks (DNNs) are becoming the most promisi...
|
|
|
|
|
|
|
Sakshi Dua, Sethuraman Sambath Kumar, Yasser Albagory, Rajakumar Ramalingam, Ankur Dumka, Rajesh Singh, Mamoon Rashid, Anita Gehlot, Sultan S. Alshamrani and Ahmed Saeed AlGhamdi
Deep learning-based machine learning models have shown significant results in speech recognition and numerous vision-related tasks. The performance of the present speech-to-text model relies upon the hyperparameters used in this research work. In this re...
|
|
|
|
|
|
|
Sriraksha Nayak, Chandrakala C B
pp. 52-69
The World Health Organization estimates that 285 million people worldwide have some visual impairment, of whom 39 million are blind. The inability to use features such as sending and reading of email, sched...
|
|
|
|
|
|
|
Khairunisa Sharif, Bastian Tenbergen
pp. 15-30
Intelligent voice assistants are internet-connected devices, which listen to their environment and react to spoken user commands in order to retrieve information from the internet, control appliances in the household, or notify the user of incoming messa...
|
|
|
|
|
|
|
Hye-Jeong Song, Hong-Ki Kim, Jong-Dae Kim, Chan-Young Park and Yu-Seop Kim
Recent advances in speech-to-text, which converts voice to text, and in machine translation have enabled technologies that simultaneously translate video captions into other languages. Using this, YouTube, a video-sharing site...
|
|
|
|
|
|
|
Mohamed Tamazin, Ahmed Gouda and Mohamed Khedr
Many new consumer applications are based on the use of automatic speech recognition (ASR) systems, such as voice command interfaces, speech-to-text applications, and data entry processes. Although ASR systems have remarkably improved in recent decades, t...
|
|
|
|
|
|
|
Dragos Ciobanu, Valentina Ragni and Alina Secara
Translation revision is a relevant topic for translator training and research. Recent technological developments justify increased focus on embedding speech technologies, namely speech synthesis (text-to-speech) and speech recognition (speech-to-text), into revis...