Jialin Zhang, Mairidan Wushouer, Gulanbaier Tuerhong and Hanfang Wang
Emotional speech synthesis is an important branch of human-computer interaction technology that aims to generate emotionally expressive and comprehensible speech based on the input text. With the rapid development of speech synthesis technology based on ...
Juan Zuluaga-Gomez, Amrutha Prasad, Iuliia Nigmatulina, Petr Motlicek and Matthias Kleinert
In this paper we propose a novel virtual simulation-pilot engine for speeding up air traffic controller (ATCo) training by integrating different state-of-the-art artificial intelligence (AI)-based tools. The virtual simulation-pilot engine receives spoke...
Víctor García, Inma Hernáez and Eva Navas
In this paper, we describe the implementation and evaluation of text-to-speech synthesizers based on neural networks for Spanish and Basque. Several voices were built, all of them using a limited amount of data. The system applies Tacotron 2 to compute m...
Muhammad Atif and Valentina Franzoni
Users of web or chat social networks typically use emojis (e.g., smilies, memes, hearts) to convey in their textual interactions the emotions underlying the context of the communication, aiming for better interpretability, especially for short polysemous...
Kostadin Mishev, Aleksandra Karovska Ristovska, Dimitar Trajanov, Tome Eftimov and Monika Simjanoska
This paper presents MAKEDONKA, the first open-source Macedonian language synthesizer based on the deep learning approach. The paper provides an overview of the numerous attempts to achieve human-like reproducible speech, which has unfortunately...
Nikita Kireev, Eugene Ilyushin
pp. 84-90
Scientists have long been working on algorithms for translating text written in natural language into speech. But the quality of these algorithms' output left much to be desired until the application of deep learning methods became possible. ...
Sriraksha Nayak, Chandrakala C B
pp. 52-69
According to World Health Organization estimates, 285 million people worldwide have some visual impairment, of whom 39 million are blind. The inability to use features such as sending and reading of email, sched...
Santiago Pascual, Joan Serrà and Antonio Bonafonte
Conversion from text to speech relies on the accurate mapping from linguistic to acoustic symbol sequences, for which current practice employs recurrent statistical models such as recurrent neural networks. Despite the good performance of such models (in...
Mohammed Salah Al-Radhi, Tamás Gábor Csapó and Géza Németh
The work discussed herein provides a reference for selecting appropriate techniques to optimize and improve the performance of current text-to-speech systems based on fundamental frequency estimation methods.
Yifan Liu and Jin Zheng
Text-to-speech synthesis is a computational technique for producing synthetic, human-like speech by computer. In recent years, speech synthesis techniques have advanced and have been employed in many applications, such as automatic translation applic...