26 Artículos

Semi-Supervised Learning for Robust Emotional Speech Synthesis with Limited Data

Acceso

en línea

Jialin Zhang, Mairidan Wushouer, Gulanbaier Tuerhong and Hanfang Wang

Emotional speech synthesis is an important branch of human?computer interaction technology that aims to generate emotionally expressive and comprehensible speech based on the input text. With the rapid development of speech synthesis technology based on ... ver más

Revista: Applied Sciences Formato: Electrónico

Tabla de contenido: Vol: 13 Num: 0 Par: 9 Año: 2023

A Virtual Simulation-Pilot Agent for Training of Air Traffic Controllers

Acceso

en línea

Juan Zuluaga-Gomez, Amrutha Prasad, Iuliia Nigmatulina, Petr Motlicek and Matthias Kleinert

In this paper we propose a novel virtual simulation-pilot engine for speeding up air traffic controller (ATCo) training by integrating different state-of-the-art artificial intelligence (AI)-based tools. The virtual simulation-pilot engine receives spoke... ver más

Revista: Aerospace Formato: Electrónico

Tabla de contenido: Vol: 10 Num: 0 Par: 5 Año: 2023

Evaluation of Tacotron Based Synthesizers for Spanish and Basque

Acceso

en línea

Víctor García, Inma Hernáez and Eva Navas

In this paper, we describe the implementation and evaluation of Text to Speech synthesizers based on neural networks for Spanish and Basque. Several voices were built, all of them using a limited number of data. The system applies Tacotron 2 to compute m... ver más

Revista: Applied Sciences Formato: Electrónico

Tabla de contenido: Vol: 12 Num: 0 Par: 3 Año: 2022

Tell Me More: Automating Emojis Classification for Better Accessibility and Emotional Context Recognition

Acceso

en línea

Muhammad Atif and Valentina Franzoni

Users of web or chat social networks typically use emojis (e.g., smilies, memes, hearts) to convey in their textual interactions the emotions underlying the context of the communication, aiming for better interpretability, especially for short polysemous... ver más

Revista: Future Internet Formato: Electrónico

Tabla de contenido: Vol: 14 Num: 0 Par: 5 Año: 2022

MAKEDONKA: Applied Deep Learning Model for Text-to-Speech Synthesis in Macedonian Language

Acceso

en línea

Kostadin Mishev, Aleksandra Karovska Ristovska, Dimitar Trajanov, Tome Eftimov and Monika Simjanoska

This paper presents MAKEDONKA, the first open-source Macedonian language synthesizer that is based on the Deep Learning approach. The paper provides an overview of the numerous attempts to achieve a human-like reproducible speech, which has unfortunately... ver más

Revista: Applied Sciences Formato: Electrónico

Tabla de contenido: Vol: 10 Num: 0 Par: 19 Año: 2020

Review of existing text-to-speech algorithms

Acceso

en línea

Nikita Kireev,Eugene Ilyushin Pág. 84 - 90

Scientists have long been working on algorithms for translate text written in natural language into speech. But the quality of work these algorithms left much to be desired until the moment when the application of deep learning methods was not possible. ... ver más

Revista: International Journal of Open Information Technologies Formato: Electrónico

Tabla de contenido: Vol: 8 Num: 7 Par: 0 Año: 2020

Assistive Mobile Application for Visually Impaired People

Acceso

en línea

Sriraksha Nayak,Chandrakala C B Pág. pp. 52 - 69

According to the World Health Organization estimation, globally the number of people with some visual impairment is estimated to be 285 million, of whom 39 million are blind. The inability to use features such as sending and reading of email, sched... ver más

Revista: International Journal of Interactive Mobile Technologies (iJIM) Formato: Electrónico

Tabla de contenido: Vol: 14 Num: 16 Par: 0 Año: 2020

Exploring Efficient Neural Architectures for Linguistic?Acoustic Mapping in Text-To-Speech

Acceso

en línea

Santiago Pascual, Joan Serrà and Antonio Bonafonte

Conversion from text to speech relies on the accurate mapping from linguistic to acoustic symbol sequences, for which current practice employs recurrent statistical models such as recurrent neural networks. Despite the good performance of such models (in... ver más

Revista: Applied Sciences Formato: Electrónico

Tabla de contenido: Vol: 9 Num: 0 Par: 16 Año: 2019

Adaptive Refinements of Pitch Tracking and HNR Estimation within a Vocoder for Statistical Parametric Speech Synthesis

Acceso

en línea

Mohammed Salah Al-Radhi, Tamás Gábor Csapó and Géza Németh

The work discussed herein provides a reference for selecting appropriate techniques to optimize and improve the performance of current fundamental frequency estimation methods-based text-to-speech.

Revista: Applied Sciences Formato: Electrónico

Tabla de contenido: Vol: 9 Num: 0 Par: 12 Año: 2019

Es-Tacotron2: Multi-Task Tacotron 2 with Pre-Trained Estimated Network for Reducing the Over-Smoothness Problem

Acceso

en línea

Yifan Liu and Jin Zheng

Text-to-speech synthesis is a computational technique for producing synthetic, human-like speech by a computer. In recent years, speech synthesis techniques have developed, and have been employed in many applications, such as automatic translation applic... ver más

Revista: Information Formato: Electrónico

Tabla de contenido: Vol: 10 Num: 0 Par: 4 Año: 2019

« Anterior Página: 1 de 2 Siguiente »