REVISTA
Applied Sciences

TODAS

Redirigiendo al acceso original de articulo en 18 segundos...

Inicio / Applied Sciences / Vol: 10 Par: 19 (2020) / Artículo

ARTÍCULO

TITULO

MAKEDONKA: Applied Deep Learning Model for Text-to-Speech Synthesis in Macedonian Language

Kostadin Mishev

Aleksandra Karovska Ristovska

Dimitar Trajanov

Tome Eftimov and Monika Simjanoska

Resumen

This paper presents MAKEDONKA, the first open-source Macedonian language synthesizer that is based on the Deep Learning approach. The paper provides an overview of the numerous attempts to achieve a human-like reproducible speech, which has unfortunately shown to be unsuccessful due to the work invisibility and lack of integration examples with real software tools. The recent advances in Machine Learning, the Deep Learning-based methodologies, provide novel methods for feature engineering that allow for smooth transitions in the synthesized speech, making it sound natural and human-like. This paper presents a methodology for end-to-end speech synthesis that is based on a fully-convolutional sequence-to-sequence acoustic model with a position-augmented attention mechanism?Deep Voice 3. Our model directly synthesizes Macedonian speech from characters. We created a dataset that contains approximately 20 h of speech from a native Macedonian female speaker, and we use it to train the text-to-speech (TTS) model. The achieved MOS score of 3.93 makes our model appropriate for application in any kind of software that needs text-to-speech service in the Macedonian language. Our TTS platform is publicly available for use and ready for integration.

Palabras claves

macedonian language - text-to-speech - deep learning - natural language processing - speech synthesizer

Acceso

PÁGINAS

pp. 0 - 0

NÚMERO

Volumen: 10 Parte: 19 (2020)

MATERIAS

INGENIERÍA Y CONSTRUCCIÓN CIVIL
TECNOLOGÍA

REVISTAS SIMILARES

Aerospace
Journal of Transport and Land Use
AI

DOI

https://doi.org/10.3390/app10196882

Artículos similares

Transfer Learning-Based Classification of Maxillary Sinus Using Generative Adversarial Networks

Acceso

Mohammad Alhumaid and Ayman G. Fayoumi

Paranasal sinus pathologies, particularly those affecting the maxillary sinuses, pose significant challenges in diagnosis and treatment due to the complex anatomical structures and diverse disease manifestations. The aim of this study is to investigate t... ver más

Revista: Applied Sciences

Progressing towards Estimates of Local Emissions from Trees in Cities: A Transdisciplinary Framework Integrating Available Municipal Data, AI, and Citizen Science

Acceso

Julia Mayer, Martin Memmel, Johannes Ruf, Dhruv Patel, Lena Hoff and Sascha Henninger

Urban tree cadastres, crucial for climate adaptation and urban planning, face challenges in maintaining accuracy and completeness. A transdisciplinary approach in Kaiserslautern, Germany, complements existing incomplete tree data with additional precise ... ver más

Revista: Applied Sciences

An Unsupervised Learning Method for Suppressing Ground Roll in Deep Pre-Stack Seismic Data Based on Wavelet Prior Information for Deep Learning in Seismic Data

Acceso

Jiarui Xia and Yongshou Dai

Ground roll noise suppression is a crucial step in processing deep pre-stack seismic data. Recently, supervised deep learning methods have gained popularity in this field due to their ability to adaptively learn and extract powerful features. However, th... ver más

Revista: Applied Sciences

Investigating the Impact of Wildfires on Lake Water Quality Using Earth Observation Satellites

Acceso

Rossana Caroni, Monica Pinardi, Gary Free, Daniela Stroppiana, Lorenzo Parigi, Giulio Tellina, Mariano Bresciani, Clément Albergel and Claudia Giardino

A study was carried out to investigate the effects of wildfires on lake water quality using a source dataset of 2024 lakes worldwide, covering different lake types and ecological settings. Satellite-derived datasets (Lakes_cci and Fire_cci) were used and... ver más

Revista: Applied Sciences

Detection Method for Rice Seedling Planting Conditions Based on Image Processing and an Improved YOLOv8n Model

Acceso

Bo Zhao, Qifan Zhang, Yangchun Liu, Yongzhi Cui and Baixue Zhou

In response to the need for precision and intelligence in the assessment of transplanting machine operation quality, this study addresses challenges such as low accuracy and efficiency associated with manual observation and random field sampling for the ... ver más

Revista: Applied Sciences

Revistas destacadas

Acceso directo a los números publicados en la revista Infrastructures

Infrastructures

Acceso directo a los números publicados en la revista Informed Infraestructure

Informed Infraestructure

Acceso directo a los números publicados en la revista BiT

Acceso directo a los números publicados en la revista Revista de la Construcción

Revista de la Construcción

Ver todas las revistas