REVISTA
Applied Sciences

TODAS

Redirigiendo al acceso original de articulo en 20 segundos...

Inicio / Applied Sciences / Vol: 9 Par: 9 (2019) / Artículo

ARTÍCULO

TITULO

Regularized Urdu Speech Recognition with Semi-Supervised Deep Learning

Mohammad Ali Humayun

Ibrahim A. Hameed

Syed Muslim Shah

Sohaib Hassan Khan

Irfan Zafar

Saad Bin Ahmed and Junaid Shuja

Resumen

Automatic Speech Recognition, (ASR) has achieved the best results for English, with end-to-end neural network based supervised models. These supervised models need huge amounts of labeled speech data for good generalization, which can be quite a challenge to obtain for low-resource languages like Urdu. Most models proposed for Urdu ASR are based on Hidden Markov Models (HMMs). This paper proposes an end-to-end neural network model, for Urdu ASR, regularized with dropout, ensemble averaging and Maxout units. Dropout and ensembles are averaging techniques over multiple neural network models while Maxout are units in a neural network which adapt their activation functions. Due to limited labeled data, Semi Supervised Learning (SSL) techniques are also incorporated to improve model generalization. Speech features are transformed into a lower dimensional manifold using an unsupervised dimensionality-reduction technique called Locally Linear Embedding (LLE). Transformed data along with higher dimensional features is used to train neural networks. The proposed model also utilizes label propagation-based self-training of initially trained models and achieves a Word Error Rate (WER) of 4% less than that reported as the benchmark on the same Urdu corpus using HMM. The decrease in WER after incorporating SSL is more significant with an increased validation data size.

Palabras claves

speech recognition - locally linear embedding - label propagation - Maxout - low resource languages

Acceso

PÁGINAS

pp. 0 - 0

NÚMERO

Volumen: 9 Parte: 9 (2019)

MATERIAS

INGENIERÍA Y CONSTRUCCIÓN CIVIL
TECNOLOGÍA

REVISTAS SIMILARES

Water
Applied Sciences
Information

DOI

https://doi.org/10.3390/app9091956

Artículos similares

Quantum Computing and Machine Learning on an Integrated Photonics Platform

Acceso

Huihui Zhu, Hexiang Lin, Shaojun Wu, Wei Luo, Hui Zhang, Yuancheng Zhan, Xiaoting Wang, Aiqun Liu and Leong Chuan Kwek

Integrated photonic chips leverage the recent developments in integrated circuit technology, along with the control and manipulation of light signals, to realize the integration of multiple optical components onto a single chip. By exploiting the power o... ver más

Revista: Information

Quantum Convolutional Long Short-Term Memory Based on Variational Quantum Algorithms in the Era of NISQ

Acceso

Zeyu Xu, Wenbin Yu, Chengjun Zhang and Yadang Chen

In the era of noisy intermediate-scale quantum (NISQ) computing, the synergistic collaboration between quantum and classical computing models has emerged as a promising solution for tackling complex computational challenges. Long short-term memory (LSTM)... ver más

Revista: Information

Integrated Generative Adversarial Networks and Deep Convolutional Neural Networks for Image Data Classification: A Case Study for COVID-19

Acceso

Ku Muhammad Naim Ku Khalif, Woo Chaw Seng, Alexander Gegov, Ahmad Syafadhli Abu Bakar and Nur Adibah Shahrul

Convolutional Neural Networks (CNNs) have garnered significant utilisation within automated image classification systems. CNNs possess the ability to leverage the spatial and temporal correlations inherent in a dataset. This study delves into the use of ... ver más

Revista: Information

Research on Passenger Evacuation Behavior in Civil Aircraft Demonstration Experiments Based on Neural Networks and Modeling

Acceso

Zhenyu Feng, Qianqian You, Kun Chen, Houjin Song and Haoxuan Peng

Evacuation simulation is an important method for studying and evaluating the safety of passenger evacuation, and the key lies in whether it can accurately predict personnel evacuation behavior in different environments. The existing models have good adap... ver más

Revista: Aerospace

Stacked Multiscale Densely Connected Temporal Convolutional Attention Network for Multi-Objective Speech Enhancement in an Airborne Environment

Acceso

Ping Huang and Yafeng Wu

Airborne speech enhancement is always a major challenge for the security of airborne systems. Recently, multi-objective learning technology has become one of the mainstream methods of monaural speech enhancement. In this paper, we propose a novel multi-o... ver más

Revista: Aerospace

Revistas destacadas

Acceso directo a los números publicados en la revista Infrastructures

Infrastructures

Acceso directo a los números publicados en la revista Informed Infraestructure

Informed Infraestructure

Acceso directo a los números publicados en la revista BiT

Acceso directo a los números publicados en la revista Revista de la Construcción

Revista de la Construcción

Ver todas las revistas