Redirigiendo al acceso original de articulo en 19 segundos...
Inicio  /  Information  /  Vol: 10 Par: 9 (2019)  /  Artículo
ARTÍCULO
TITULO

The Usefulness of Imperfect Speech Data for ASR Development in Low-Resource Languages

Jaco Badenhorst and Febe de Wet    

Resumen

When the National Centre for Human Language Technology (NCHLT) Speech corpus was released, it created various opportunities for speech technology development in the 11 official, but critically under-resourced, languages of South Africa. Since then, the substantial improvements in acoustic modeling that deep architectures achieved for well-resourced languages ushered in a new data requirement: their development requires hundreds of hours of speech. A suitable strategy for the enlargement of speech resources for the South African languages is therefore required. The first possibility was to look for data that has already been collected but has not been included in an existing corpus. Additional data was collected during the NCHLT project that was not included in the official corpus: it only contains a curated, but limited subset of the data. In this paper, we first analyze the additional resources that could be harvested from the auxiliary NCHLT data. We also measure the effect of this data on acoustic modeling. The analysis incorporates recent factorized time-delay neural networks (TDNN-F). These models significantly reduce phone error rates for all languages. In addition, data augmentation and cross-corpus validation experiments for a number of the datasets illustrate the utility of the auxiliary NCHLT data.

 Artículos similares

       
 
Zuhier Alakayleh, Xing Fang and T. Prabhakar Clement    
This study aims at furthering our understanding of the Modified Philip?Dunne Infiltrometer (MPDI), which is used to determine the saturated hydraulic conductivity Ks and the Green?Ampt suction head ? at the wetting front. We have developed a forward-mode... ver más
Revista: Water

 
Jae Young Seo and Sang-Il Lee    
Drought is a complex phenomenon caused by lack of precipitation that affects water resources and human society. Groundwater drought is difficult to assess due to its complexity and the lack of spatio-temporal groundwater observations. In this study, we p... ver más
Revista: Water

 
Zain Nawaz, Xin Li, Yingying Chen, Yanlong Guo, Xufeng Wang and Naima Nawaz    
Identifying the changes in precipitation and temperature at a regional scale is of great importance for the quantification of climate change. This research investigates the changes in precipitation and surface air temperature indices in the seven irrigat... ver más
Revista: Water

 
Fhrizz S. De Jesus, Lyka Mae L. Fajardo     Pág. 13 - 32
AbstractEmployee development and training programs are critical to the global success of firms. Not only do these programs enable employees to develop new abilities, but they also enable businesses to increase employee productivity and improve company cu... ver más

 
M.H.J.P. Gunarathna, Kazuhito Sakai, Tamotsu Nakandakari, Kazuro Momii and M.K.N. Kumari    
Poor data availability on soil hydraulic properties in tropical regions hampers many studies, including crop and environmental modeling. The high cost and effort of measurement and the increasing demand for such data have driven researchers to search for... ver más
Revista: Water