Inicio  /  Applied Sciences  /  Vol: 11 Par: 11 (2021)  /  Artículo
ARTÍCULO
TITULO

A Deep Belief Network Classification Approach for Automatic Diacritization of Arabic Text

Waref Almanaseer    
Mohammad Alshraideh and Omar Alkadi    

Resumen

Deep learning has emerged as a new area of machine learning research. It is an approach that can learn features and hierarchical representation purely from data and has been successfully applied to several fields such as images, sounds, text and motion. The techniques developed from deep learning research have already been impacting the research on Natural Language Processing (NLP). Arabic diacritics are vital components of Arabic text that remove ambiguity from words and reinforce the meaning of the text. In this paper, a Deep Belief Network (DBN) is used as a diacritizer for Arabic text. DBN is an algorithm among deep learning that has recently proved to be very effective for a variety of machine learning problems. We evaluate the use of DBNs as classifiers in automatic Arabic text diacritization. The DBN was trained to individually classify each input letter with the corresponding diacritized version. Experiments were conducted using two benchmark datasets, the LDC ATB3 and Tashkeela. Our best settings achieve a DER and WER of 2.21% and 6.73%, receptively, on the ATB3 benchmark with an improvement of 26% over the best published results. On the Tashkeela benchmark, our system continues to achieve high accuracy with a DER of 1.79% and 14% improvement.

 Artículos similares

       
 
Qinmeng Yang, Ningming Nie, Yangang Wang, Xiaojing Wu, Weihua Liu, Xiaoli Ren, Zijian Wang, Meng Wan and Rongqiang Cao    
Gross primary productivity (GPP) is an important indicator in research on carbon cycling in terrestrial ecosystems. High-accuracy GPP prediction is crucial for ecosystem health and climate change assessments. We developed a site-level GPP prediction meth... ver más
Revista: Applied Sciences

 
K. Lavanya, Anand Mahendran, Ramani Selvanambi, Manuel Mazzara and Jude D Hemanth    
Every biological system on the planet is severely impacted by environmental change, and its primary driver is deforestation. Meanwhile, quantitative analysis of changes in Land Use and Land Cover (LULC) is one of the prominent ways to manage and understa... ver más
Revista: Applied Sciences

 
Abha Pragati, Debadatta Amaresh Gadanayak, Tanmoy Parida and Manohar Mishra    
Considering the advantage of the ability of data-mining techniques (DMTs) to detect and classify patterns, this paper explores their applicability for the protection of voltage source converter-based high voltage direct current (VSC-HVDC) transmission sy... ver más

 
Rui Yang, Yingbo Zhao and Yuan Shi    
When radar receives target echoes to form plots, it is inevitably affected by clutter, which brings a lot of imprecise and uncertain information to target recognition. Traditional radar plot recognition algorithms often have poor performance in dealing w... ver más
Revista: Applied Sciences

 
Zhaofei Xu, Weidong Lu, Zhenyu Hu, Wei Yan, Wei Xue, Ta Zhou and Feifei Jiang    
Different types of buildings in different climate zones have their own design specifications and specific user populations. Generally speaking, these populations have similar sensory feedbacks in their perception of environmental thermal comfort. Existin... ver más
Revista: Applied Sciences