ARTÍCULO
TITULO

AN EXPANDABLE AND UP-TO-DATE LEXICON FOR SENTIMENT ANALYSIS OF ARABIC TWEETS

B. Ihnaini    
M. Mahmuddin    

Resumen

Sentiment analysis is the process of identifying the subjective opinion within a text. And it gains a huge interest due to its several benefits in developing economy, politic, and sociology. And since twitter is considered a rich source of people?s thoughts and opinions, it is urged to benefit from it to explore public opinions. Many researches have been conducted for English language, while Arabic language still got limited number of sentiment analysis studies, especially in the context of Arab dialects in social media. A lexicon-based approach is adopted to perform sentiment analysis on Arabic tweets, which rely on detecting sentiment words. These sentiment words are loaded in a sentiment lexicon where words are annotated by its sentiment polarity. One of the main issues of handling Arabic tweets is the changing nature of twitter, where new words that imply sentiment values emerged, and many slang words are evolved. In this paper, an expandable and up-to-date lexicon for Arabic (EULA) is developed to overcome the issue of inventing new words and phrases in social media. EULA rely on a pre-built lexicon of MSA sentiment words, and a set of rules to expand and enrich it with dialectical polarity words from a small amount of labeled tweets, and a large amount of unlabeled tweets. For evaluation, eight different corpuses of Arabic tweets were selected. And a pre-processing phase that includes normalization and stemming is implemented to reduce the number of unique words to be analyzed for sentiment analysis. Experiments show that EULA improved the lexicon-based approach`s accuracy and F-1 score by more than 20% on average.

 Artículos similares

       
 
Franco Guzzetti, Karen Lara Ngozi Anyabolu, Francesca Biolo and Lara D?Ambrosio    
In the construction field, the Building Information Modeling (BIM) methodology is becoming increasingly predominant and the standardization of its use is now an essential operation. This method has become widespread in recent years, thanks to the advanta... ver más
Revista: Applied Sciences

 
Yehree Kim, Jeon Min Kang, Ho-Young Song, Woo Seok Kang, Jung-Hoon Park and Jong Woo Chung    
This study was conducted to investigate the efficacy of a self-expandable retainer (SER) for endoscopic visualization of the external auditory canal (EAC). Tympanomeatal flap (TMF) elevation was performed in six cadaveric heads. Two different types of SE... ver más
Revista: Applied Sciences

 
Asghar Rezaei, Hugo Giambini, Alan L. Miller II, Xifeng Liu, Benjamin D. Elder, Michael J. Yaszemski and Lichun Lu    
The spinal column is the most common site for bone metastasis. Vertebral metastases with instability have historically been treated with corpectomy of the affected vertebral body and adjacent intervertebral discs, and have more recently been treated with... ver más
Revista: Applied Sciences

 
Fan He, S.K. Ong and A.Y.C. Nee    
The ever-increasing complexity in manufacturing environments has caused delay and uncertainty for on-site personnel when retrieving critical information. Currently, information associated with manufacturing environments is created and stored in centraliz... ver más
Revista: Information

 
Lucie Sperkova    
Customer experience (CX) focuses on customer feedback. CX is a holistic construct which contains different perceptual elements such as satisfaction and loyalty, but also emotions or personality. Customers share their opinions, which contain these element... ver más