Inicio  /  Applied Sciences  /  Vol: 13 Par: 6 (2023)  /  Artículo
ARTÍCULO
TITULO

CWSXLNet: A Sentiment Analysis Model Based on Chinese Word Segmentation Information Enhancement

Shiqian Guo    
Yansun Huang    
Baohua Huang    
Linda Yang and Cong Zhou    

Resumen

This paper proposed a method for improving the XLNet model to address the shortcomings of segmentation algorithm for processing Chinese language, such as long sub-word lengths, long word lists and incomplete word list coverage. To address these issues, we proposed the CWSXLNet (Chinese Word Segmentation XLNet) model based on Chinese word segmentation information enhancement. The model first pre-processed Chinese pretrained text by Chinese word segmentation tool, and proposed a Chinese word segmentation attention mask mechanism by combining PLM (Permuted Language Model) and two-stream self-attention mechanism of XLNet. While performing natural language processing at word granularity, it can reduce the degree of masking between masked and non-masked words for two words belonging to the same word. For the Chinese sentiment analysis task, proposed the CWSXLNet-BiGRU-Attention model, which introduces bi-directional GRU as well as self-attention mechanism in the downstream task. Experiments show that CWSXLNet has achieved 89.91% precision, 91.53% recall rate and 90.71% F1-score, and CWSXLNet-BiGRU-Attention has achieved 92.61% precision, 93.19% recall rate and 92.90% F1-score on ChnSentiCorp dataset, which indicates that CWSXLNet has better performance than other models in Chinese sentiment analysis.

 Artículos similares

       
 
Achini Adikari, Su Nguyen, Rashmika Nawaratne, Daswin De Silva and Damminda Alahakoon    
The proliferation of online hotel review platforms has prompted decision-makers in the hospitality sector to acknowledge the significance of extracting valuable information from this vast source. While contemporary research has primarily focused on extra... ver más
Revista: Applied Sciences

 
Hongyu Shao, Sizhe Pan, Yufei Song and Quanfu Li    
In the context of rapid product iteration, design conflicts arise from discrepancies in designers? understanding of user needs, influenced by subjective preferences, behavioural stances, and other factors. This paper proposes a product conceptual design ... ver más
Revista: Applied Sciences

 
Haidi Badr, Nayer Wanas and Magda Fayek    
Unsupervised domain adaptation (UDA) presents a significant challenge in sentiment analysis, especially when faced with differences between source and target domains. This study introduces Weighted Sequential Unsupervised Domain Adaptation (WS-UDA), a no... ver más
Revista: Applied Sciences

 
Mahammad Khalid Shaik Vadla, Mahima Agumbe Suresh and Vimal K. Viswanathan    
Understanding customer emotions and preferences is paramount for success in the dynamic product design landscape. This paper presents a study to develop a prediction pipeline to detect the aspect and perform sentiment analysis on review data. The pre-tra... ver más
Revista: Algorithms

 
Peranut Nimitsurachat and Peter Washington    
Emotion recognition models using audio input data can enable the development of interactive systems with applications in mental healthcare, marketing, gaming, and social media analysis. While the field of affective computing using audio data is rich, a m... ver más
Revista: AI