Applied Sciences, Vol. 13, Issue 13 (2023)

Knowledge-Graph- and GCN-Based Domain Chinese Long Text Classification Method

Yifei Wang, Yongwei Wang, Hao Hu, Shengnan Zhou and Qinwu Wang

Abstract

Domain long text classification currently faces two problems: documents are long, which makes it difficult for a model to capture key information, and expert domain knowledge is lacking, which leads to insufficient classification accuracy. To address both, a domain long text classification model based on a knowledge graph and a graph convolutional neural network (GCN) is proposed. BERT is used to encode the text, and each word's corresponding vector is used as a node of the graph convolutional neural network, so that the initialized vectors contain rich semantic information. Using a trained entity-relationship extraction model, the entity-to-entity relationships in the document are extracted and used as the edges of the graph convolutional neural network, together with syntactic dependency information. A graph structure mask is used to learn edge relationships and edge types, further enhancing the model's ability to learn semantic dependencies between words. By fusing knowledge features and data features, the method further improves the accuracy of domain long text classification. Experiments on three long text classification datasets (IFLYTEK, THUCNews, and the Chinese corpus of Fudan University) show accuracy improvements of 8.8%, 3.6%, and 2.6%, respectively, relative to the BERT model.
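
The graph construction described in the abstract can be illustrated with a small sketch. This is not the authors' implementation: the random vectors stand in for BERT word vectors, the edge lists stand in for the output of a dependency parser and an entity-relationship extractor, and giving each edge type its own weight matrix is only one assumed way to let a layer distinguish edge types. All names here (`build_adjacency`, `gcn_layer`, the `"dependency"` and `"relation"` keys) are hypothetical.

```python
import math
import random


def build_adjacency(num_nodes, edges_by_type):
    """Return one symmetric 0/1 adjacency matrix (with self-loops) per edge type."""
    adjacency = {}
    for edge_type, edges in edges_by_type.items():
        a = [[1.0 if i == j else 0.0 for j in range(num_nodes)]
             for i in range(num_nodes)]
        for i, j in edges:
            a[i][j] = 1.0
            a[j][i] = 1.0
        adjacency[edge_type] = a
    return adjacency


def normalize(a):
    """Symmetric normalization D^{-1/2} A D^{-1/2}; self-loops keep degrees >= 1."""
    degree = [sum(row) for row in a]
    n = len(a)
    return [[a[i][j] / math.sqrt(degree[i] * degree[j]) for j in range(n)]
            for i in range(n)]


def matmul(x, y):
    """Plain matrix product of two list-of-lists matrices."""
    return [[sum(p * q for p, q in zip(row, col)) for col in zip(*y)]
            for row in x]


def gcn_layer(features, adjacency, weights):
    """One layer: ReLU(sum over edge types t of A_hat_t @ H @ W_t)."""
    n = len(features)
    out_dim = len(next(iter(weights.values()))[0])
    total = [[0.0] * out_dim for _ in range(n)]
    for edge_type, a in adjacency.items():
        message = matmul(matmul(normalize(a), features), weights[edge_type])
        for i in range(n):
            for k in range(out_dim):
                total[i][k] += message[i][k]
    return [[max(0.0, v) for v in row] for row in total]


rng = random.Random(0)
tokens = ["w0", "w1", "w2", "w3"]  # placeholder words of one document
in_dim, out_dim = 6, 3

# Assumption: random vectors stand in for per-word BERT encodings.
features = [[rng.uniform(-1, 1) for _ in range(in_dim)] for _ in tokens]

# Assumption: hand-written edges stand in for parser and extractor output.
edges_by_type = {
    "dependency": [(0, 1), (1, 2), (2, 3)],
    "relation": [(0, 3)],
}
adjacency = build_adjacency(len(tokens), edges_by_type)
weights = {
    edge_type: [[rng.uniform(-0.5, 0.5) for _ in range(out_dim)]
                for _ in range(in_dim)]
    for edge_type in edges_by_type
}

hidden = gcn_layer(features, adjacency, weights)
# Mean-pool node states into one document vector for a downstream classifier.
doc_vector = [sum(col) / len(hidden) for col in zip(*hidden)]
```

In this sketch the adjacency matrices play the role of a structural mask: a word only receives information from words it is connected to by a dependency or relation edge. The pooled `doc_vector` would then be passed to a classifier, which is omitted here.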
