Information, Vol. 13, No. 9 (2022)
ARTICLE
TITLE

Fake News Spreaders Detection: Sometimes Attention Is Not All You Need

Marco Siino    
Elisa Di Nuovo    
Ilenia Tinnirello and Marco La Cascia    

Abstract

Guided by a corpus linguistics approach, in this article we present a comparative evaluation of State-of-the-Art (SotA) models, with a special focus on Transformers, to address the task of detecting Fake News Spreaders (i.e., users who share Fake News). First, we explore the reference multilingual dataset for the task using corpus linguistics techniques such as the chi-square test, keywords, and Word Sketch. Second, we perform experiments on several models for Natural Language Processing. Third, we perform a comparative evaluation using the most recent Transformer-based models (RoBERTa, DistilBERT, BERT, XLNet, ELECTRA, Longformer) and other deep and non-deep SotA models (CNN, MultiCNN, Bayes, SVM). The CNN outperforms all the other models tested and, to the best of our knowledge, any existing approach on the same dataset. Fourth, to better understand this result, we conduct a post-hoc analysis to investigate the behaviour of this best-performing black-box model. This study highlights the importance of choosing a suitable classifier for the specific task. To make an educated decision, we propose the use of corpus linguistics techniques. Our results suggest that large pre-trained deep models like Transformers are not necessarily the first choice when addressing a text classification task such as the one presented in this article. All the code developed to run our tests is publicly available on GitHub.
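As an illustration of the chi-square keyword comparison mentioned in the abstract, the following is a minimal sketch, not the authors' code. It scores each word by the Pearson chi-square statistic of a 2x2 contingency table (word vs. all other tokens, corpus A vs. corpus B). The function names `chi_square_2x2` and `keyword_scores` and the toy token lists are assumptions made for this example and do not come from the article or its dataset.

```python
from collections import Counter


def chi_square_2x2(a: int, b: int, c: int, d: int) -> float:
    """Pearson chi-square statistic (no continuity correction) for a 2x2 table.

    a: occurrences of the word in corpus A
    b: occurrences of the word in corpus B
    c: all other tokens in corpus A
    d: all other tokens in corpus B
    """
    n = a + b + c + d
    denom = (a + b) * (c + d) * (a + c) * (b + d)
    if denom == 0:
        # Degenerate table (an empty row or column): no evidence either way.
        return 0.0
    return n * (a * d - b * c) ** 2 / denom


def keyword_scores(tokens_a, tokens_b):
    """Return a dict mapping each word to its chi-square score across two corpora."""
    counts_a, counts_b = Counter(tokens_a), Counter(tokens_b)
    total_a, total_b = len(tokens_a), len(tokens_b)
    scores = {}
    for word in set(counts_a) | set(counts_b):
        a, b = counts_a[word], counts_b[word]
        scores[word] = chi_square_2x2(a, b, total_a - a, total_b - b)
    return scores


# Hypothetical toy corpora, invented purely for illustration.
spreaders = "breaking shock video breaking share shock".split()
checkers = "report study data report source data".split()

scores = keyword_scores(spreaders, checkers)
# Words with the highest scores are the most unevenly distributed.
ranked = sorted(scores.items(), key=lambda item: item[1], reverse=True)
for word, score in ranked[:3]:
    print(f"{word}: {score:.2f}")
```

A higher score means a word is distributed more unevenly between the two corpora, which is what makes it a keyword candidate. With corpora this small the statistic is not reliable; the example only shows the mechanics.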

Similar articles

Fawaz Khaled Alarfaj and Jawad Abbas Khan    
The online spread of fake news on various platforms has emerged as a significant concern, posing threats to public opinion, political stability, and the dissemination of reliable information. Researchers have turned to advanced technologies, including ma...
Journal: Algorithms

 
Pei-Xuan Li, Yu-Yun Huang, Chris Shei and Hsun-Ping Hsieh    
The growth of social platforms has lowered the barrier of entry into the media sector, allowing for the spread of false information and putting democratic politics and social security at peril. Preliminary analysis shows that posts sharing real news and ...
Journal: Applied Sciences

 
Chenbo Fu, Xingyu Pan, Xuejiao Liang, Shanqing Yu, Xiaoke Xu and Yong Min    
In recent years, fake news detection and its characteristics have attracted a number of researchers. However, most detection algorithms are driven by data rather than theories, which causes the existing approaches to only perform well on specific dataset...
Journal: Applied Sciences

 
Ruth S. Contreras-Espinosa and Jose Luis Eguia-Gomez    
Despite access to reliable information being essential for equal opportunities in our society, current school curricula only include some notions about media literacy in a limited context. Thus, it is necessary to create scenarios for reflection on and a...
Journal: Computers

 
Alejandro Valencia-Arias, Diana María Arango-Botero, Sebastián Cardona-Acevedo, Sharon Soledad Paredes Delgado and Ada Gallegos    
The COVID-19 pandemic and the boom of fake news cluttering the internet have revealed the power of social media today. However, young people are not yet aware of their role in the digital age, even though they are the main users of social media. As a res...
Journal: Informatics