Applied Sciences, Vol. 13, Issue 12 (2023)
ARTICLE
TITLE

A Transformer-Based Approach to Authorship Attribution in Classical Arabic Texts

Fetoun Mansour AlZahrani and Maha Al-Yahya    

Abstract

Authorship attribution (AA) is a field of natural language processing that aims to attribute text to its author. Although the literature includes several studies on Arabic AA in general, applying AA to classical Arabic texts has not received similar attention. This study investigates recent Arabic pretrained transformer-based models in a rarely studied domain with limited research contributions: Islamic law. We adopt an experimental approach to investigate AA. Because no dataset has been designed specifically for this task, we design and build our own dataset using Islamic law digital resources. We conduct several experiments on fine-tuning four Arabic pretrained transformer-based models: AraBERT, AraELECTRA, ARBERT, and MARBERT. The results indicate that, for the task of attributing a given text to its author, ARBERT and AraELECTRA outperform the other models, with an accuracy of 96%. We conclude that pretrained transformer models, specifically ARBERT and AraELECTRA, fine-tuned on the Islamic legal dataset, achieve strong results when applying AA to Islamic legal texts.
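
As a rough illustration of how AA can be framed as a classification task, the sketch below splits each author's text into labeled samples and scores predictions by accuracy, the metric reported in the abstract. It is only a minimal sketch under stated assumptions: the function names `build_samples` and `accuracy`, the fixed word-count chunking, and the chunk size are hypothetical and are not taken from the paper, whose actual dataset construction and fine-tuning setup are not described in this abstract.

```python
def build_samples(texts_by_author, words_per_sample=5):
    """Split each author's text into fixed-size word chunks labeled with the author.

    Assumption: fixed-length chunking is an illustrative choice only;
    trailing words that do not fill a whole chunk are dropped.
    """
    samples = []
    for author, text in texts_by_author.items():
        words = text.split()
        last_start = len(words) - words_per_sample
        for start in range(0, last_start + 1, words_per_sample):
            chunk = " ".join(words[start:start + words_per_sample])
            samples.append((chunk, author))
    return samples


def accuracy(true_labels, predicted_labels):
    """Return the fraction of samples whose predicted author matches the true author."""
    if len(true_labels) != len(predicted_labels):
        raise ValueError("label lists must have the same length")
    if not true_labels:
        raise ValueError("label lists must not be empty")
    correct = sum(t == p for t, p in zip(true_labels, predicted_labels))
    return correct / len(true_labels)


if __name__ == "__main__":
    # Hypothetical placeholder corpus; real inputs would be the authors' texts.
    corpus = {
        "author_a": "w1 w2 w3 w4 w5 w6 w7 w8 w9 w10",
        "author_b": "x1 x2 x3 x4 x5 x6 x7",
    }
    for chunk, author in build_samples(corpus):
        print(author, "|", chunk)
```

In a setup like the one the abstract describes, the labeled samples would feed the fine-tuning of a pretrained model, and `accuracy` would be computed on held-out samples the model has not seen.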

Similar articles

       
 
Marco Siino, Elisa Di Nuovo, Ilenia Tinnirello and Marco La Cascia    
Guided by a corpus linguistics approach, in this article we present a comparative evaluation of State-of-the-Art (SotA) models, with a special focus on Transformers, to address the task of Fake News Spreaders (i.e., users that share Fake News) detection....
Journal: Information

 
Min-Hsien Weng, Shaoqun Wu and Mark Dyer    
With the rapidly growing number of scientific publications, researchers face an increasing challenge of discovering the current research topics and methodologies in a scientific domain. This paper describes an unsupervised topic detection approach that u...
Journal: Applied Sciences

 
Jia Song, Xindi Tong, Xiaowei Xu and Kai Zhao    
In this paper, a real-time reentry guidance law for hypersonic vehicles is presented to accomplish rapid, high-precision, robust, and reliable reentry flights by leveraging the Time to Vector (Time2vec) and transformer networks. First, referring to the t...
Journal: Aerospace

 
Samuel Kierszbaum, Thierry Klein and Laurent Lapasset    
We consider the problem of solving Natural Language Understanding (NLU) tasks characterized by domain-specific data. An effective approach consists of pre-training Transformer-based language models from scratch using domain-specific data before fine-tuni...
Journal: Aerospace

 
Jawaher Alghamdi, Yuqing Lin and Suhuai Luo    
Efforts have been dedicated by researchers in the field of natural language processing (NLP) to detecting and combating fake news using an assortment of machine learning (ML) and deep learning (DL) techniques. In this paper, a review of the existing stud...
Journal: Information