Inicio  /  Future Internet  /  Vol: 14 Par: 10 (2022)  /  Artículo
ARTÍCULO
TITULO

Author Identification from Literary Articles with Visual Features: A Case Study with Bangla Documents

Ankita Dhar    
Himadri Mukherjee    
Shibaprasad Sen    
Md Obaidullah Sk    
Amitabha Biswas    
Teresa Gonçalves and Kaushik Roy    

Resumen

Author identification is an important aspect of literary analysis, studied in natural language processing (NLP). It aids identify the most probable author of articles, news texts or social media comments and tweets, for example. It can be applied to other domains such as criminal and civil cases, cybersecurity, forensics, identification of plagiarizer, and many more. An automated system in this context can thus be very beneficial for society. In this paper, we propose a convolutional neural network (CNN)-based author identification system from literary articles. This system uses visual features along with a five-layer convolutional neural network for the identification of authors. The prime motivation behind this approach was the feasibility to identify distinct writing styles through a visualization of the writing patterns. Experiments were performed on 1200 articles from 50 authors achieving a maximum accuracy of 93.58%. Furthermore, to see how the system performed on different volumes of data, the experiments were performed on partitions of the dataset. The system outperformed standard handcrafted feature-based techniques as well as established works on publicly available datasets.

 Artículos similares

       
 
S. L. Litvinenko     Pág. 200 - 202
In the article the peculiarities of practical implementation of the concept of informatization of national cargo air carriers, based on the use of radiofrequency identification technology in order to improve the service processing of cargoes by direct in... ver más

 
José María De Luca    
Este artículo analiza la experiencia del autor, quien ha introducido un desarrollo virtual en tecnología educativa para la enseñanza de una metodología de identificación de fortalezas y debilidades  sectoriales, basadas en innovación tecnológica, todo el... ver más