|
|
|
Melania Nitu and Mihai Dascalu
Machine-generated content reshapes the landscape of digital information; hence, ensuring the authenticity of texts within digital libraries has become a paramount concern. This work introduces a corpus of approximately 60 k Romanian documents, including ...
ver más
|
|
|
|
|
|
|
Jiale Qian, Yunyan Du, Fuyuan Liang, Jiawei Yi, Nan Wang, Wenna Tu, Sheng Huang, Tao Pei and Ting Ma
Understanding the public?s diverse linguistic expressions about rainfall and flood provides a basis for flood disaster studies and enhances linguistic and cultural awareness. However, existing research tends to overlook linguistic complexity, potentially...
ver más
|
|
|
|
|
|
|
Jiajia Peng and Tianbing Tang
Image captioning, also recognized as the challenge of transforming visual data into coherent natural language descriptions, has persisted as a complex problem. Traditional approaches often suffer from semantic gaps, wherein the generated textual descript...
ver más
|
|
|
|
|
|
|
Emilio Matricciani
We propose that short-term memory (STM), when processing a sentence, uses two independent units in series. The clues for conjecturing this model emerge from studying many novels from Italian and English Literature. This simple model, referring to the sur...
ver más
|
|
|
|
|
|
|
Feng Li, Xuefeng Xi, Zhiming Cui, Dongyang Li and Wanting Zeng
Essays are a pivotal component of conventional exams; accurately, efficiently, and effectively grading them is a significant challenge for educators. Automated essay scoring (AES) is a complex task that utilizes computer technology to assist teachers in ...
ver más
|
|
|
|
|
|
|
Shelley Gupta, Archana Singh and Vivek Kumar
Virtual users generate a gigantic volume of unbalanced sentiments over various online crowd-sourcing platforms which consist of text, emojis, or a combination of both. Its accurate analysis brings profits to various industries and their services. The sta...
ver más
|
|
|
|
|
|
|
Barbara Brzic, Ivica Boticki and Marina Bagic Babac
Deception in computer-mediated communication represents a threat, and there is a growing need to develop efficient methods of detecting it. Machine learning models have, through natural language processing, proven to be extremely successful at detecting ...
ver más
|
|
|
|
|
|
|
Roberto Corizzo and Sebastian Leal-Arenas
Detection of AI-generated content is a crucially important task considering the increasing attention towards AI tools, such as ChatGPT, and the raised concerns with regard to academic integrity. Existing text classification approaches, including neural-n...
ver más
|
|
|
|
|
|
|
Michelle P. Banawan, Jinnie Shin, Tracy Arner, Renu Balyan, Walter L. Leite and Danielle S. McNamara
Academic discourse communities and learning circles are characterized by collaboration, sharing commonalities in terms of social interactions and language. The discourse of these communities is composed of jargon, common terminologies, and similarities i...
ver más
|
|
|
|
|
|
|
Tobias Nießner, Stefan Nießner and Matthias Schumann
How can useful information extracted from unstructured data be used to contribute to a better prediction of corporate failure or bankruptcy? In this research, we examine a data set of 2,163,147 financial statements of German companies that are triple cla...
ver más
|
|
|
|