Samuel R. Schrader and Eren Gultepe
The evaluation of similarities between natural languages often relies on prior knowledge of the languages being studied. We describe three methods for building phylogenetic trees and clustering languages without the use of language-specific information. ...
Hao Wang, Miao Li, Jianyong Duan, Li He and Qing Zhang
Previous work has demonstrated that end-to-end neural sequence models work well for document-level event role filler extraction. However, the end-to-end neural network model suffers from the problem of not being able to utilize global information, result...
Lukas Busch, Ruben van Heusden and Maarten Marx
Page stream segmentation (PSS) is the task of retrieving the boundaries that separate source documents given a consecutive stream of documents (for example, sequentially scanned PDF files). The task has recently gained more interest as a result of the di...
Hiromu Nakajima and Minoru Sasaki
Text classification is the task of estimating the genre of a document based on information such as word co-occurrence and frequency of occurrence. Text classification has been studied by various approaches. In this study, we focused on text classificatio...
Liliya Demidova, Dmitry Zhukov, Elena Andrianova and Vladimir Kalinin
To solve the problem of text clustering according to semantic groups, we suggest using a model of a unified lexico-semantic bond between texts and a similarity matrix based on it. Using lexico-semantic analysis methods, we can create "term-document" matr...
Eduardo Cibrián, Jose María Álvarez-Rodríguez, Roy Mendieta and Juan Llorens
The use of different techniques and tools is a common practice to cover all stages in the development life-cycle of systems generating a significant number of work products. These artefacts are frequently encoded using diverse formats, and often require ...
Su Yang and Farzin Deravi
In this paper, a novel re-engineering mechanism for the generation of word embeddings is proposed for document-level sentiment analysis. Current approaches to sentiment analysis often integrate feature engineering with classification, without optimizing ...
Rami Malkawi, Mohammad Daradkeh, Ammar El-Hassan and Pavel Petrov
Automated citation analysis is becoming increasingly important in assessing the scientific quality of publications and identifying patterns of collaboration among researchers. However, little attention has been paid to analyzing the scientific content of...
Nilufa Yeasmin, Nosin Ibna Mahbub, Mrinal Kanti Baowaly, Bikash Chandra Singh, Zulfikar Alom, Zeyar Aung and Mohammad Abdul Azim
The novel coronavirus disease (COVID-19) has dramatically affected people's daily lives worldwide. More specifically, since there is still insufficient access to vaccines and no straightforward, reliable treatment for COVID-19, every country has taken th...
Abdullah Y. Muaad, Hanumanthappa Jayappa, Mugahed A. Al-antari and Sungyoung Lee
Arabic text classification is a process to simultaneously categorize the different contextual Arabic contents into a proper category. In this paper, a novel deep learning Arabic text computer-aided recognition (ArCAR) is proposed to represent and recogni...