|
|
|
Yiming Liu, Hongtao Shan, Feng Nie, Gaoyu Zhang and George Xianzhi Yuan
The current popular approach to the extraction of document-level relations is mainly based on either a graph structure or serialization model method for the inference, but the graph structure method makes the model complicated, while the serialization mo...
ver más
|
|
|
|
|
|
|
Yifei Wang, Yongwei Wang, Hao Hu, Shengnan Zhou and Qinwu Wang
In order to solve the current problems in domain long text classification tasks, namely, the long length of a document, which makes it difficult for the model to capture key information, and the lack of expert domain knowledge, which leads to insufficien...
ver más
|
|
|
|
|
|
|
Marhanum Che Mohd Salleh, Rizal Mohd Nor, Faizal Yusof and Md Amiruzzaman
The aim of this research is to discuss the groundwork of building an Islamic Banking Document Screening Prototype based on a serverless architecture framework. This research first forms an algorithm for document matching based Vector Space Model (VCM) an...
ver más
|
|
|
|
|
|
|
Benjamin Shade and Eduardo G. Altmann
Quantifying the dissimilarity of two texts is an important aspect of a number of natural language processing tasks, including semantic information retrieval, topic classification, and document clustering. In this paper, we compared the properties and per...
ver más
|
|
|
|
|
|
|
Samuel R. Schrader and Eren Gultepe
The evaluation of similarities between natural languages often relies on prior knowledge of the languages being studied. We describe three methods for building phylogenetic trees and clustering languages without the use of language-specific information. ...
ver más
|
|
|
|
|
|
|
Liliya Demidova, Dmitry Zhukov, Elena Andrianova and Vladimir Kalinin
To solve the problem of text clustering according to semantic groups, we suggest using a model of a unified lexico-semantic bond between texts and a similarity matrix based on it. Using lexico-semantic analysis methods, we can create ?term?document? matr...
ver más
|
|
|
|
|
|
|
Eduardo Cibrián, Jose María Álvarez-Rodríguez, Roy Mendieta and Juan Llorens
The use of different techniques and tools is a common practice to cover all stages in the development life-cycle of systems generating a significant number of work products. These artefacts are frequently encoded using diverse formats, and often require ...
ver más
|
|
|
|
|
|
|
Shutian Deng, Gang Wang, Hongjun Wang and Fuliang Chang
Spain possesses a vast number of poems. Most have features that mean they present significantly different styles. A superficial reading of these poems may confuse readers due to their complexity. Therefore, it is of vital importance to classify the style...
ver más
|
|
|
|
|
|
|
Dauren Ayazbayev, Andrey Bogdanchikov, Kamila Orynbekova and Iraklis Varlamis
This work focuses on determining semantically close words and using semantic similarity in general in order to improve performance in information retrieval tasks. The semantic similarity of words is an important task with many applications from informati...
ver más
|
|
|
|
|
|
|
Adhi Dharma Wibawa, Arni Muarifah Amri, Arbintoro Mas, Syahrul Iman
Pág. 47 - 62
Opening job vacancies using the Internet will receive many applications quickly. Manually filtering resumes takes a lot of time and incurs huge costs. In addition, this manual screening process tends to be inaccurate due to fatigue conditions and fails i...
ver más
|
|
|
|