|
|
|
Hermilo Santiago-Benito , Diana-Margarita Córdova-Esparza , Noé-Alejandro Castro-Sánchez , Teresa García-Ramirez , Julio-Alejandro Romero-González and Juan Terven
This paper introduces a novel method for collecting and translating texts from the Mixtec to the Spanish language. The method comprises four primary steps. First, we collected a Mixtec?Spanish corpus that includes 4568 sentences from educational and reli...
ver más
|
|
|
|
|
|
|
Patricia Takako Endo, Guto Leoni Santos, Maria Eduarda de Lima Xavier, Gleyson Rhuan Nascimento Campos, Luciana Conceição de Lima, Ivanovitch Silva, Antonia Egli and Theo Lynn
Public health interventions to counter the COVID-19 pandemic have accelerated and increased digital adoption and use of the Internet for sourcing health information. Unfortunately, there is evidence to suggest that it has also accelerated and increased t...
ver más
|
|
|
|
|
|
|
A. V. Chizhik
Pág. 21 - 29
Digital technologies have led to the formation of the new level of socio-cultural space, which is expressed in the permanent presence of the phenomenon of virtual reality in everyday life. It is the main motivating tool for social and political transform...
ver más
|
|
|
|
|
|
|
Hyun-Jin Kim, Ji-Won Baek and Kyungyong Chung
This study proposes the optimization method of the associative knowledge graph using TF-IDF based ranking scores. The proposed method calculates TF-IDF weights in all documents and generates term ranking. Based on the terms with high scores from TF-IDF b...
ver más
|
|
|
|
|
|
|
Driss Namly,Karim Bouzoubaa,Abdellah Yousfi
Stop words are defined as words that frequently appear in texts without carrying any significant information. For the Arabic language, existing works suffer from two main drawbacks (i) the use of only proprietary corpus and (ii) the reliance of only the ...
ver más
|
|
|
|
|
|
|
Vasyl Lytvyn,Victoria Vysotska,Ihor Budz,Yaroslav Pelekh,Nataliia Sokulska,Roman Kovalchuk,Lyudmyla Dzyubyk,Oksana Tereshchuk,Myroslav Komar
Pág. 28 - 51
The peculiarities of the application of linguo-statistics technologies for the identification of the style of the author of text content of scientific and technical profile are considered. Quantitative linguistic analysis of a text uses the benefits of c...
ver más
|
|
|
|
|
|
|
Sardar Parhat, Mijit Ablimit and Askar Hamdulla
In this paper, based on the multilingual morphological analyzer, we researched the similar low-resource languages, Uyghur and Kazakh, short text classification. Generally, the online linguistic resources of these languages are noisy. So a preprocessing i...
ver más
|
|
|
|
|
|
|
Essam Kazem Al-Yasiri,Ahmed Al-Azawei
Regardless of the clear growth of Arabic texts on social networking sites (SNSs), it is still difficult to understand or summarize users' opinions or perspectives on a specific topic. Accordingly, Arabic text classification is one of the most challenging...
ver más
|
|
|
|