23 Artículos

Model of Lexico-Semantic Bonds between Texts for Creating Their Similarity Metrics and Developing Statistical Clustering Algorithm

Acceso

en línea

Liliya Demidova, Dmitry Zhukov, Elena Andrianova and Vladimir Kalinin

To solve the problem of text clustering according to semantic groups, we suggest using a model of a unified lexico-semantic bond between texts and a similarity matrix based on it. Using lexico-semantic analysis methods, we can create ?term?document? matr... ver más

Revista: Algorithms Formato: Electrónico

Tabla de contenido: Vol: 16 Num: 0 Par: 4 Año: 2023

Social Media Opinion Analysis Model Based on Fusion of Text and Structural Features

Acceso

en línea

Jie Long, Zihan Li, Qi Xuan, Chenbo Fu, Songtao Peng and Yong Min

The opinion recognition for comments in Internet media is a new task in text analysis. It takes comment statements as the research object, by learning the opinion tendency in the original text with annotation, and then performing opinion tendency recogni... ver más

Revista: Applied Sciences Formato: Electrónico

Tabla de contenido: Vol: 13 Num: 0 Par: 12 Año: 2023

Accuracy analysis of machine learning models using vectorization methods for heterogeneous text data classification tasks

Acceso

en línea

A.N. Alpatov,K.S. Popov,A.N. Chesalin Pág. 47 - 53

This paper investigates the problem of natural language processing using machine learning techniques, in particular, classification of unstructured heterogeneous text data sets. The paper presents a comparative analysis of some relevant and widely used m... ver más

Revista: International Journal of Open Information Technologies Formato: Electrónico

Tabla de contenido: Vol: 10 Num: 7 Par: 0 Año: 2022

Social Mapping Based on Sentiment Analysis of Comments in Social Media

Acceso

en línea

Anna Chizhik,Svetlana Melnikova,Victor Zakharov Pág. 75 - 80

The paper is devoted to the testing results of the sentiment analysis algorithms. They were applied to downloaded from the social network VKontakte comments. Comments were written on posts in public communities related to the discussion of the news agend... ver más

Revista: International Journal of Open Information Technologies Formato: Electrónico

Tabla de contenido: Vol: 10 Num: 11 Par: 0 Año: 2022

Greedy Texts Similarity Mapping

Acceso

en línea

Aliya Jangabylova, Alexander Krassovitskiy, Rustam Mussabayev and Irina Ualiyeva

The documents similarity metric is a substantial tool applied in areas such as determining topic in relation to documents, plagiarism detection, or problems necessary to capture the semantic, syntactic, or structural similarity of texts. Evaluated result... ver más

Revista: Computation Formato: Electrónico

Tabla de contenido: Vol: 10 Num: 0 Par: 11 Año: 2022

Historical Vltava River Valley?Various Historical Sources within Web Mapping Environment

Acceso

en línea

Jirí Krejcí and Jirí Cajthaml

The article deals with a comprehensive information system of the historic Vltava River valley. This system contains a number of resources, which are described. For old maps, which are the basis of the whole system, their georeferencing and potential prob... ver más

Revista: ISPRS International Journal of Geo-Information Formato: Electrónico

Tabla de contenido: Vol: 11 Num: 0 Par: 1 Año: 2022

A Study of Multilingual Toxic Text Detection Approaches under Imbalanced Sample Distribution

Acceso

en línea

Guizhe Song, Degen Huang and Zhifeng Xiao

Multilingual characteristics, lack of annotated data, and imbalanced sample distribution are the three main challenges for toxic comment analysis in a multilingual setting. This paper proposes a multilingual toxic text classifier which adopts a novel fus... ver más

Revista: Information Formato: Electrónico

Tabla de contenido: Vol: 12 Num: 0 Par: 5 Año: 2021

Vectorization of Floor Plans Based on EdgeGAN

Acceso

en línea

Shuai Dong, Wei Wang, Wensheng Li and Kun Zou

A 2D floor plan (FP) often contains structural, decorative, and functional elements and annotations. Vectorization of floor plans (VFP) is an object detection task that involves the localization and recognition of different structural primitives in 2D FP... ver más

Revista: Information Formato: Electrónico

Tabla de contenido: Vol: 12 Num: 0 Par: 5 Año: 2021

Identification of authorship of Ukrainian-language texts of journalistic style using neural networks

Acceso

en línea

Maksym Lupei,Alexander Mitsa,Volodymyr Repariuk,Vasyl Sharkan Pág. 30 - 36

The problem of development of an effective method for text authorship identification (on the material of publications of well-known Ukrainian journalists) is explored. Most existing methods require text preprocessing, which entails new costs when solving... ver más

Revista: Eastern-European Journal of Enterprise Technologies Formato: Electrónico

Tabla de contenido: Vol: 1 Num: 2 Par: PP Año: 2020

Building a Chatbot: Architecture Models and Text Vectorization Methods

Acceso

en línea

Anna V. Chizhik,Yulia A. Zherebtsova Pág. 50 - 56

In this paper, we review the recent progress in developing intelligent conversational agents (or chatbots), its current architectures (rule-based, retrieval based and generative-based models) and discuss the main advantages and disadvantages of the appro... ver más

Revista: International Journal of Open Information Technologies Formato: Electrónico

Tabla de contenido: Vol: 8 Num: 7 Par: 0 Año: 2020

« Anterior Página: 1 de 2 Siguiente »