29 Articles

BibRank: Automatic Keyphrase Extraction Platform Using Metadata

Abdelrhman Eldallal and Eduard Barbu

Automatic Keyphrase Extraction involves identifying essential phrases in a document. These keyphrases are crucial in various tasks, such as document classification, clustering, recommendation, indexing, searching, summarization, and text simplification. ...

Journal: Information Format: Electronic

Table of contents: Vol: 14 Num: 0 Par: 10 Year: 2023

Analyzing Indo-European Language Similarities Using Document Vectors

Samuel R. Schrader and Eren Gultepe

The evaluation of similarities between natural languages often relies on prior knowledge of the languages being studied. We describe three methods for building phylogenetic trees and clustering languages without the use of language-specific information. ...

Journal: Informatics Format: Electronic

Table of contents: Vol: 10 Num: 0 Par: 4 Year: 2023

Topic-Clustering Model with Temporal Distribution for Public Opinion Topic Analysis of Geospatial Social Media Data

Chunchun Hu, Qin Liang, Nianxue Luo and Shuixiang Lu

Analysis of the spatiotemporal distribution of online public opinion topics can help understand the hotspots of public concern. The topic model is employed widely in public opinion topic clustering for social media data. In order to handle topic-clusteri...

Journal: ISPRS International Journal of Geo-Information Format: Electronic

Table of contents: Vol: 12 Num: 0 Par: 7 Year: 2023

Using Deep-Learned Vector Representations for Page Stream Segmentation by Agglomerative Clustering

Lukas Busch, Ruben van Heusden and Maarten Marx

Page stream segmentation (PSS) is the task of retrieving the boundaries that separate source documents given a consecutive stream of documents (for example, sequentially scanned PDF files). The task has recently gained more interest as a result of the di...

Journal: Algorithms Format: Electronic

Table of contents: Vol: 16 Num: 0 Par: 5 Year: 2023

Model of Lexico-Semantic Bonds between Texts for Creating Their Similarity Metrics and Developing Statistical Clustering Algorithm

Liliya Demidova, Dmitry Zhukov, Elena Andrianova and Vladimir Kalinin

To solve the problem of text clustering according to semantic groups, we suggest using a model of a unified lexico-semantic bond between texts and a similarity matrix based on it. Using lexico-semantic analysis methods, we can create "term-document" matr...

Journal: Algorithms Format: Electronic

Table of contents: Vol: 16 Num: 0 Par: 4 Year: 2023

Using topic modeling for communities clusterization in the VKontakte social network

Sergey Gorshkov, Eugene Ilyushin, Anastasia Chernysheva, Viacheslav Goiko, Dmitry Namiot pp. 12-17

Topic modeling is one of the most widely used methods in text analysis. It can be used to select topics as well as to find the topics distributed in each document from the corpus. In this article, we present a method for clustering co...

Journal: International Journal of Open Information Technologies Format: Electronic

Table of contents: Vol: 9 Num: 5 Par: 0 Year: 2021

Nature-Inspired Optimization Algorithms for Text Document Clustering: A Comprehensive Analysis

Laith Abualigah, Amir H. Gandomi, Mohamed Abd Elaziz, Abdelazim G. Hussien, Ahmad M. Khasawneh, Mohammad Alshinwan and Essam H. Houssein

Text clustering is an efficient unsupervised learning technique used to partition a large number of text documents into clusters, such that each cluster contains similar documents and different clusters contain dissimilar documents. Na...

Journal: Algorithms Format: Electronic

Table of contents: Vol: 13 Num: 0 Par: 12 Year: 2020

Unsupervised Text Feature Selection Using Memetic Dichotomous Differential Evolution

Ibraheem Al-Jadir, Kok Wai Wong, Chun Che Fung and Hong Xie

Feature Selection (FS) methods have been studied extensively in the literature, and they are a crucial component in machine learning techniques. However, unsupervised text feature selection has not been well studied in document clustering problems. Feat...

Journal: Algorithms Format: Electronic

Table of contents: Vol: 13 Num: 0 Par: 6 Year: 2020

Development of methods for pre-clustering and virtual merging of short documents for building domain dictionaries

Oleksii Kungurtsev, Svitlana Zinovatna, Iana Potochniak, Nataliia Novikova pp. 39-47

The aim of the research is to improve the quality of domain dictionaries by expanding the corpus of documents under study with short documents. A document model is proposed that makes it possible to define a short document and the need to combine it with other ...

Journal: Eastern-European Journal of Enterprise Technologies Format: Electronic

Table of contents: Vol: 5 Num: 2 Par: PP Year: 2020

Exploring Technology Influencers from Patent Data Using Association Rule Mining and Social Network Analysis

Pranomkorn Ampornphan and Sutep Tongngam

A patent is an important document issued by the government to protect inventions or product design. Inventions consist of mechanical structures, production processes, quality improvements of products, and so on. Generally, goods or appliances in everyday...

Journal: Information Format: Electronic

Table of contents: Vol: 11 Num: 0 Par: 6 Year: 2020
