Measurement of Text Similarity: A Survey

Jiapeng Wang and Yihong Dong

Resumen

Text similarity measurement is the basis of natural language processing tasks, which play an important role in information retrieval, automatic question answering, machine translation, dialogue systems, and document matching. This paper systematically combs the research status of similarity measurement, analyzes the advantages and disadvantages of current methods, develops a more comprehensive classification description system of text similarity measurement algorithms, and summarizes the future development direction. With the aim of providing reference for related research and application, the text similarity measurement method is described by two aspects: text distance and text representation. The text distance can be divided into length distance, distribution distance, and semantic distance; text representation is divided into string-based, corpus-based, single-semantic text, multi-semantic text, and graph-structure-based representation. Finally, the development of text similarity is also summarized in the discussion section.

Palabras claves

text similarity measure - text distance - text representation

Acceso

PÁGINAS

pp. 0 - 0

NÚMERO

Volumen: 11 Parte: 9 (2020)

MATERIAS

INGENIERÍA Y CONSTRUCCIÓN CIVIL
TECNOLOGÍA

REVISTAS SIMILARES

Information
Applied Sciences
International Journal of Open Information Technologies

DOI

https://doi.org/10.3390/info11090421

Artículos similares

A robust way of steganography by using blocks of an image in spatial domain

Acceso

Ladeh S. Abdulraman, Sheerko R. Hma Salah, Halgurd S. Maghdid, Azhin T. Sabir Pág. 1 - 7

Steganography is a way to convey secret communication, with rapid electronic communication and high demand of using the internet, steganography has become a wide field of research and discussion. In this paper a new approach for hiding information in cov... ver más

Revista: Innovaciencia

Topic-based Social Influence Measurement for Social Networks

Acceso

Asso Hamzehei,Shanqing Jiang,Danai Koutra,Raymond Wong,Fang Chen

Social science studies have acknowledged that the social influence of individuals is not identical. Social networks structure and shared text can reveal immense information about users, their interests, and topic-based influence. Although some studies ha... ver más

Revista: Australasian Journal of Information Systems

Revistas destacadas

Acceso directo a los números publicados en la revista Infrastructures

Infrastructures

Acceso directo a los números publicados en la revista Informed Infraestructure

Informed Infraestructure

Acceso directo a los números publicados en la revista BiT

Acceso directo a los números publicados en la revista Revista de la Construcción

Revista de la Construcción

Ver todas las revistas