REVISTA
Information

TODAS

Redirigiendo al acceso original de articulo en 15 segundos...

Inicio / Information / Vol: 9 Par: 5 (2018) / Artículo

ARTÍCULO

TITULO

Double Distance-Calculation-Pruning for Similarity Search

Ives Renê Venturini Pola

Fernanda Paula Barbosa Pola and Danilo Medeiros Eler

Resumen

Many modern applications deal with complex data, where retrieval by similarity plays an important role. Complex data main comparison mechanisms are based on similarity predicates. They are usually immersed in metric spaces where distance functions are employed to express the similarity and a lower bound property is usually employed to prevent distance calculations. Retrieval by similarity is implemented by unary and binary operators. Most of the studies aimed at improving the efficiency of unary operators, either by using metric access methods or mathematical properties to prune parts of the search space during query answering. Studies on binary operators to solve similarity joins aim to improve efficiency and most of them use only the metric lower bound property for pruning. However, they are dependent on the query parameters, such as the range radius. In this paper, we propose a generic concept that uses both lower and upper bound properties based on the Metric Spaces Theory to increase the avoidance of element comparisons. The concept can be applied on any existing similarity retrieval method. We analyzed the prunability power increase and show an example of its application on classical join nested loops algorithms. Practical evaluation over both synthetic and real data sets shows that our method reduced the number of distance evaluations on similarity joins.

Palabras claves

information retrieval - similarity joins - metric indexing

Acceso

PÁGINAS

pp. 0 - 0

NÚMERO

Volumen: 9 Parte: 5 (2018)

MATERIAS

INGENIERÍA Y CONSTRUCCIÓN CIVIL
TECNOLOGÍA

REVISTAS SIMILARES

Information
Inteligencia Artificial
Applied Sciences

DOI

https://doi.org/10.3390/info9050124

Artículos similares

Model of Lexico-Semantic Bonds between Texts for Creating Their Similarity Metrics and Developing Statistical Clustering Algorithm

Acceso

Liliya Demidova, Dmitry Zhukov, Elena Andrianova and Vladimir Kalinin

To solve the problem of text clustering according to semantic groups, we suggest using a model of a unified lexico-semantic bond between texts and a similarity matrix based on it. Using lexico-semantic analysis methods, we can create ?term?document? matr... ver más

Revista: Algorithms

The Identification of Ship Trajectories Using Multi-Attribute Compression and Similarity Metrics

Acceso

Chang Liu, Shize Zhang, Lufang Cao and Bin Lin

Automatic identification system (AIS) data record a ship?s position, speed over ground (SOG), course over ground (COG), and other behavioral attributes at specific time intervals during a ship?s voyage. At present, there are few studies in the literature... ver más

Revista: Journal of Marine Science and Engineering

Using Deep-Learned Vector Representations for Page Stream Segmentation by Agglomerative Clustering

Acceso

Lukas Busch, Ruben van Heusden and Maarten Marx

Page stream segmentation (PSS) is the task of retrieving the boundaries that separate source documents given a consecutive stream of documents (for example, sequentially scanned PDF files). The task has recently gained more interest as a result of the di... ver más

Revista: Algorithms

A Multiscale Deep Encoder?Decoder with Phase Congruency Algorithm Based on Deep Learning for Improving Diagnostic Ultrasound Image Quality

Acceso

Ryeonhui Kim, Kyuseok Kim and Youngjin Lee

Ultrasound imaging is widely used as a noninvasive lesion detection method in diagnostic medicine. Improving the quality of these ultrasound images is very important for accurate diagnosis, and deep learning-based algorithms have gained significant atten... ver más

Revista: Applied Sciences

Who Needs External References??Text Summarization Evaluation Using Original Documents

Acceso

Abdullah Al Foysal and Ronald Böck

Nowadays, individuals can be overwhelmed by a huge number of documents being present in daily life. Capturing the necessary details is often a challenge. Therefore, it is rather important to summarize documents to obtain the main information quickly. The... ver más

Revista: AI

Revistas destacadas

Acceso directo a los números publicados en la revista Infrastructures

Infrastructures

Acceso directo a los números publicados en la revista Informed Infraestructure

Informed Infraestructure

Acceso directo a los números publicados en la revista BiT

Acceso directo a los números publicados en la revista Revista de la Construcción

Revista de la Construcción

Ver todas las revistas