Redirigiendo al acceso original de articulo en 15 segundos...
Inicio  /  Information  /  Vol: 9 Par: 5 (2018)  /  Artículo
ARTÍCULO
TITULO

Double Distance-Calculation-Pruning for Similarity Search

Ives Renê Venturini Pola    
Fernanda Paula Barbosa Pola and Danilo Medeiros Eler    

Resumen

Many modern applications deal with complex data, where retrieval by similarity plays an important role. Complex data main comparison mechanisms are based on similarity predicates. They are usually immersed in metric spaces where distance functions are employed to express the similarity and a lower bound property is usually employed to prevent distance calculations. Retrieval by similarity is implemented by unary and binary operators. Most of the studies aimed at improving the efficiency of unary operators, either by using metric access methods or mathematical properties to prune parts of the search space during query answering. Studies on binary operators to solve similarity joins aim to improve efficiency and most of them use only the metric lower bound property for pruning. However, they are dependent on the query parameters, such as the range radius. In this paper, we propose a generic concept that uses both lower and upper bound properties based on the Metric Spaces Theory to increase the avoidance of element comparisons. The concept can be applied on any existing similarity retrieval method. We analyzed the prunability power increase and show an example of its application on classical join nested loops algorithms. Practical evaluation over both synthetic and real data sets shows that our method reduced the number of distance evaluations on similarity joins.

 Artículos similares

       
 
Liliya Demidova, Dmitry Zhukov, Elena Andrianova and Vladimir Kalinin    
To solve the problem of text clustering according to semantic groups, we suggest using a model of a unified lexico-semantic bond between texts and a similarity matrix based on it. Using lexico-semantic analysis methods, we can create ?term?document? matr... ver más
Revista: Algorithms

 
Chang Liu, Shize Zhang, Lufang Cao and Bin Lin    
Automatic identification system (AIS) data record a ship?s position, speed over ground (SOG), course over ground (COG), and other behavioral attributes at specific time intervals during a ship?s voyage. At present, there are few studies in the literature... ver más

 
Lukas Busch, Ruben van Heusden and Maarten Marx    
Page stream segmentation (PSS) is the task of retrieving the boundaries that separate source documents given a consecutive stream of documents (for example, sequentially scanned PDF files). The task has recently gained more interest as a result of the di... ver más
Revista: Algorithms

 
Ryeonhui Kim, Kyuseok Kim and Youngjin Lee    
Ultrasound imaging is widely used as a noninvasive lesion detection method in diagnostic medicine. Improving the quality of these ultrasound images is very important for accurate diagnosis, and deep learning-based algorithms have gained significant atten... ver más
Revista: Applied Sciences

 
Abdullah Al Foysal and Ronald Böck    
Nowadays, individuals can be overwhelmed by a huge number of documents being present in daily life. Capturing the necessary details is often a challenge. Therefore, it is rather important to summarize documents to obtain the main information quickly. The... ver más
Revista: AI