JOURNAL
Computers

ARTICLE

TITLE

An Improved Retrievability-Based Cluster-Resampling Approach for Pseudo Relevance Feedback

Shariq Bashir

Abstract

Cluster-based pseudo-relevance feedback (PRF) is an effective approach for finding relevant documents to use as relevance feedback. The standard approach constructs clusters for PRF solely on the basis of high similarity between retrieved documents. This works well as long as the retrieval bias of the retrieval model does not affect the retrievability of documents. In our experiments, we observed that when a collection exhibits retrieval bias, the highly retrievable documents in the clusters are frequently retrieved at top positions for most queries, and these documents cause the relevance feedback to drift away from relevant documents. To reduce this retrieval-bias noise, we enhance the standard cluster construction approach by constructing clusters on the basis of both high similarity and retrievability. We call this retrievability and cluster-based PRF. This enhanced approach keeps in the clusters only those documents that are not frequently retrieved due to retrieval bias. Although this approach improves effectiveness, it penalizes highly retrievable documents even if they are the most relevant to the clusters. To handle this problem, in a second approach, we extend the basic retrievability concept by mining frequent neighbors of the clusters. The frequent neighbors approach keeps in the clusters only those documents that are frequently retrieved together with other neighbors of the clusters and infrequently retrieved together with documents that are not part of the clusters. Experimental results show that the two proposed extensions help identify relevant documents for relevance feedback and increase the effectiveness of queries.
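
The two filtering ideas in the abstract can be illustrated with a small Python sketch. It is not the author's implementation. It assumes the common cumulative definition of retrievability (the number of queries for which a document appears within a rank cutoff). The function names, the thresholds, the toy rankings, and the co-retrieval ratio used for the frequent-neighbors idea are all hypothetical choices made for illustration.

```python
from collections import Counter

def retrievability(ranked_lists, cutoff):
    """Count, per document, the queries that rank it within the top `cutoff`.
    (Assumed cumulative retrievability measure; not taken from the abstract.)"""
    counts = Counter()
    for ranking in ranked_lists.values():
        counts.update(ranking[:cutoff])
    return counts

def filter_by_retrievability(cluster, scores, max_score):
    """First idea: drop cluster members that are retrieved too often.
    `max_score` is a hypothetical threshold."""
    return [doc for doc in cluster if scores[doc] <= max_score]

def neighbor_ratio(doc, cluster, ranked_lists, cutoff):
    """Second idea (illustrative): the share of a document's co-retrieved
    documents that belong to the same cluster."""
    members = set(cluster) - {doc}
    inside = outside = 0
    for ranking in ranked_lists.values():
        top = ranking[:cutoff]
        if doc not in top:
            continue
        for other in top:
            if other == doc:
                continue
            if other in members:
                inside += 1
            else:
                outside += 1
    total = inside + outside
    return inside / total if total else 0.0

# Toy rankings for three hypothetical queries.
ranked_lists = {
    "q1": ["d1", "d2", "d3"],
    "q2": ["d1", "d4", "d2"],
    "q3": ["d1", "d5", "d3"],
}
cluster = ["d1", "d2", "d4"]

scores = retrievability(ranked_lists, cutoff=2)
kept = filter_by_retrievability(cluster, scores, max_score=2)
ratio_d1 = neighbor_ratio("d1", cluster, ranked_lists, cutoff=2)
```

In this toy data, `d1` is retrieved in the top two for all three queries, so the plain retrievability filter removes it from the cluster. Its co-retrieval ratio is nevertheless 2/3, because it mostly appears alongside other cluster members, so a neighbor-based criterion with a threshold of, say, 0.5 would keep it. This mirrors the motivation the abstract gives for the second approach.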

Keywords

document clustering - machine learning - information retrieval - pseudo-relevance feedback - query expansion - retrieval bias - retrievability measure

ISSUE

Volume: 5 Issue: 4 (2016)

SUBJECTS

CIVIL ENGINEERING AND CONSTRUCTION
TECHNOLOGY

DOI

https://doi.org/10.3390/computers5040029
