Information / Vol: 10 Par: 10 (2019)
ARTICLE
TITLE

Blind Queries Applied to JSON Document Stores

Stefania Marrara, Mauro Pelucchi and Giuseppe Psaila

Abstract

Social media, Web portals and, in general, information systems offer their own Application Programming Interfaces (APIs), which provide large data sets concerning every aspect of day-to-day life. APIs usually provide data sets as collections of JSON documents. The heterogeneous structure of JSON documents returned by different APIs constitutes a barrier to effectively querying and analyzing these data sets. The adoption of NoSQL document stores, such as MongoDB, is useful for gathering these data sets, but does not solve the problem of querying the final heterogeneous repository. The aim of this paper is to provide analysts with a tool, named HammerJDB, that allows blind querying of collections of JSON documents within a NoSQL document database. The underlying idea is that users may know the application domain but may not be aware of the actual structures of the documents stored in the database; the tool for blind querying tries to bridge this gap by adopting a query rewriting mechanism. This paper is an evolution of a technique for blind querying Open Data portals and of its implementation within the Hammer framework, presented in previous work. In this paper, we extend that approach to query a NoSQL document database by evolving the Hammer framework into the HammerJDB framework, which is able to work on MongoDB databases. The effectiveness of the new approach is evaluated on a data set (derived from a real-life one) containing job-vacancy ads collected from European job portals.
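
To make the notion of blind querying through query rewriting more concrete, the following is a minimal sketch and not the HammerJDB algorithm, which the abstract does not detail. It assumes that a query is a set of field/value equality conditions, that only top-level field names are considered, and that string similarity from Python's standard `difflib` module is an acceptable stand-in for whatever term matching the real framework uses. The function names, the sample documents and the threshold of 0.6 are all hypothetical. An in-memory list of dictionaries stands in for a MongoDB collection.

```python
from difflib import SequenceMatcher
from itertools import product


def field_names(docs):
    """Collect every top-level field name that occurs in the documents."""
    names = set()
    for doc in docs:
        names.update(doc.keys())
    return names


def similarity(a, b):
    """Case-insensitive string similarity in [0, 1]."""
    return SequenceMatcher(None, a.lower(), b.lower()).ratio()


def rewrite_query(query, docs, threshold=0.6):
    """Rewrite a query whose field names are guesses into queries
    over field names that actually occur in the documents.

    `query` maps a guessed field name to the value it must equal.
    Returns one rewritten query per combination of similar fields.
    """
    actual = sorted(field_names(docs))
    alternatives = []
    for guessed, value in query.items():
        # Keep the actual fields whose name is close to the guess.
        similar = [f for f in actual if similarity(guessed, f) >= threshold]
        alternatives.append([(f, value) for f in similar])
    # Cartesian product: one actual field per guessed field.
    return [dict(combo) for combo in product(*alternatives)]


def blind_query(query, docs, threshold=0.6):
    """Return the documents that satisfy at least one rewritten query."""
    rewritten = rewrite_query(query, docs, threshold)
    return [
        doc for doc in docs
        if any(all(doc.get(f) == v for f, v in q.items()) for q in rewritten)
    ]


# Hypothetical job-vacancy documents with heterogeneous structure.
docs = [
    {"job_title": "Data Analyst", "country": "IT"},
    {"title": "Data Analyst", "nation": "DE"},
    {"jobTitle": "Developer", "country": "IT"},
]

# The user guesses the field name "jobtitle", which no document uses.
matches = blind_query({"jobtitle": "Data Analyst"}, docs)
```

In this sketch the guessed field `jobtitle` is rewritten to `job_title`, `jobTitle` and `title`, so the first two documents are retrieved even though neither contains a field with the guessed name. Against a real MongoDB collection, each rewritten query could be passed as a filter document to a find operation instead of being evaluated in memory.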

Similar articles

Diego Garat and Dina Wonsever    
In order to provide open access to data of public interest, it is often necessary to perform several data curation processes. In some cases, such as biological databases, curation involves quality control to ensure reliable experimental support for biolo...
Journal: Information

 
Szabolcs Kocsis Szürke, Gergo Sütheö, Antal Apagyi, István Lakatos and Szabolcs Fischer    
Vehicle safety risk can be decreased by diagnosing the lithium-ion battery system of electric road vehicles. Real-time cell diagnostics can avoid unexpected occurrences. However, lithium-ion batteries in electric vehicles can significantly differ in desi...
Journal: Algorithms

 
Aleksandar Tošić, Niki Hrovatin and Jernej Vičič
In the past two decades, technological advancements in smart devices, IoT, and smart sensors have paved the way towards numerous implementations of indoor location systems. Indoor location has many important applications in numerous fields, including str...
Journal: Applied Sciences

 
Priyank Kalgaonkar and Mohamed El-Sharkawy    
Artificial Intelligence (AI) combines computer science and robust datasets to mimic natural intelligence demonstrated by human beings to aid in problem-solving and decision-making involving consciousness up to a certain extent. From Apple's virtual perso...

 
Paolo Fosci and Giuseppe Psaila    
How to exploit the incredible variety of JSON data sets currently available on the Internet, for example, on Open Data portals? The traditional approach would require getting them from the portals, then storing them into some JSON document store and inte...
Journal: Information