Applied Sciences, Vol. 10, No. 23 (2020)

A New Big Data Benchmark for OLAP Cube Design Using Data Pre-Aggregation Techniques

Roberto Tardío, Alejandro Maté and Juan Trujillo

Abstract

In recent years, several new technologies have enabled OLAP processing over Big Data sources. Among these technologies, we highlight those that support data pre-aggregation because of their demonstrated query performance. One example is Apache Kylin, a Hadoop-based technology that supports sub-second queries over fact tables with billions of rows combined with ultra-high-cardinality dimensions. However, taking advantage of data pre-aggregation techniques to design analytic models for Big Data OLAP is not a trivial task. It requires advanced knowledge of both the underlying technologies and user querying patterns. A poor OLAP cube design significantly affects several key performance metrics, including (i) the analytic capabilities of the cube (the time needed to answer a query and whether it can be answered at all), (ii) the size of the OLAP cube, and (iii) the time required to build the OLAP cube. Therefore, in this paper we (i) propose a benchmark to help Big Data OLAP designers choose the most suitable cube design for their goals, (ii) identify and describe the main requirements and trade-offs for effectively designing a Big Data OLAP cube that takes advantage of data pre-aggregation techniques, and (iii) validate our benchmark in a case study.
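The trade-off described in the abstract can be illustrated with a minimal Python sketch. It is a toy model only: it does not use Apache Kylin's API and is not the benchmark proposed in the paper. The fact table, the dimension names, and the helper functions (`build_cuboid`, `build_cube`, `cube_size`, `answer`) are all hypothetical and invented for illustration. The sketch pre-aggregates a small fact table into "cuboids" (one per chosen combination of dimensions) and compares a full design, with every combination, against a pruned design.

```python
from collections import defaultdict
from itertools import combinations

# Hypothetical toy fact table: (country, city, product, sales).
DIMS = ("country", "city", "product")
FACTS = [
    ("ES", "Alicante", "A", 10),
    ("ES", "Alicante", "B", 5),
    ("ES", "Madrid", "A", 7),
    ("FR", "Paris", "A", 3),
    ("FR", "Paris", "B", 8),
]


def build_cuboid(facts, dims):
    """Pre-aggregate SUM(sales) grouped by the given dimensions."""
    idx = [DIMS.index(d) for d in dims]
    agg = defaultdict(int)
    for row in facts:
        agg[tuple(row[i] for i in idx)] += row[-1]
    return dict(agg)


def build_cube(facts, cuboid_dims):
    """Build one cuboid per requested dimension combination."""
    return {tuple(d): build_cuboid(facts, d) for d in cuboid_dims}


def all_cuboids(dims):
    """Every subset of the dimensions: 2**len(dims) combinations."""
    return [c for r in range(len(dims) + 1) for c in combinations(dims, r)]


def cube_size(cube):
    """Total number of pre-aggregated rows stored across all cuboids."""
    return sum(len(cuboid) for cuboid in cube.values())


def answer(cube, group_by):
    """Answer a GROUP BY from the smallest covering cuboid, or None."""
    covering = [d for d in cube if set(group_by) <= set(d)]
    if not covering:
        return None  # this design cannot answer the query
    best = min(covering, key=lambda d: len(cube[d]))
    idx = [best.index(g) for g in group_by]
    out = defaultdict(int)
    for key, value in cube[best].items():
        out[tuple(key[i] for i in idx)] += value
    return dict(out)


full = build_cube(FACTS, all_cuboids(DIMS))
pruned = build_cube(FACTS, [("country",), ("country", "product")])

print(len(full), cube_size(full))      # cuboid count and stored rows, full design
print(len(pruned), cube_size(pruned))  # same for the pruned design
print(answer(pruned, ("country",)))    # covered by the pruned design
print(answer(pruned, ("city",)))       # not covered: returns None
```

In this toy setting, the full design stores more rows but can answer any grouping, while the pruned design is smaller but cannot group by city. Build time is not measured here; with realistic data volumes it would be expected to grow with the number and size of the cuboids.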
