Redirigiendo al acceso original de articulo en 24 segundos...
Inicio  /  Algorithms  /  Vol: 14 Par: 10 (2021)  /  Artículo
ARTÍCULO
TITULO

Efficient and Portable Distribution Modeling for Large-Scale Scientific Data Processing with Data-Parallel Primitives

Hao-Yi Yang    
Zhi-Rong Lin and Ko-Chih Wang    

Resumen

The use of distribution-based data representation to handle large-scale scientific datasets is a promising approach. Distribution-based approaches often transform a scientific dataset into many distributions, each of which is calculated from a small number of samples. Most of the proposed parallel algorithms focus on modeling single distributions from many input samples efficiently, but these may not fit the large-scale scientific data processing scenario because they cannot utilize computing resources effectively. Histograms and the Gaussian Mixture Model (GMM) are the most popular distribution representations used to model scientific datasets. Therefore, we propose the use of multi-set histogram and GMM modeling algorithms for the scenario of large-scale scientific data processing. Our algorithms are developed by data-parallel primitives to achieve portability across different hardware architectures. We evaluate the performance of the proposed algorithms in detail and demonstrate use cases for scientific data processing.

 Artículos similares

       
 
Qian Huang, Chenghung Hsieh, Jiaen Hsieh and Chunchen Liu    
Artificial intelligence (AI) is fundamentally transforming smart buildings by increasing energy efficiency and operational productivity, improving life experience, and providing better healthcare services. Sudden Infant Death Syndrome (SIDS) is an unexpe... ver más
Revista: AI

 
Sefrani Isdarmayani Siregar, Syamsyarief Baqaruzi, Rachmad Hidayatullah     Pág. 307 - 312
As technology grows, so does the consumption of energy, especially electricity. Today, electricity has become the most used energy worldwide and the demand keeps on increasing every year. According to PLN annual report the highest electricity consumption... ver más

 
Yinan Hu, Geoffrey Z. Iwata, Lykourgos Bougas, John W. Blanchard, Arne Wickenbrock, Gerhard Jakob, Stephan Schwarz, Clemens Schwarzinger, Alexej Jerschow and Dmitry Budker    
Solid-state battery technology is motivated by the desire to deliver flexible power storage in a safe and efficient manner. The increasingly widespread use of batteries from mass production facilities highlights the need for a rapid and sensitive diagnos... ver más
Revista: Applied Sciences

 
Amirsalar Mansouri, Sanjay P. Singh and Khalid Sayood    
Epilepsy is one of the three most prevalent neurological disorders. A significant proportion of patients suffering from epilepsy can be effectively treated if their seizures are detected in a timely manner. However, detection of most seizures requires th... ver más
Revista: Algorithms

 
Jason J. Quinlan, Ahmed H. Zahran and Cormac J. Sreenan    
When we couple the rise in video streaming with the growing number of portable devices (smart phones, tablets, laptops), we see an ever-increasing demand for high-definition video online while on the move. Wireless networks are inherently characterised b... ver más
Revista: Information