ARTÍCULO
TITULO

State of the Art and Future Trends in Data Reduction for High-Performance Computing

Kira Duwe    
Jakob Lüttgau    
Georgiana Mania    
Jannek Squar    
Anna Fuchs    
Michael Kuhn    
Eugen Betke    
Thomas Ludwig    

Resumen

Research into data reduction techniques has gained popularity in recent years as storage capacity and performance become a growing concern. This survey paper provides an overview of leveraging points found in high-performance computing (HPC) systems and suitable mechanisms to reduce data volumes. We present the underlying theories and their application throughout the HPC stack and also discuss related hardware acceleration and reduction approaches. After introducing relevant use-cases, an overview of modern lossless and lossy compression algorithms and their respective usage at the application and file system layer is given. In anticipation of their increasing relevance for adaptive and in situ approaches, dimensionality reduction techniques are summarized with a focus on non-linear feature extraction. Adaptive approaches and in situ compression algorithms and frameworks follow. The key stages and new opportunities to deduplication are covered next. An unconventional but promising method is recomputation, which is proposed at last. We conclude the survey with an outlook on future developments.

 Artículos similares

       
 
Ilia Zaznov, Julian Martin Kunkel, Atta Badii and Alfonso Dufour    
This paper introduces a novel deep learning approach for intraday stock price direction prediction, motivated by the need for more accurate models to enable profitable algorithmic trading. The key problems addressed are effectively modelling complex limi... ver más
Revista: Applied Sciences

 
Roman Major, Maciej Gawlikowski, Marcin Surmiak, Karolina Janiczak, Justyna Wiecek, Przemyslaw Kurtyka, Martin Schwentenwein, Ewa Jasek-Gajda, Magdalena Kopernik and Juergen M. Lackner    
A major medical problem of state-of-the-art heart ventricular assist devices (LVADs) is device-induced thrombus formation due to inadequate blood-flow dynamics generated by the blood pump rotor. The latter is a highly complex device, with difficulties du... ver más
Revista: Applied Sciences

 
Norah Fahd Alhussainan, Belgacem Ben Youssef and Mohamed Maher Ben Ismail    
Brain tumor diagnosis traditionally relies on the manual examination of magnetic resonance images (MRIs), a process that is prone to human error and is also time consuming. Recent advancements leverage machine learning models to categorize tumors, such a... ver más
Revista: Computation

 
Zahid Masood, Muhammad Usama, Shahroz Khan, Konstantinos Kostas and Panagiotis D. Kaklis    
Generative models offer design diversity but tend to be computationally expensive, while non-generative models are computationally cost-effective but produce less diverse and often invalid designs. However, the limitations of non-generative models can be... ver más

 
Sharoon Saleem, Fawad Hussain and Naveed Khan Baloch    
Network on Chip (NoC) has emerged as a potential substitute for the communication model in modern computer systems with extensive integration. Among the numerous design challenges, application mapping on the NoC system poses one of the most complex and d... ver más
Revista: Algorithms