Redirigiendo al acceso original de articulo en 22 segundos...
ARTÍCULO
TITULO

Performance Limits Study of Stencil Codes on Modern GPGPUs

Ilya S. Pershin    
Vadim D. Levchenko    
Anastasia Y. Perepelkina    

Resumen

We study the performance limits of different algorithmic approaches to the implementation of a sample problem of wave equation solution with a cross stencil scheme. With this, we aim to find the highest limit of the achievable performance efficiency for stencil computing.To estimate the limits, we use a quantitative Roofline model to make a thorough analysis of the performance bottlenecks and develop the model further to account for the latency of different levels of GPU memory. These estimates provide an incentive to use spatial and temporal blocking algorithms. Thus, we study stepwise, domain decomposition, and domain decomposition with halo algorithms in that order. The knowledge of the limit incites the motivation to optimize the implementation. This led to the analysis of the block synchronization methods in CUDA, which is also provided in the text.  After all optimizations, we have achieved 90% of the peak performance, which amounts to more than 1 trillion cell updates per second on one consumer level GPU device.

 Artículos similares

       
 
Qianyang Li and Xingjun Zhang    
For time series forecasting, multivariate grey models are excellent at handling incomplete or vague information. The GM(1, N) model represents this group of models and has been widely used in various fields. However, constructing a meaningful GM(1, N) mo... ver más
Revista: Applied Sciences

 
Yoohan Ma, Geon Woo Sim, Sungjin Jo, Dong Choon Hyun, Jae-Seung Roh, Dongwook Ko and Jongbok Kim    
Flexible transparent electrodes are integral to the advancement of flexible optoelectronic devices such as flexible displays and solar cells. However, indium tin oxide (ITO), a traditional material used in transparent electrodes, exhibits a significant i... ver más
Revista: Applied Sciences

 
Sofía Ramos-Pulido, Neil Hernández-Gress and Gabriela Torres-Delgado    
Current research on the career satisfaction of graduates limits educational institutions in devising methods to attain high career satisfaction. Thus, this study aims to use data science models to understand and predict career satisfaction based on infor... ver más
Revista: Informatics

 
Sergejus Lebedevas and Tomas Cepaitis    
The decarbonization problem of maritime transport and new restrictions on CO2 emissions (MARPOL Annex VI Chapter 4, COM (2021)562) have prompted the development and practical implementation of new decarbonization solutions. One of them, along with the us... ver más

 
Alexios Alexiou, Ioannis Kolias, Nikolaos Aretakis and Konstantinos Mathioudakis    
An approach for preliminary aero-engine design, incorporating a mean-line code for the design of axial-flow, multi-stage compressors, is presented. The compressor mean-line code is developed and integrated within a framework for the preliminary design an... ver más
Revista: Aerospace