Journal: Buildings, Vol. 13, No. 10 (2023)
ARTICLE
TITLE

Research on the Application and Performance Optimization of GPU Parallel Computing in Concrete Temperature Control Simulation

Xuerui Zheng    
Jiping Jin    
Yajun Wang    
Min Yuan and Sheng Qiang    

Abstract

As engineering technology advances, projects demand simulations of greater accuracy and scale, and traditional serial programs are no longer efficient enough to meet those demands. Reducing the run time of the temperature control simulation program therefore matters in practice: it enables real-time simulation of the temperature and stress fields, which in turn supports more reasonable temperature control and crack prevention measures. To address this, GPU parallel computing is introduced into the temperature control simulation program for mass concrete and then optimized. An improved indicator formula for analyzing GPU parallel algorithms is proposed that accounts for GPU clock rate, number of cores, parallel overhead, and parallel region, making up for the shortcoming of traditional formulas, which focus only on time. According to this formula, when there are enough threads, the parallel effect is limited by the size of the parallel domain, and when the parallel domain is large enough, efficiency is limited by the parallel overhead and the clock rate. The paper also studies the optimal kernel execution configuration. Shared memory is used to improve memory access efficiency by 155%. After bank conflicts were resolved, a speedup of 437.5× was achieved in the matrix transpose subroutine of the solver. Using CUDA streams, data access and logical operations run asynchronously in parallel on the GPU, which overlaps part of the data access time. On top of GPU parallelism, asynchronous parallelism can double the computing efficiency. Compared with the serial program, the GPU asynchronous parallel program achieves a speedup of 61.42× for inner product matrix multiplication.
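The abstract does not say how the bank conflicts in the shared-memory transpose were removed. A common remedy is to pad each row of the shared-memory tile by one word, and the sketch below only illustrates why that works; it is not the authors' implementation. It assumes 32 banks of 4-byte words, a 32-thread warp, and a 32 × 32 tile, and the function name `max_conflict_degree` is hypothetical.

```python
from collections import Counter

NUM_BANKS = 32  # assumption: 32 shared-memory banks, one 4-byte word each
WARP_SIZE = 32  # assumption: 32 threads per warp
TILE = 32       # assumption: 32 x 32 tile of 4-byte elements


def max_conflict_degree(row_pitch: int) -> int:
    """Worst-case number of threads in one warp that hit the same bank
    when the warp reads one column of a tile stored with the given
    row pitch (in words)."""
    worst = 0
    for col in range(TILE):
        # Thread t of the warp reads element tile[t][col], whose word
        # offset in shared memory is t * row_pitch + col.
        banks = Counter(
            (t * row_pitch + col) % NUM_BANKS for t in range(WARP_SIZE)
        )
        worst = max(worst, max(banks.values()))
    return worst


unpadded = max_conflict_degree(TILE)    # row pitch of 32 words
padded = max_conflict_degree(TILE + 1)  # row pitch of 33 words (one pad word)
print(unpadded, padded)  # prints "32 1"
```

With a pitch of 32 words, every element of a column falls in the same bank, so the 32 reads are serialized. With a pitch of 33 words, consecutive rows shift by one bank and the column read is conflict-free.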
This study also proposes a theoretical formula for the data access overlap rate to guide the choice of the number of CUDA streams and achieve optimal computing conditions. The GPU parallel program, compiled and optimized on the CUDA Fortran platform, effectively improves the computational efficiency of the concrete temperature control simulation program and better serves engineering computing.
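The paper's overlap-rate formula is not reproduced in this abstract. As a rough illustration of why the number of streams has an optimum, the sketch below uses a simple two-stage pipeline model, which is an assumption and not the authors' formula. The work is split into equal chunks, one per stream; one copy engine and one compute engine run concurrently; and each chunk pays a fixed overhead on the copy stage. The names `pipelined_time`, `hidden_fraction`, and `overhead` are hypothetical.

```python
def pipelined_time(t_copy: float, t_kernel: float, n_streams: int,
                   overhead: float = 0.0) -> float:
    """Makespan of a two-stage pipeline (copy, then kernel) when the
    work is split into n_streams equal chunks.

    Assumptions: one copy engine and one compute engine, each handling
    one chunk at a time; `overhead` is a fixed per-chunk cost added to
    the copy stage.
    """
    copy_chunk = t_copy / n_streams + overhead
    kernel_chunk = t_kernel / n_streams
    copy_done = 0.0
    kernel_done = 0.0
    for _ in range(n_streams):
        copy_done += copy_chunk
        # A kernel starts once its data has arrived and the previous
        # kernel has finished.
        kernel_done = max(copy_done, kernel_done) + kernel_chunk
    return kernel_done


def hidden_fraction(t_copy: float, t_kernel: float, n_streams: int,
                    overhead: float = 0.0) -> float:
    """Share of the copy time hidden behind computation in this model."""
    serial = t_copy + t_kernel
    return (serial - pipelined_time(t_copy, t_kernel, n_streams, overhead)) / t_copy


# Hypothetical timings in arbitrary units, chosen only for illustration.
for n in (1, 2, 4, 8):
    print(n, pipelined_time(4.0, 4.0, n, overhead=0.5))
```

In this model, more streams hide more of the copy time until the accumulated per-chunk overhead outweighs the gain, so the best stream count lies between the extremes. Real hardware behaviour depends on the device and the transfer sizes.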

Similar articles

       
 
Christine Dewi, Danny Manongga, Hendry, Evangs Mailoa and Kristoko Dwi Hartomo    
Face mask detection is a technological application that employs computer vision methodologies to ascertain the presence or absence of a face mask on an individual depicted in an image or video. This technology gained significant attention and adoption du...

 
Javid Misirli and Emiliano Casalicchio    
The Internet of Things (IoT) uptake brought a paradigm shift in application deployment. Indeed, IoT applications are not centralized in cloud data centers, but the computation and storage are moved close to the consumers, creating a computing continuum b...
Journal: Future Internet

 
Sarah A. Chauncey and H. Patricia McKenna    
The purpose of this study is to advance conceptual understandings of the cognitive flexibility construct, in support of creativity and innovation in smart city civic spaces, employing the use of large language model artificial intelligence chatbots such ...
Journal: Urban Science

 
Eliza Gabriela Brettfeld, Daria Gabriela Popa, Tanase Dobre, Corina Ioana Moga, Diana Constantinescu-Aruxandei and Florin Oancea    
In this study, we investigated the use of functionalized deep eutectic solvents (DESs) as a medium for CO2 capture integrated with CO2 desorption and biofixation in microalgal culture, as an approach for carbon capture, utilization, and storage (CCUS). T...

 
Jian Yang, Ming Sun, Guohuang Yao, Haizhu Guo and Rumian Zhong    
This study explores an advanced prefabricated composite structure, namely ECC/RC composite shear walls with enhanced seismic performance. This performance enhancement is attributed to the strategic use of engineered cementitious composites (ECC) known fo...
Journal: Buildings