Applied Sciences, Vol. 9, Issue 5 (2019)
Title

SURAA: A Novel Method and Tool for Loadbalanced and Coalesced SpMV Computations on GPUs

Thaha Muhammed, Rashid Mehmood, Aiiad Albeshri and Iyad Katib

Abstract

Sparse matrix-vector (SpMV) multiplication is a vital building block for numerous scientific and engineering applications. This paper proposes SURAA (which translates to "speed" in Arabic), a novel method for SpMV computations on graphics processing units (GPUs). The novelty lies in the way we group matrix rows into different segments and adaptively schedule the segments to different types of kernels. The sparse matrix data structure is created by sorting the rows of the matrix by the number of nonzero elements per row (npr) and forming segments of equal size (containing approximately an equal number of nonzero elements per row) using the Freedman–Diaconis rule. The segments are assembled into three groups based on the mean npr of the segments. For each group, we use multiple kernels to execute the group segments on different streams. Hence, the number of threads used to execute each segment is chosen adaptively. Dynamic Parallelism, available in Nvidia GPUs, is used to execute the group containing the segments with the largest mean npr, providing improved load balancing and coalesced memory access, and hence more efficient SpMV computations on GPUs. SURAA therefore minimizes the adverse effects of npr variance by distributing the load uniformly using equal-sized segments. We implement the SURAA method as a tool and compare its performance with the de facto best commercial (cuSPARSE) and open source (CUSP, MAGMA) tools using widely used benchmarks comprising 26 high-npr-variance matrices from 13 diverse domains. SURAA outperforms the other tools, delivering a 13.99x speedup on average. We believe that our approach provides a fundamental shift in addressing SpMV-related challenges on GPUs, including coalesced memory access, thread divergence, and load balancing, and is set to open new avenues for further improving SpMV performance in the future.
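The preprocessing described in the abstract (sort rows by npr, cut them into segments with the Freedman–Diaconis rule, then assign segments to three groups by mean npr) can be illustrated with a small host-side sketch. This is not the authors' implementation: the abstract does not give the exact segmentation procedure or the group boundaries, so the sketch assumes a CSR row-pointer array as input, bins rows by npr using the standard Freedman–Diaconis bin width 2·IQR/n^(1/3), and uses hypothetical thresholds `low` and `high` to separate the three groups. All function names are illustrative.

```python
from statistics import quantiles


def nonzeros_per_row(row_ptr):
    """npr of each row of a CSR matrix, read off its row-pointer array."""
    return [row_ptr[i + 1] - row_ptr[i] for i in range(len(row_ptr) - 1)]


def freedman_diaconis_width(values):
    """Standard Freedman-Diaconis bin width: 2 * IQR / n^(1/3).

    Falls back to a width of 1 when the IQR is zero (an assumption made
    here so that the binning below is always well defined).
    """
    q1, _, q3 = quantiles(values, n=4, method="inclusive")
    width = 2.0 * (q3 - q1) / (len(values) ** (1.0 / 3.0))
    return width if width > 0 else 1.0


def build_segments(row_ptr):
    """Sort rows by npr and bin them so each segment holds rows of similar npr."""
    npr = nonzeros_per_row(row_ptr)
    order = sorted(range(len(npr)), key=lambda r: npr[r])
    width = freedman_diaconis_width(npr)
    smallest = npr[order[0]]
    bins = {}
    for r in order:
        bins.setdefault(int((npr[r] - smallest) // width), []).append(r)
    return [bins[b] for b in sorted(bins)], npr


def group_segments(segments, npr, low=4, high=64):
    """Assign each segment to one of three groups by its mean npr.

    `low` and `high` are hypothetical thresholds, not values from the paper.
    """
    groups = {"small": [], "medium": [], "large": []}
    for seg in segments:
        mean = sum(npr[r] for r in seg) / len(seg)
        key = "small" if mean <= low else "medium" if mean <= high else "large"
        groups[key].append(seg)
    return groups


# Eight rows with npr = [1, 2, 1, 100, 3, 2, 1, 50]
row_ptr = [0, 1, 3, 4, 104, 107, 109, 110, 160]
segments, npr = build_segments(row_ptr)
groups = group_segments(segments, npr)
```

In a GPU implementation, each group would then be dispatched to a kernel suited to its npr range, with the "large" group being the one the abstract says is handled through Dynamic Parallelism; that part is outside the scope of this sketch.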
