Home >

news ヘルプ

論文・著書情報


タイトル
和文: 
英文:Multi-level Blocking Optimization for Fast Sparse Matrix Vector Multiplication on GPUs 
著者
和文: 長坂 侑亮, 額田 彰, 松岡 聡.  
英文: Yusuke Nagasaka, Akira Nukada, Satoshi Matsuoka.  
言語 English 
掲載誌/書名
和文: 
英文: 
巻, 号, ページ        
出版年月 2015年11月15日 
出版者
和文: 
英文: 
会議名称
和文: 
英文:The International Conference for High Performance Computing, Networking, Storage and Analysis (SC15) 
開催地
和文: 
英文:Austin, Texas 
公式リンク http://sc15.supercomputing.org/sites/all/themes/SC15images/tech_poster/tech_poster_pages/post332.html
 
アブストラクト Many scientific and industrial simulations require solving large linear equations, whose bottleneck is sparse matrix vector multiplication (SpMV). Although some previous work has shown improvement of SpMV performance on GPU, the critical bottlenecks such as requirement of high memory bandwidth and low cache hit ratio due to random memory access to input vector still remain. We propose the state of the art sparse matrix format reducing memory access for GPU. Adaptive Multi-level Blocking (AMB) format compresses the column index by using 16-bit integer and several blocking optimizations, and we also devise effective SpMV kernel. We evaluated the performance of our approach for 62 positive definite large size matrices in single precision. AMB format achieves significant speedup of x2.83 on maximum and x1.75 on average compared to cuSparse library and x1.38 on maximum and x1.08 on average compared to yaSpMV, which is recently proposed fast SpMV library.

©2007 Tokyo Institute of Technology All rights reserved.