Scientific simulations often require solving extremely large sparse systems of linear equations, whose dominant kernel is sparse matrix-vector multiplication (SpMV). On modern many-core processors such as GPUs, this operation is known to be a significant bottleneck and to run at extremely poor efficiency, because of limited processor-to-memory bandwidth and a low cache hit ratio caused by random accesses to the input vector. Our family of new sparse matrix formats for many-core processors significantly increases the cache hit ratio, and thus performance, by segmenting the matrix along its columns, dividing the work among the many cores so that each segment fits within the internal cache capacity, and aggregating the partial results afterward. Compared to the best vendor libraries and recently proposed formats such as SELL-C-σ, our formats achieved speedups of up to 2.0x in SpMV for real datasets taken from the Florida Sparse Matrix Collection, up to 3.0x in SpMV for synthetic matrices, and up to 1.68x for multi-node CG.
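The column-segmentation idea can be illustrated with a minimal sketch: nonzeros are grouped by column segment so that each segment reads only a cache-sized slice of the input vector, and the per-segment partial results are aggregated at the end. All names, the COO-style layout, and the fixed segment width here are illustrative assumptions, not the paper's actual format.

```python
import numpy as np

def segmented_spmv(rows, cols, vals, x, n_rows, seg_width):
    """Illustrative column-segmented SpMV (assumed layout, not the
    paper's format): each column segment touches only a seg_width
    slice of x, so vector accesses stay cache-resident; partial
    per-segment results are summed afterward."""
    n_segs = (len(x) + seg_width - 1) // seg_width
    partial = np.zeros((n_segs, n_rows))    # one partial result per segment
    for r, c, v in zip(rows, cols, vals):
        s = c // seg_width                  # segment this nonzero's column falls in
        partial[s, r] += v * x[c]           # x[c] is within the segment's slice
    return partial.sum(axis=0)              # aggregation step

# Small usage example: a 3x4 matrix with 4 nonzeros in COO form.
rows = [0, 0, 1, 2]
cols = [0, 3, 1, 2]
vals = [1.0, 2.0, 3.0, 4.0]
x = np.array([1.0, 2.0, 3.0, 4.0])
y = segmented_spmv(rows, cols, vals, x, n_rows=3, seg_width=2)
```

On a real GPU each segment's slice of `x` would be staged in fast on-chip memory and segments processed in parallel; the sequential loop above only shows the data-access pattern.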