TY - GEN
T1 - Control formats for unsymmetric and symmetric sparse matrix-vector multiplications on OpenMP implementations
AU - Katagiri, Takahiro
AU - Sakurai, Takao
AU - Igai, Mitsuyoshi
AU - Ohshima, Satoshi
AU - Kuroda, Hisayasu
AU - Naono, Ken
AU - Nakajima, Kengo
PY - 2013/9/5
Y1 - 2013/9/5
AB - In this paper, we propose "control formats" to obtain better thread performance of sparse matrix-vector multiplication (SpMV) for unsymmetric and symmetric matrices. By using the control formats, we established the following maximum speedups of SpMV in 16-thread execution on one node of the T2K Open Supercomputer: (1) 7.14x for an unsymmetric matrix by using the proposed Branchless Segmented Scan compared to the original Segmented Scan method; (2) 12.7x for a symmetric matrix by using the proposed Zero-element Computation-free method compared to a simple SpMV implementation.
UR - http://www.scopus.com/inward/record.url?scp=84883285608&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84883285608&partnerID=8YFLogxK
U2 - 10.1007/978-3-642-38718-0_24
DO - 10.1007/978-3-642-38718-0_24
M3 - Conference contribution
AN - SCOPUS:84883285608
SN - 9783642387173
T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
SP - 236
EP - 248
BT - High Performance Computing for Computational Science, VECPAR 2012 - 10th International Conference, Revised Selected Papers
T2 - 10th International Conference on High Performance Computing for Computational Science, VECPAR 2012
Y2 - 17 July 2012 through 20 July 2012
ER -