Control formats for unsymmetric and symmetric sparse matrix-vector multiplications on OpenMP implementations

Takahiro Katagiri, Takao Sakurai, Mitsuyoshi Igai, Satoshi Ohshima, Hisayasu Kuroda, Ken Naono, Kengo Nakajima

Research output: Chapter in Book/Report/Conference proceedingConference contribution

2 Citations (Scopus)

Abstract

In this paper, we propose "control formats" to obtain better thread performance of sparse matrix-vector multiplication (SpMV) for unsymmetric and symmetric matrices. By using the control formats, we established the following maximum speedups of SpMV in 16-thread execution on one node of the T2K Open Supercomputer: (1) 7.14x for an unsymmetric matrix by using the proposed Branchless Segmented Scan compared to the original Segmented Scan method; (2) 12.7x for a symmetric matrix by using the proposed Zero-element Computation-free method compared to a simple SpMV implementation.

Original languageEnglish
Title of host publicationHigh Performance Computing for Computational Science, VECPAR 2012 - 10th International Conference, Revised Selected Papers
Pages236-248
Number of pages13
DOIs
Publication statusPublished - Sep 5 2013
Externally publishedYes
Event10th International Conference on High Performance Computing for Computational Science, VECPAR 2012 - Kobe, Japan
Duration: Jul 17 2012Jul 20 2012

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume7851 LNCS
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Other

Other10th International Conference on High Performance Computing for Computational Science, VECPAR 2012
Country/TerritoryJapan
CityKobe
Period7/17/127/20/12

All Science Journal Classification (ASJC) codes

  • Theoretical Computer Science
  • Computer Science(all)

Fingerprint

Dive into the research topics of 'Control formats for unsymmetric and symmetric sparse matrix-vector multiplications on OpenMP implementations'. Together they form a unique fingerprint.

Cite this