Scalable Direct-Iterative Hybrid Solver for Sparse Matrices on Multi-Core and Vector Architectures

Kenji Ono, Toshihiro Kato, Satoshi Ohshima, Takeshi Nanri

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

    1 Citation (Scopus)

    Abstract

    In the present paper, we propose an efficient direct-iterative hybrid solver for sparse matrices that can derive the scalability of the latest multi-core, many-core, and vector architectures and examine the execution performance of the proposed SLOR-PCR method. We also present an efficient implementation of the PCR algorithm for SIMD and vector architectures so that it is easy to output instructions optimized by the compiler. The proposed hybrid method has high cache reusability, which is favorable for modern low B/F architecture because efficient use of the cache can mitigate the memory bandwidth limitation. The measured performance revealed that the SLOR-PCR solver showed excellent scalability up to 352 cores on the cc-NUMA environment, and the achieved performance was higher than that of the conventional Jacobi and Red-Black ordering method by a factor of 3.6 to 8.3 on the SIMD architecture. In addition, the maximum speedup in computation time was observed to be a factor of 6.3 on the cc-NUMA architecture with 352 cores.

    Original languageEnglish
    Title of host publicationProceedings of International Conference on High Performance Computing in Asia-Pacific Region, HPC Asia 2020
    PublisherAssociation for Computing Machinery
    Pages11-21
    Number of pages11
    ISBN (Electronic)9781450372367
    DOIs
    Publication statusPublished - Jan 15 2020
    Event2020 International Conference on High Performance Computing in Asia-Pacific Region, HPC Asia 2020 - Fukuoka, Japan
    Duration: Jan 15 2020Jan 17 2020

    Publication series

    NameACM International Conference Proceeding Series

    Conference

    Conference2020 International Conference on High Performance Computing in Asia-Pacific Region, HPC Asia 2020
    Country/TerritoryJapan
    CityFukuoka
    Period1/15/201/17/20

    All Science Journal Classification (ASJC) codes

    • Human-Computer Interaction
    • Computer Networks and Communications
    • Computer Vision and Pattern Recognition
    • Software

    Fingerprint

    Dive into the research topics of 'Scalable Direct-Iterative Hybrid Solver for Sparse Matrices on Multi-Core and Vector Architectures'. Together they form a unique fingerprint.

    Cite this