Auto-tuning of hybrid MPI/OpenMP execution with code selection by ppOpen-AT

Takahiro Katagiri, Masaharu Matsumoto, Satoshi Ohshima

研究成果: 書籍/レポート タイプへの寄稿会議への寄与

7 被引用数 (Scopus)

抄録

In this paper, we propose an effective kernel implementation for an application of the finite difference method (FDM) by merging computations of central-difference and explicit time expansion schemes without IF statements inside the loops. The effectiveness of the implementation depends on the CPU architecture and execution situation, such as the problem size and the number of MPI processes and OpenMP threads. We adopt auto-tuning (AT) technology to select the best implementation. The AT function for the selection, referred to as «code selection», is implemented in an AT language, namely, ppOpen-AT. The results of experiments conducted using current advanced CPUs (Xeon Phi, Ivy Bridge, and FX10) indicated that crucial speedups of conventional AT are achieved by code selection. In particular, the heaviest kernels achieved speedups of 4.21x (Xeon Phi), 2.52x (Ivy Bridge), and 2.03x (FX10).

本文言語英語
ホスト出版物のタイトルProceedings - 2016 IEEE 30th International Parallel and Distributed Processing Symposium, IPDPS 2016
出版社Institute of Electrical and Electronics Engineers Inc.
ページ1488-1495
ページ数8
ISBN(電子版)9781509021406
DOI
出版ステータス出版済み - 7月 18 2016
外部発表はい
イベント30th IEEE International Parallel and Distributed Processing Symposium Workshops, IPDPSW 2016 - Chicago, 米国
継続期間: 5月 23 20165月 27 2016

出版物シリーズ

名前Proceedings - 2016 IEEE 30th International Parallel and Distributed Processing Symposium, IPDPS 2016

その他

その他30th IEEE International Parallel and Distributed Processing Symposium Workshops, IPDPSW 2016
国/地域米国
CityChicago
Period5/23/165/27/16

!!!All Science Journal Classification (ASJC) codes

  • コンピュータ ネットワークおよび通信

フィンガープリント

「Auto-tuning of hybrid MPI/OpenMP execution with code selection by ppOpen-AT」の研究トピックを掘り下げます。これらがまとまってユニークなフィンガープリントを構成します。

引用スタイル