Directive-Based Auto-Tuning for the Finite Difference Method on the Xeon Phi

Takahiro Katagiri, Satoshi Ohshima, Masaharu Matsumoto

Research output: Chapter in Book/Report/Conference proceedingConference contribution

7 Citations (Scopus)

Abstract

In this paper, we present a directive-based auto-tuning (AT) framework, called ppOpen-AT, and demonstrate its effect using simulation code based on the Finite Difference Method (FDM). The framework utilizes well-known loop transformation techniques. However, the codes used are carefully designed to minimize the software stack in order to meet the requirements of a many-core architecture currently in operation. The results of evaluations conducted using ppOpen-AT indicate that maximum speedup factors greater than 550% are obtained when it is applied in eight nodes of the Intel Xeon Phi. Further, in the AT for data packing and unpacking, a 49% speedup factor for the whole application is achieved. By using it with strong scaling on 32 nodes in a cluster of the Xeon Phi, we also obtain 24% speedups for the overall execution.

Original languageEnglish
Title of host publicationProceedings - 2015 IEEE 29th International Parallel and Distributed Processing Symposium Workshops, IPDPSW 2015
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages1221-1230
Number of pages10
ISBN (Electronic)0769555101, 9780769555102
DOIs
Publication statusPublished - Sep 29 2015
Event29th IEEE International Parallel and Distributed Processing Symposium Workshops, IPDPSW 2015 - Hyderabad, India
Duration: May 25 2015May 29 2015

Publication series

NameProceedings - 2015 IEEE 29th International Parallel and Distributed Processing Symposium Workshops, IPDPSW 2015

Other

Other29th IEEE International Parallel and Distributed Processing Symposium Workshops, IPDPSW 2015
CountryIndia
CityHyderabad
Period5/25/155/29/15

Fingerprint

Finite difference method
Tuning

All Science Journal Classification (ASJC) codes

  • Computer Networks and Communications
  • Hardware and Architecture

Cite this

Katagiri, T., Ohshima, S., & Matsumoto, M. (2015). Directive-Based Auto-Tuning for the Finite Difference Method on the Xeon Phi. In Proceedings - 2015 IEEE 29th International Parallel and Distributed Processing Symposium Workshops, IPDPSW 2015 (pp. 1221-1230). [7284452] (Proceedings - 2015 IEEE 29th International Parallel and Distributed Processing Symposium Workshops, IPDPSW 2015). Institute of Electrical and Electronics Engineers Inc.. https://doi.org/10.1109/IPDPSW.2015.11

Directive-Based Auto-Tuning for the Finite Difference Method on the Xeon Phi. / Katagiri, Takahiro; Ohshima, Satoshi; Matsumoto, Masaharu.

Proceedings - 2015 IEEE 29th International Parallel and Distributed Processing Symposium Workshops, IPDPSW 2015. Institute of Electrical and Electronics Engineers Inc., 2015. p. 1221-1230 7284452 (Proceedings - 2015 IEEE 29th International Parallel and Distributed Processing Symposium Workshops, IPDPSW 2015).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Katagiri, T, Ohshima, S & Matsumoto, M 2015, Directive-Based Auto-Tuning for the Finite Difference Method on the Xeon Phi. in Proceedings - 2015 IEEE 29th International Parallel and Distributed Processing Symposium Workshops, IPDPSW 2015., 7284452, Proceedings - 2015 IEEE 29th International Parallel and Distributed Processing Symposium Workshops, IPDPSW 2015, Institute of Electrical and Electronics Engineers Inc., pp. 1221-1230, 29th IEEE International Parallel and Distributed Processing Symposium Workshops, IPDPSW 2015, Hyderabad, India, 5/25/15. https://doi.org/10.1109/IPDPSW.2015.11
Katagiri T, Ohshima S, Matsumoto M. Directive-Based Auto-Tuning for the Finite Difference Method on the Xeon Phi. In Proceedings - 2015 IEEE 29th International Parallel and Distributed Processing Symposium Workshops, IPDPSW 2015. Institute of Electrical and Electronics Engineers Inc. 2015. p. 1221-1230. 7284452. (Proceedings - 2015 IEEE 29th International Parallel and Distributed Processing Symposium Workshops, IPDPSW 2015). https://doi.org/10.1109/IPDPSW.2015.11
Katagiri, Takahiro ; Ohshima, Satoshi ; Matsumoto, Masaharu. / Directive-Based Auto-Tuning for the Finite Difference Method on the Xeon Phi. Proceedings - 2015 IEEE 29th International Parallel and Distributed Processing Symposium Workshops, IPDPSW 2015. Institute of Electrical and Electronics Engineers Inc., 2015. pp. 1221-1230 (Proceedings - 2015 IEEE 29th International Parallel and Distributed Processing Symposium Workshops, IPDPSW 2015).
@inproceedings{adb8826113b044bc998fd5a6da611fbc,
title = "Directive-Based Auto-Tuning for the Finite Difference Method on the Xeon Phi",
abstract = "In this paper, we present a directive-based auto-tuning (AT) framework, called ppOpen-AT, and demonstrate its effect using simulation code based on the Finite Difference Method (FDM). The framework utilizes well-known loop transformation techniques. However, the codes used are carefully designed to minimize the software stack in order to meet the requirements of a many-core architecture currently in operation. The results of evaluations conducted using ppOpen-AT indicate that maximum speedup factors greater than 550{\%} are obtained when it is applied in eight nodes of the Intel Xeon Phi. Further, in the AT for data packing and unpacking, a 49{\%} speedup factor for the whole application is achieved. By using it with strong scaling on 32 nodes in a cluster of the Xeon Phi, we also obtain 24{\%} speedups for the overall execution.",
author = "Takahiro Katagiri and Satoshi Ohshima and Masaharu Matsumoto",
year = "2015",
month = "9",
day = "29",
doi = "10.1109/IPDPSW.2015.11",
language = "English",
series = "Proceedings - 2015 IEEE 29th International Parallel and Distributed Processing Symposium Workshops, IPDPSW 2015",
publisher = "Institute of Electrical and Electronics Engineers Inc.",
pages = "1221--1230",
booktitle = "Proceedings - 2015 IEEE 29th International Parallel and Distributed Processing Symposium Workshops, IPDPSW 2015",
address = "United States",

}

TY - GEN

T1 - Directive-Based Auto-Tuning for the Finite Difference Method on the Xeon Phi

AU - Katagiri, Takahiro

AU - Ohshima, Satoshi

AU - Matsumoto, Masaharu

PY - 2015/9/29

Y1 - 2015/9/29

N2 - In this paper, we present a directive-based auto-tuning (AT) framework, called ppOpen-AT, and demonstrate its effect using simulation code based on the Finite Difference Method (FDM). The framework utilizes well-known loop transformation techniques. However, the codes used are carefully designed to minimize the software stack in order to meet the requirements of a many-core architecture currently in operation. The results of evaluations conducted using ppOpen-AT indicate that maximum speedup factors greater than 550% are obtained when it is applied in eight nodes of the Intel Xeon Phi. Further, in the AT for data packing and unpacking, a 49% speedup factor for the whole application is achieved. By using it with strong scaling on 32 nodes in a cluster of the Xeon Phi, we also obtain 24% speedups for the overall execution.

AB - In this paper, we present a directive-based auto-tuning (AT) framework, called ppOpen-AT, and demonstrate its effect using simulation code based on the Finite Difference Method (FDM). The framework utilizes well-known loop transformation techniques. However, the codes used are carefully designed to minimize the software stack in order to meet the requirements of a many-core architecture currently in operation. The results of evaluations conducted using ppOpen-AT indicate that maximum speedup factors greater than 550% are obtained when it is applied in eight nodes of the Intel Xeon Phi. Further, in the AT for data packing and unpacking, a 49% speedup factor for the whole application is achieved. By using it with strong scaling on 32 nodes in a cluster of the Xeon Phi, we also obtain 24% speedups for the overall execution.

UR - http://www.scopus.com/inward/record.url?scp=84962291305&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=84962291305&partnerID=8YFLogxK

U2 - 10.1109/IPDPSW.2015.11

DO - 10.1109/IPDPSW.2015.11

M3 - Conference contribution

AN - SCOPUS:84962291305

T3 - Proceedings - 2015 IEEE 29th International Parallel and Distributed Processing Symposium Workshops, IPDPSW 2015

SP - 1221

EP - 1230

BT - Proceedings - 2015 IEEE 29th International Parallel and Distributed Processing Symposium Workshops, IPDPSW 2015

PB - Institute of Electrical and Electronics Engineers Inc.

ER -