TY - GEN
T1 - An Auto-tuning with Adaptation of A64 Scalable Vector Extension for SPIRAL
AU - Kitai, Naruya
AU - Takahashi, Daisuke
AU - Franchetti, Franz
AU - Katagiri, Takahiro
AU - Ohshima, Satoshi
AU - Nagai, Toru
N1 - Funding Information:
This work was supported by JSPS KAKENHI (Grant No. JP19H05662).
Publisher Copyright:
© 2021 IEEE.
PY - 2021/6
Y1 - 2021/6
N2 - In this paper, we propose an auto-tuning (AT) system by adapting the A64 Scalable Vector Extension for SPIRAL to generate discrete Fourier transform (DFT) implementations. The performance of our method is evaluated using the Supercomputer "Flow" at Nagoya University. The DFT codes that use the A64 Scalable Vector Extension are up to 1.98 times faster than scalar DFT codes and achieve a SIMD instruction rate up to 3.63 times higher. In addition, we obtain a maximum speedup factor of 2.32 by adapting the proposed AT system for loop unrolling.
AB - In this paper, we propose an auto-tuning (AT) system by adapting the A64 Scalable Vector Extension for SPIRAL to generate discrete Fourier transform (DFT) implementations. The performance of our method is evaluated using the Supercomputer "Flow" at Nagoya University. The DFT codes that use the A64 Scalable Vector Extension are up to 1.98 times faster than scalar DFT codes and achieve a SIMD instruction rate up to 3.63 times higher. In addition, we obtain a maximum speedup factor of 2.32 by adapting the proposed AT system for loop unrolling.
UR - http://www.scopus.com/inward/record.url?scp=85114444033&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85114444033&partnerID=8YFLogxK
U2 - 10.1109/IPDPSW52791.2021.00117
DO - 10.1109/IPDPSW52791.2021.00117
M3 - Conference contribution
AN - SCOPUS:85114444033
T3 - 2021 IEEE International Parallel and Distributed Processing Symposium Workshops, IPDPSW 2021 - In conjunction with IEEE IPDPS 2021
SP - 789
EP - 797
BT - 2021 IEEE International Parallel and Distributed Processing Symposium Workshops, IPDPSW 2021 - In conjunction with IEEE IPDPS 2021
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 2021 IEEE International Parallel and Distributed Processing Symposium Workshops, IPDPSW 2021
Y2 - 17 May 2021
ER -