TY - GEN
T1 - Parallelization of GKV benchmark using OpenACC
AU - Morishita, Makoto
AU - Ohshima, Satoshi
AU - Katagiri, Takahiro
AU - Nagai, Toru
N1 - Publisher Copyright:
© 2021 IEEE.
PY - 2021/6
Y1 - 2021/6
N2 - The computing power of the Graphics Processing Unit (GPU) has received great attention in recent years, as 140 supercomputers with NVIDIA GPUs were ranked in the TOP500 for November 2020 [1]. However, CUDA, which is widely used in GPU programming, needs to be written at a low level and often requires the specialized knowledge of the GPU memory hierarchy and execution models. In this study, we used OpenACC [2], which semi-automatically generates kernel code by inserting directives into a program to speed up the application. The target application was benchmark program based on the plasma turbulence analysis code, gyrokinetic Vlasov code (GKV). With our implementation of OpenACC, kernel2, kernel3, and kernel4 of the benchmark were 31.43, 7.08, and 10.74 times faster, respectively, compared to CPU sequential execution. Thus, we succeeded in increasing the applications' speed. In the future, we will port the rest of the code to the GPU environment to run the entire GKV on GPUs.
AB - The computing power of the Graphics Processing Unit (GPU) has received great attention in recent years, as 140 supercomputers with NVIDIA GPUs were ranked in the TOP500 for November 2020 [1]. However, CUDA, which is widely used in GPU programming, needs to be written at a low level and often requires the specialized knowledge of the GPU memory hierarchy and execution models. In this study, we used OpenACC [2], which semi-automatically generates kernel code by inserting directives into a program to speed up the application. The target application was benchmark program based on the plasma turbulence analysis code, gyrokinetic Vlasov code (GKV). With our implementation of OpenACC, kernel2, kernel3, and kernel4 of the benchmark were 31.43, 7.08, and 10.74 times faster, respectively, compared to CPU sequential execution. Thus, we succeeded in increasing the applications' speed. In the future, we will port the rest of the code to the GPU environment to run the entire GKV on GPUs.
UR - http://www.scopus.com/inward/record.url?scp=85114409723&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85114409723&partnerID=8YFLogxK
U2 - 10.1109/IPDPSW52791.2021.00109
DO - 10.1109/IPDPSW52791.2021.00109
M3 - Conference contribution
AN - SCOPUS:85114409723
T3 - 2021 IEEE International Parallel and Distributed Processing Symposium Workshops, IPDPSW 2021 - In conjunction with IEEE IPDPS 2021
SP - 723
EP - 729
BT - 2021 IEEE International Parallel and Distributed Processing Symposium Workshops, IPDPSW 2021 - In conjunction with IEEE IPDPS 2021
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 2021 IEEE International Parallel and Distributed Processing Symposium Workshops, IPDPSW 2021
Y2 - 17 May 2021
ER -