TY - GEN

T1 - 2-3-4 combination for parallel compression on the K computer

AU - Bi, Chongke

AU - Ono, Kenji

PY - 2014/1/1

Y1 - 2014/1/1

N2 - The development of supercomputers has enabled complicated simulations that produce datasets of rapidly growing size. For visualizing such large-scale datasets, reducing the data size with compression methods is one of the most useful approaches. Moreover, parallelizing the compression algorithm can greatly improve efficiency and overcome memory-size limitations. However, parallel compression algorithms require interprocessor communication, which is also a bottleneck, especially in the general case where the number of processors is not a power of two. The parallel POD (proper orthogonal decomposition) compression algorithm is one such example: the number of time steps must be a power of two for the binary swap scheme. A method that fully resolves this problem at low computational cost is therefore highly desirable. In this paper, we propose such a method, called the 2-3-4 combination approach, which is simple to implement and achieves high performance in parallel computing algorithms. Furthermore, our method obtains the best balance among all parallel processors. It does so by transforming the non-power-of-two problem into a power-of-two problem, so that the balance property of the binary swap method can be fully exploited. We evaluate our approach by applying it to the parallel POD compression algorithm on the K computer.

AB - The development of supercomputers has enabled complicated simulations that produce datasets of rapidly growing size. For visualizing such large-scale datasets, reducing the data size with compression methods is one of the most useful approaches. Moreover, parallelizing the compression algorithm can greatly improve efficiency and overcome memory-size limitations. However, parallel compression algorithms require interprocessor communication, which is also a bottleneck, especially in the general case where the number of processors is not a power of two. The parallel POD (proper orthogonal decomposition) compression algorithm is one such example: the number of time steps must be a power of two for the binary swap scheme. A method that fully resolves this problem at low computational cost is therefore highly desirable. In this paper, we propose such a method, called the 2-3-4 combination approach, which is simple to implement and achieves high performance in parallel computing algorithms. Furthermore, our method obtains the best balance among all parallel processors. It does so by transforming the non-power-of-two problem into a power-of-two problem, so that the balance property of the binary swap method can be fully exploited. We evaluate our approach by applying it to the parallel POD compression algorithm on the K computer.

UR - http://www.scopus.com/inward/record.url?scp=84899576727&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=84899576727&partnerID=8YFLogxK

U2 - 10.1109/PacificVis.2014.28

DO - 10.1109/PacificVis.2014.28

M3 - Conference contribution

AN - SCOPUS:84899576727

SN - 9781479928736

T3 - IEEE Pacific Visualization Symposium

SP - 281

EP - 285

BT - Proceedings - 2014 IEEE Pacific Visualization Symposium, PacificVis 2014

PB - IEEE Computer Society

T2 - 2014 7th IEEE Pacific Visualization Symposium, PacificVis 2014

Y2 - 4 March 2014 through 7 March 2014

ER -