TY - GEN
T1 - 2-3-4 combination for parallel compression on the K computer
AU - Bi, Chongke
AU - Ono, Kenji
PY - 2014/1/1
Y1 - 2014/1/1
N2 - The development of supercomputers has successfully helped us to carry on complicated simulation with exploded size of dataset. For visualizing such kind of large-scale dataset, reducing the data size by using compression methods is one of the most useful approach. Moreover, parallelization of compression algorithm can greatly improve the efficiency and resolve the limitation of memory size. However, in parallel compression algorithm, interprocessor communication is indispensable, while it is also a bottleneck problem, especially for the general cases that the number of processors is not power-of-Two. Parallel POD (proper orthogonal decomposition) compression algorithm is such an example, the number of time steps must be power-of-Two for the binary swap scheme. A method that can fully resolve this problem with low computational cost will be very popular. In this paper, we proposed such an approach called 2-3-4 combination approach, which can be simply implemented and also reach high performance of parallel computing algorithms. Furthermore, our method can obtain the best balance among all parallel computing processors. This is achieved by transferring the non-power-of-Two problem into power-of-Two problem to fully use the best balance feature of binary swap method. We evaluate our approach through applying it to the parallel POD compression algorithm on the K computer.
AB - The development of supercomputers has successfully helped us to carry on complicated simulation with exploded size of dataset. For visualizing such kind of large-scale dataset, reducing the data size by using compression methods is one of the most useful approach. Moreover, parallelization of compression algorithm can greatly improve the efficiency and resolve the limitation of memory size. However, in parallel compression algorithm, interprocessor communication is indispensable, while it is also a bottleneck problem, especially for the general cases that the number of processors is not power-of-Two. Parallel POD (proper orthogonal decomposition) compression algorithm is such an example, the number of time steps must be power-of-Two for the binary swap scheme. A method that can fully resolve this problem with low computational cost will be very popular. In this paper, we proposed such an approach called 2-3-4 combination approach, which can be simply implemented and also reach high performance of parallel computing algorithms. Furthermore, our method can obtain the best balance among all parallel computing processors. This is achieved by transferring the non-power-of-Two problem into power-of-Two problem to fully use the best balance feature of binary swap method. We evaluate our approach through applying it to the parallel POD compression algorithm on the K computer.
UR - http://www.scopus.com/inward/record.url?scp=84899576727&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84899576727&partnerID=8YFLogxK
U2 - 10.1109/PacificVis.2014.28
DO - 10.1109/PacificVis.2014.28
M3 - Conference contribution
AN - SCOPUS:84899576727
SN - 9781479928736
T3 - IEEE Pacific Visualization Symposium
SP - 281
EP - 285
BT - Proceedings - 2014 IEEE Pacific Visualization Symposium, PacificVis 2014
PB - IEEE Computer Society
T2 - 2014 7th IEEE Pacific Visualization Symposium, PacificVis 2014
Y2 - 4 March 2014 through 7 March 2014
ER -