The very nature of distributed-memory parallel architectures demands the serious consideration of interprocessor communications, by which non-local memory access can be implemented. In this paper, we propose a method for optimization that utilizes asynchronous and bulk communications. We constructed an HPF- compiler, a subset of HPF, and evaluated it on the CM5, a true distributed-memory machine using three non-trivial benchmark programs. With optimized code, there was considerable improvement in communications over non-optimized code and we had much better results than when optimizing by means of CM Fortran.
|出版ステータス||出版済み - 1 1 1995|
|イベント||Proceedings of the 1995 Conference on Supercomputing - Barcelona, Spain|
継続期間: 7 3 1995 → 7 7 1995
|その他||Proceedings of the 1995 Conference on Supercomputing|
|Period||7/3/95 → 7/7/95|
All Science Journal Classification (ASJC) codes
- Computer Science(all)