Preliminary investigation of distributed shared memory system on a cluster of high performance clusters

Takeshi Nanri, Yoshitaka Watanabe, Hiyoyuki Sato, Masaaki Shimasaki

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

Abstract

This paper introduces the design and basic performance of a DSM (distributed shared memory) system on a cluster of clusters. Networking devices such as Myrinet have significantly improved the performance of cluster systems. In addition, such network devices have introduced a new hierarchical architecture: the multi-cluster, a cluster of high-performance clusters. To ease the difficulty of programming with message passing, which is the conventional programming paradigm on cluster systems, many DSM systems have been developed in recent years. However, no DSM system has been developed for multi-clusters. The DSM system consists of a runtime system that supports the basic functions for accessing virtual shared memory built on such an environment. These functions are allocation of global data, read and write accesses to global data, synchronization of the whole system, and mutual exclusion. The authors have evaluated the performance of the runtime system, built on an SMP cluster, COMPaS, at RWCP (Real World Computing Partnership) in Tsukuba, Japan. The results show that a read access to remote memory on the same cluster costs about 0.2 msec, while a read access to remote memory on another cluster costs about 1.3 msec. LU decomposition on a multi-cluster consisting of two clusters of three PCs runs about 2.8 times faster than on one PC.
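The two read latencies and the speedup quoted in the abstract can be put into a small cost model. The following is only an illustrative sketch, not the authors' runtime system: the names `Node`, `ToyDSM`, `read_cost_ms`, and `parallel_efficiency` are hypothetical, local reads are assumed to be free, writes are left uncharged because the abstract gives no write cost, and synchronization and mutual exclusion are omitted because the sketch runs in a single process.

```python
from dataclasses import dataclass

# Read latencies quoted in the abstract, in milliseconds.
INTRA_CLUSTER_READ_MS = 0.2   # remote memory on the same cluster
INTER_CLUSTER_READ_MS = 1.3   # remote memory on another cluster
LOCAL_READ_MS = 0.0           # assumption: local access treated as free


@dataclass(frozen=True)
class Node:
    """Hypothetical identifier of one PC: which cluster, which rank in it."""
    cluster: int
    rank: int


def read_cost_ms(reader: Node, owner: Node) -> float:
    """Estimated cost of one read, based on where the owning node sits."""
    if reader == owner:
        return LOCAL_READ_MS
    if reader.cluster == owner.cluster:
        return INTRA_CLUSTER_READ_MS
    return INTER_CLUSTER_READ_MS


class ToyDSM:
    """Single-process stand-in for global allocation plus read/write access."""

    def __init__(self, nodes):
        self.nodes = list(nodes)
        self.owner = {}                              # name -> owning Node
        self.data = {}                               # name -> stored value
        self.elapsed_ms = {n: 0.0 for n in self.nodes}

    def alloc(self, name, owner, value=None):
        """Allocate a global datum whose home is the given node."""
        self.owner[name] = owner
        self.data[name] = value

    def read(self, node, name):
        """Read a global datum and charge the reader the modeled latency."""
        self.elapsed_ms[node] += read_cost_ms(node, self.owner[name])
        return self.data[name]

    def write(self, node, name, value):
        """Write a global datum; uncharged, since no write cost is reported."""
        self.data[name] = value


def parallel_efficiency(speedup: float, num_pcs: int) -> float:
    """Speedup divided by the number of PCs used."""
    return speedup / num_pcs


# Two clusters of three PCs, read here as six PCs in total.
nodes = [Node(c, r) for c in range(2) for r in range(3)]
dsm = ToyDSM(nodes)
dsm.alloc("pivot_row", owner=Node(0, 0), value=[1.0, 2.0, 3.0])
dsm.read(Node(0, 1), "pivot_row")   # same cluster as the owner
dsm.read(Node(1, 2), "pivot_row")   # other cluster
print(dsm.elapsed_ms[Node(0, 1)], dsm.elapsed_ms[Node(1, 2)])
print(round(parallel_efficiency(2.8, len(nodes)), 2))
```

Under this model an inter-cluster read costs 6.5 times as much as an intra-cluster read, and a 2.8-fold speedup on six PCs corresponds to a parallel efficiency a little under one half.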

Original language: English
Title of host publication: European Congress on Computational Methods in Applied Sciences and Engineering, ECCOMAS 2000
Publication status: Published - Dec 1 2000
Event: European Congress on Computational Methods in Applied Sciences and Engineering, ECCOMAS 2000 - Barcelona, Spain
Duration: Sep 11 2000 → Sep 14 2000

Other

Other: European Congress on Computational Methods in Applied Sciences and Engineering, ECCOMAS 2000
Country: Spain
City: Barcelona
Period: 9/11/00 → 9/14/00

All Science Journal Classification (ASJC) codes

  • Artificial Intelligence
  • Applied Mathematics


  • Cite this

    Nanri, T., Watanabe, Y., Sato, H., & Shimasaki, M. (2000). Preliminary investigation of distributed shared memory system on a cluster of high performance clusters. In European Congress on Computational Methods in Applied Sciences and Engineering, ECCOMAS 2000