Implementation and evaluation of Fock matrix calculation program on the Cell processor

Hiroaki Honda, Tetsuo Hayashi, Yuichi Inadomi, Koji Inoue, Kazuaki J. Murakami

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

Various processor architectures have been proposed until today, and the performance has improved remarkably. Recently, the Chip Multi-processors (CMPs), which has many processor cores onto a chip, are proposed for further performance improvement. The Cell processor is one of such CMP and shows high computational performance. Although this processor is designed for the multimedia, that high performance character can be utilized to molecular orbital calculation. In this study we implemented Fock matrix construction program on the Cell processor, and evaluated computational performance. As a result, there were two kinds of main stalls by the branch prediction and the data alignment, which are controlled by software mechanism for the simplification of the Cell processor hardware. It is possible to improve the performance about 30%, if the branch prediction hit ratio could be improved to 99%. For data alignment stall, a part of stalls, which is originated by data shuffle pipeline, could be decreased by preparing hardware data alignment mechanism.

Original languageEnglish
Title of host publicationComputation in Modern Science and Engineering - Proceedings of the International Conference on Computational Methods in Science and Engineering 2007 (ICCMSE 2007)
Pages64-67
Number of pages4
Edition2
DOIs
Publication statusPublished - Dec 1 2007
EventInternational Conference on Computational Methods in Science and Engineering 2007, ICCMSE 2007 - Corfu, Greece
Duration: Sep 25 2007Sep 30 2007

Publication series

NameAIP Conference Proceedings
Number2
Volume963
ISSN (Print)0094-243X
ISSN (Electronic)1551-7616

Other

OtherInternational Conference on Computational Methods in Science and Engineering 2007, ICCMSE 2007
CountryGreece
CityCorfu
Period9/25/079/30/07

Fingerprint

central processing units
evaluation
cells
chips
alignment
hardware
multimedia
predictions
simplification
molecular orbitals
computer programs
matrices

All Science Journal Classification (ASJC) codes

  • Physics and Astronomy(all)

Cite this

Honda, H., Hayashi, T., Inadomi, Y., Inoue, K., & Murakami, K. J. (2007). Implementation and evaluation of Fock matrix calculation program on the Cell processor. In Computation in Modern Science and Engineering - Proceedings of the International Conference on Computational Methods in Science and Engineering 2007 (ICCMSE 2007) (2 ed., pp. 64-67). (AIP Conference Proceedings; Vol. 963, No. 2). https://doi.org/10.1063/1.2836167

Implementation and evaluation of Fock matrix calculation program on the Cell processor. / Honda, Hiroaki; Hayashi, Tetsuo; Inadomi, Yuichi; Inoue, Koji; Murakami, Kazuaki J.

Computation in Modern Science and Engineering - Proceedings of the International Conference on Computational Methods in Science and Engineering 2007 (ICCMSE 2007). 2. ed. 2007. p. 64-67 (AIP Conference Proceedings; Vol. 963, No. 2).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Honda, H, Hayashi, T, Inadomi, Y, Inoue, K & Murakami, KJ 2007, Implementation and evaluation of Fock matrix calculation program on the Cell processor. in Computation in Modern Science and Engineering - Proceedings of the International Conference on Computational Methods in Science and Engineering 2007 (ICCMSE 2007). 2 edn, AIP Conference Proceedings, no. 2, vol. 963, pp. 64-67, International Conference on Computational Methods in Science and Engineering 2007, ICCMSE 2007, Corfu, Greece, 9/25/07. https://doi.org/10.1063/1.2836167
Honda H, Hayashi T, Inadomi Y, Inoue K, Murakami KJ. Implementation and evaluation of Fock matrix calculation program on the Cell processor. In Computation in Modern Science and Engineering - Proceedings of the International Conference on Computational Methods in Science and Engineering 2007 (ICCMSE 2007). 2 ed. 2007. p. 64-67. (AIP Conference Proceedings; 2). https://doi.org/10.1063/1.2836167
Honda, Hiroaki ; Hayashi, Tetsuo ; Inadomi, Yuichi ; Inoue, Koji ; Murakami, Kazuaki J. / Implementation and evaluation of Fock matrix calculation program on the Cell processor. Computation in Modern Science and Engineering - Proceedings of the International Conference on Computational Methods in Science and Engineering 2007 (ICCMSE 2007). 2. ed. 2007. pp. 64-67 (AIP Conference Proceedings; 2).
@inproceedings{7922867770d24403bdeede1f7a7162a1,
title = "Implementation and evaluation of Fock matrix calculation program on the Cell processor",
abstract = "Various processor architectures have been proposed until today, and the performance has improved remarkably. Recently, the Chip Multi-processors (CMPs), which has many processor cores onto a chip, are proposed for further performance improvement. The Cell processor is one of such CMP and shows high computational performance. Although this processor is designed for the multimedia, that high performance character can be utilized to molecular orbital calculation. In this study we implemented Fock matrix construction program on the Cell processor, and evaluated computational performance. As a result, there were two kinds of main stalls by the branch prediction and the data alignment, which are controlled by software mechanism for the simplification of the Cell processor hardware. It is possible to improve the performance about 30{\%}, if the branch prediction hit ratio could be improved to 99{\%}. For data alignment stall, a part of stalls, which is originated by data shuffle pipeline, could be decreased by preparing hardware data alignment mechanism.",
author = "Hiroaki Honda and Tetsuo Hayashi and Yuichi Inadomi and Koji Inoue and Murakami, {Kazuaki J.}",
year = "2007",
month = "12",
day = "1",
doi = "10.1063/1.2836167",
language = "English",
isbn = "9780735404786",
series = "AIP Conference Proceedings",
number = "2",
pages = "64--67",
booktitle = "Computation in Modern Science and Engineering - Proceedings of the International Conference on Computational Methods in Science and Engineering 2007 (ICCMSE 2007)",
edition = "2",

}

TY - GEN

T1 - Implementation and evaluation of Fock matrix calculation program on the Cell processor

AU - Honda, Hiroaki

AU - Hayashi, Tetsuo

AU - Inadomi, Yuichi

AU - Inoue, Koji

AU - Murakami, Kazuaki J.

PY - 2007/12/1

Y1 - 2007/12/1

N2 - Various processor architectures have been proposed until today, and the performance has improved remarkably. Recently, the Chip Multi-processors (CMPs), which has many processor cores onto a chip, are proposed for further performance improvement. The Cell processor is one of such CMP and shows high computational performance. Although this processor is designed for the multimedia, that high performance character can be utilized to molecular orbital calculation. In this study we implemented Fock matrix construction program on the Cell processor, and evaluated computational performance. As a result, there were two kinds of main stalls by the branch prediction and the data alignment, which are controlled by software mechanism for the simplification of the Cell processor hardware. It is possible to improve the performance about 30%, if the branch prediction hit ratio could be improved to 99%. For data alignment stall, a part of stalls, which is originated by data shuffle pipeline, could be decreased by preparing hardware data alignment mechanism.

AB - Various processor architectures have been proposed until today, and the performance has improved remarkably. Recently, the Chip Multi-processors (CMPs), which has many processor cores onto a chip, are proposed for further performance improvement. The Cell processor is one of such CMP and shows high computational performance. Although this processor is designed for the multimedia, that high performance character can be utilized to molecular orbital calculation. In this study we implemented Fock matrix construction program on the Cell processor, and evaluated computational performance. As a result, there were two kinds of main stalls by the branch prediction and the data alignment, which are controlled by software mechanism for the simplification of the Cell processor hardware. It is possible to improve the performance about 30%, if the branch prediction hit ratio could be improved to 99%. For data alignment stall, a part of stalls, which is originated by data shuffle pipeline, could be decreased by preparing hardware data alignment mechanism.

UR - http://www.scopus.com/inward/record.url?scp=71449123378&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=71449123378&partnerID=8YFLogxK

U2 - 10.1063/1.2836167

DO - 10.1063/1.2836167

M3 - Conference contribution

AN - SCOPUS:71449123378

SN - 9780735404786

T3 - AIP Conference Proceedings

SP - 64

EP - 67

BT - Computation in Modern Science and Engineering - Proceedings of the International Conference on Computational Methods in Science and Engineering 2007 (ICCMSE 2007)

ER -