CPCI Stack

Metric for Accurate Bottleneck Analysis on OoO Microprocessors

Research output: Chapter in Book/Report/Conference proceedingConference contribution

1 Citation (Scopus)

Abstract

Correctly understanding microarchitectural bottlenecks is important to optimize performance and energy of OoO (Out-of-Order) processors. Although CPI (Cycles Per Instruction) stack has been utilized for this purpose, it stacks architectural events heuristically by counting how many times the events occur, and the order of stacking affects the result, which may be misleading. It is because CPI stack does not consider the execution path of dynamic instructions. Critical path analysis (CPA) is a well-known method to identify the critical execution path of dynamic instruction execution on OoO processors. The critical path consists of the sequence of events that determines the execution time of a program on a certain processor. We develop a novel representation of CPCI stack (Cycles Per Critical Instruction stack), which is CPI stack based on CPA. The main challenge in constructing CPCI stack is how to analyze a large number of paths because CPA often results in numerous critical paths. In this paper, we show that there are more than ten to the tenth power critical paths in the execution of only one thousand instructions in 35 benchmarks out of 48 from SPEC CPU2006. Then, we propose a statistical method to analyze all the critical paths and show a case study using the benchmarks.

Original languageEnglish
Title of host publicationProceedings - 2017 5th International Symposium on Computing and Networking, CANDAR 2017
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages166-172
Number of pages7
Volume2018-January
ISBN (Electronic)9781538620878
DOIs
Publication statusPublished - Apr 23 2018
Event5th International Symposium on Computing and Networking, CANDAR 2017 - Aomori, Japan
Duration: Nov 19 2017Nov 22 2017

Other

Other5th International Symposium on Computing and Networking, CANDAR 2017
CountryJapan
CityAomori
Period11/19/1711/22/17

Fingerprint

Critical path analysis
Microprocessor chips
Statistical methods

All Science Journal Classification (ASJC) codes

  • Artificial Intelligence
  • Computer Networks and Communications
  • Hardware and Architecture

Cite this

Tanimoto, T., Ono, T., & Koji, I. (2018). CPCI Stack: Metric for Accurate Bottleneck Analysis on OoO Microprocessors. In Proceedings - 2017 5th International Symposium on Computing and Networking, CANDAR 2017 (Vol. 2018-January, pp. 166-172). Institute of Electrical and Electronics Engineers Inc.. https://doi.org/10.1109/CANDAR.2017.60

CPCI Stack : Metric for Accurate Bottleneck Analysis on OoO Microprocessors. / Tanimoto, Teruo; Ono, Takatsugu; Koji, Inoue.

Proceedings - 2017 5th International Symposium on Computing and Networking, CANDAR 2017. Vol. 2018-January Institute of Electrical and Electronics Engineers Inc., 2018. p. 166-172.

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Tanimoto, T, Ono, T & Koji, I 2018, CPCI Stack: Metric for Accurate Bottleneck Analysis on OoO Microprocessors. in Proceedings - 2017 5th International Symposium on Computing and Networking, CANDAR 2017. vol. 2018-January, Institute of Electrical and Electronics Engineers Inc., pp. 166-172, 5th International Symposium on Computing and Networking, CANDAR 2017, Aomori, Japan, 11/19/17. https://doi.org/10.1109/CANDAR.2017.60
Tanimoto T, Ono T, Koji I. CPCI Stack: Metric for Accurate Bottleneck Analysis on OoO Microprocessors. In Proceedings - 2017 5th International Symposium on Computing and Networking, CANDAR 2017. Vol. 2018-January. Institute of Electrical and Electronics Engineers Inc. 2018. p. 166-172 https://doi.org/10.1109/CANDAR.2017.60
Tanimoto, Teruo ; Ono, Takatsugu ; Koji, Inoue. / CPCI Stack : Metric for Accurate Bottleneck Analysis on OoO Microprocessors. Proceedings - 2017 5th International Symposium on Computing and Networking, CANDAR 2017. Vol. 2018-January Institute of Electrical and Electronics Engineers Inc., 2018. pp. 166-172
@inproceedings{4d100fcf29c14a4984e4766813821547,
title = "CPCI Stack: Metric for Accurate Bottleneck Analysis on OoO Microprocessors",
abstract = "Correctly understanding microarchitectural bottlenecks is important to optimize performance and energy of OoO (Out-of-Order) processors. Although CPI (Cycles Per Instruction) stack has been utilized for this purpose, it stacks architectural events heuristically by counting how many times the events occur, and the order of stacking affects the result, which may be misleading. It is because CPI stack does not consider the execution path of dynamic instructions. Critical path analysis (CPA) is a well-known method to identify the critical execution path of dynamic instruction execution on OoO processors. The critical path consists of the sequence of events that determines the execution time of a program on a certain processor. We develop a novel representation of CPCI stack (Cycles Per Critical Instruction stack), which is CPI stack based on CPA. The main challenge in constructing CPCI stack is how to analyze a large number of paths because CPA often results in numerous critical paths. In this paper, we show that there are more than ten to the tenth power critical paths in the execution of only one thousand instructions in 35 benchmarks out of 48 from SPEC CPU2006. Then, we propose a statistical method to analyze all the critical paths and show a case study using the benchmarks.",
author = "Teruo Tanimoto and Takatsugu Ono and Inoue Koji",
year = "2018",
month = "4",
day = "23",
doi = "10.1109/CANDAR.2017.60",
language = "English",
volume = "2018-January",
pages = "166--172",
booktitle = "Proceedings - 2017 5th International Symposium on Computing and Networking, CANDAR 2017",
publisher = "Institute of Electrical and Electronics Engineers Inc.",
address = "United States",

}

TY - GEN

T1 - CPCI Stack

T2 - Metric for Accurate Bottleneck Analysis on OoO Microprocessors

AU - Tanimoto, Teruo

AU - Ono, Takatsugu

AU - Koji, Inoue

PY - 2018/4/23

Y1 - 2018/4/23

N2 - Correctly understanding microarchitectural bottlenecks is important to optimize performance and energy of OoO (Out-of-Order) processors. Although CPI (Cycles Per Instruction) stack has been utilized for this purpose, it stacks architectural events heuristically by counting how many times the events occur, and the order of stacking affects the result, which may be misleading. It is because CPI stack does not consider the execution path of dynamic instructions. Critical path analysis (CPA) is a well-known method to identify the critical execution path of dynamic instruction execution on OoO processors. The critical path consists of the sequence of events that determines the execution time of a program on a certain processor. We develop a novel representation of CPCI stack (Cycles Per Critical Instruction stack), which is CPI stack based on CPA. The main challenge in constructing CPCI stack is how to analyze a large number of paths because CPA often results in numerous critical paths. In this paper, we show that there are more than ten to the tenth power critical paths in the execution of only one thousand instructions in 35 benchmarks out of 48 from SPEC CPU2006. Then, we propose a statistical method to analyze all the critical paths and show a case study using the benchmarks.

AB - Correctly understanding microarchitectural bottlenecks is important to optimize performance and energy of OoO (Out-of-Order) processors. Although CPI (Cycles Per Instruction) stack has been utilized for this purpose, it stacks architectural events heuristically by counting how many times the events occur, and the order of stacking affects the result, which may be misleading. It is because CPI stack does not consider the execution path of dynamic instructions. Critical path analysis (CPA) is a well-known method to identify the critical execution path of dynamic instruction execution on OoO processors. The critical path consists of the sequence of events that determines the execution time of a program on a certain processor. We develop a novel representation of CPCI stack (Cycles Per Critical Instruction stack), which is CPI stack based on CPA. The main challenge in constructing CPCI stack is how to analyze a large number of paths because CPA often results in numerous critical paths. In this paper, we show that there are more than ten to the tenth power critical paths in the execution of only one thousand instructions in 35 benchmarks out of 48 from SPEC CPU2006. Then, we propose a statistical method to analyze all the critical paths and show a case study using the benchmarks.

UR - http://www.scopus.com/inward/record.url?scp=85050366795&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=85050366795&partnerID=8YFLogxK

U2 - 10.1109/CANDAR.2017.60

DO - 10.1109/CANDAR.2017.60

M3 - Conference contribution

VL - 2018-January

SP - 166

EP - 172

BT - Proceedings - 2017 5th International Symposium on Computing and Networking, CANDAR 2017

PB - Institute of Electrical and Electronics Engineers Inc.

ER -