TY - GEN
T1 - A micro-vectorprocessor Architecture - Performance Modeling and Benchmarking -
AU - Hashimoto, Takashi
AU - Hironaka, Tetsuo
AU - Murakami, Kazuaki
AU - Yasuura, Hiroto
PY - 1993/8/1
Y1 - 1993/8/1
N2 - This paper proposes and examines some architectural features suitable for micro-vectorprocessors. Due to the I/O-pin bottleneck, micro-vectorprocessors should save the off-chip memory bandwidth by exploiting the on-chip register bandwidth instead. Those features include the vector-instruction-level multithreading and FIFO vector registers. There are three variations of multithreading: periodic, forced, and round-robin. The paper also formulates the performance of micro-vectorprocessors with such architectural features. And then, the paper evaluates the performance attainable by those micro-vectorprocessors through software simulation. From the benchmark results, it is found that the vector-instruction-level multithreading and FIFO vector registers can improve the performance of the micro-vectorprocessors with the half memory bandwidth comparable to that of ones with the full memory bandwidth. Furthermore, forced multithreading is found to be tolerant to the large memory access latency. From these results, the paper concludes that the forced multithreading at the vector-instruction level is a good candidate for the architectural features suitable to micro-vectorprocessors.
AB - This paper proposes and examines some architectural features suitable for micro-vectorprocessors. Due to the I/O-pin bottleneck, micro-vectorprocessors should save the off-chip memory bandwidth by exploiting the on-chip register bandwidth instead. Those features include the vector-instruction-level multithreading and FIFO vector registers. There are three variations of multithreading: periodic, forced, and round-robin. The paper also formulates the performance of micro-vectorprocessors with such architectural features. And then, the paper evaluates the performance attainable by those micro-vectorprocessors through software simulation. From the benchmark results, it is found that the vector-instruction-level multithreading and FIFO vector registers can improve the performance of the micro-vectorprocessors with the half memory bandwidth comparable to that of ones with the full memory bandwidth. Furthermore, forced multithreading is found to be tolerant to the large memory access latency. From these results, the paper concludes that the forced multithreading at the vector-instruction level is a good candidate for the architectural features suitable to micro-vectorprocessors.
UR - http://www.scopus.com/inward/record.url?scp=33749793639&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=33749793639&partnerID=8YFLogxK
U2 - 10.1145/165939.166000
DO - 10.1145/165939.166000
M3 - Conference contribution
AN - SCOPUS:33749793639
T3 - Proceedings of the International Conference on Supercomputing
SP - 308
EP - 317
BT - Proceedings of the 7th International Conference on Supercomputing, ICS 1993
PB - Association for Computing Machinery
T2 - 7th International Conference on Supercomputing, ICS 1993
Y2 - 19 July 1993 through 23 July 1993
ER -