Fault-Tolerant Scheduling with Dynamic Number of Replicas in Heterogeneous Systems

Laiping Zhao, Yizhi Ren, Yang Xiang, Kouichi Sakurai

研究成果: Chapter in Book/Report/Conference proceedingConference contribution

44 引用 (Scopus)

抜粋

In the existing studies on fault-tolerant scheduling, the active replication schema makes use of ε + 1 replicas for each task to tolerate ε failures. However, in this paper, we show that it does not always lead to a higher reliability with more replicas. Besides, the more replicas implies more resource consumption and higher economic cost. To address this problem, with the target to satisfy the user's reliability requirement with minimum resources, this paper proposes a new fault tolerant scheduling algorithm: MaxRe. In the algorithm, we incorporate the reliability analysis into the active replication schema, and exploit a dynamic number of replicas for different tasks. Both the theoretical analysis and experiments prove that the MaxRe algorithm's schedule can certainly satisfy user's reliability requirements. And the MaxRe scheduling algorithm can achieve the corresponding reliability with at most 70% fewer resources than the FTSA algorithm.

元の言語英語
ホスト出版物のタイトルProceedings - 2010 12th IEEE International Conference on High Performance Computing and Communications, HPCC 2010
ページ434-441
ページ数8
DOI
出版物ステータス出版済み - 2010
イベント2010 12th IEEE International Conference on High Performance Computing and Communications, HPCC 2010 - Melbourne, VIC, オーストラリア
継続期間: 9 1 20109 3 2010

その他

その他2010 12th IEEE International Conference on High Performance Computing and Communications, HPCC 2010
オーストラリア
Melbourne, VIC
期間9/1/109/3/10

All Science Journal Classification (ASJC) codes

  • Computational Theory and Mathematics
  • Theoretical Computer Science

フィンガープリント Fault-Tolerant Scheduling with Dynamic Number of Replicas in Heterogeneous Systems' の研究トピックを掘り下げます。これらはともに一意のフィンガープリントを構成します。

  • これを引用

    Zhao, L., Ren, Y., Xiang, Y., & Sakurai, K. (2010). Fault-Tolerant Scheduling with Dynamic Number of Replicas in Heterogeneous Systems. : Proceedings - 2010 12th IEEE International Conference on High Performance Computing and Communications, HPCC 2010 (pp. 434-441) https://doi.org/10.1109/HPCC.2010.72