JISE




Journal of Information Science and Engineering, Vol. 30 No. 1, pp. 25-41


Approximate Reliability Evaluation of Large-Scale Distributed Systems


YUCHANG MO¹, JIANMIN HAN¹, ZHIZHENG ZHANG², ZHUSHENG PAN¹ AND FARONG ZHONG¹
¹Department of Computer Science and Engineering
Zhejiang Normal University
Jinhua, 321004 P.R. China
²School of Computer Science and Engineering
Southeast University
Nanjing, 210089 P.R. China


    Modern distributed systems may consist of hundreds of thousands of computing elements, and system design will follow a partly bottom-up approach in which computing elements are glued together into a full-fledged system of very large scale. This new kind of scalability and complexity poses unprecedented challenges for traditional Binary Decision Diagram (BDD)-based reliability evaluation. To evaluate the complex structure functions of large-scale distributed systems with limited computational resources, a common practice is to truncate the BDD. The truncated BDD is much smaller than the exact BDD, and an approximate reliability can be calculated from it. In this paper, an efficient BDD truncation algorithm is proposed that adopts a lazy mechanism and a writeback mechanism to reduce the time and space consumption of truncation by sharing sub-BDDs. Two approximate reliability evaluation methods are proposed, and the corresponding truncation error estimators are developed based on the truncation limits or the bounds of the exact reliability. A set of evaluation procedures is developed for different evaluation requirements.
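
    The idea of truncation with an error estimate can be illustrated by a minimal sketch. It is not the algorithm proposed in the paper: it expands a plain Shannon decomposition tree with no sub-BDD sharing and no lazy or writeback mechanism. The names truncated_reliability, two_out_of_three and limit are hypothetical, and component failures are assumed to be independent. Any branch whose path probability falls below the truncation limit is cut, and the discarded probability mass gives the gap between a lower and an upper bound on the exact reliability.

```python
from typing import Callable, Dict, Optional, Sequence, Tuple

# Three-valued structure function (an assumption of this sketch):
# True = system works, False = system fails, None = not yet decided.
# It must return True or False once every component has a state.
Structure = Callable[[Dict[int, bool]], Optional[bool]]


def truncated_reliability(
    phi: Structure, p: Sequence[float], limit: float
) -> Tuple[float, float]:
    """Return (lower, upper) bounds on the probability that the system works.

    p[i] is the probability that component i works (independence assumed).
    Undecided branches with path probability below `limit` are cut.
    """
    lower = 0.0      # mass of explored paths that end in "works"
    truncated = 0.0  # mass of paths cut by the truncation limit

    def expand(i: int, assign: Dict[int, bool], prob: float) -> None:
        nonlocal lower, truncated
        state = phi(assign)
        if state is True:
            lower += prob
            return
        if state is False:
            return
        if prob < limit:
            truncated += prob  # outcome unknown: counts only toward the upper bound
            return
        # Shannon decomposition on component i: works / fails.
        for value, q in ((True, p[i]), (False, 1.0 - p[i])):
            assign[i] = value
            expand(i + 1, assign, prob * q)
        del assign[i]

    expand(0, {}, 1.0)
    return lower, lower + truncated


def two_out_of_three(assign: Dict[int, bool]) -> Optional[bool]:
    """Toy system: works if at least two of three components work."""
    up = sum(1 for v in assign.values() if v)
    down = len(assign) - up
    if up >= 2:
        return True
    if down >= 2:
        return False
    return None


if __name__ == "__main__":
    print(truncated_reliability(two_out_of_three, [0.9, 0.9, 0.9], 0.0))
    print(truncated_reliability(two_out_of_three, [0.9, 0.9, 0.9], 0.095))
```

    For this toy system the exact reliability is 0.972. With limit = 0.095 the sketch cuts two branches of probability 0.09 each and returns bounds of approximately 0.81 and 0.99, so the width of the interval equals the truncated mass of 0.18.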


Keywords: reliability evaluation, distributed system, binary decision diagram, approximate analysis, error estimation
