Article code | Journal code | Publication year | English article | Full-text version |
---|---|---|---|---|
492442 | 721571 | 2016 | 18-page PDF | Free download |
• Propose a three-phase Byzantine Fault Tolerance (BFT) mechanism.
• Integrate replication, checkpointing and message logging techniques.
• Develop the BFT mechanism in a transparent manner.
• Remove epidemic effects of Byzantine failures.
• Evaluate the BFT mechanism using a real-world simulation model.
A parallel and distributed simulation (federation) is composed of a number of simulation components (federates). Because the federates may be developed by different participants and executed on different platforms, they are subject to Byzantine failures. Moreover, a failure may propagate through the federation, resulting in an epidemic effect. In this article, a three-phase (i.e., detection, location, and recovery) Byzantine Fault Tolerance (BFT) mechanism is proposed, based on a transparent middleware approach. Replication, checkpointing, and message logging techniques are integrated in the mechanism to enhance simulation performance and reduce the cost of fault tolerance. In addition, mechanisms are provided to remove the epidemic effects of Byzantine failures. Our experiments verify the correctness of the three-phase BFT mechanism and illustrate its high efficiency and good scalability. For some simulation executions, the BFT mechanism may even achieve performance enhancement and Byzantine fault tolerance simultaneously.
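The abstract names the detection and location phases but does not say how they work. As a generic illustration only, and not the authors' mechanism, the sketch below shows one common way replicated components can be checked: compare the outputs of the replicas and flag any that disagree with a strict majority. The function name `vote`, the replica names, and the output values are all hypothetical. The recovery phase (checkpointing and message logging) is not modelled.

```python
from collections import Counter
from collections.abc import Hashable
from typing import Optional


def vote(outputs: dict[str, Hashable]) -> tuple[Optional[Hashable], list[str]]:
    """Generic strict-majority vote over replica outputs (illustrative assumption).

    Returns (majority_value, suspects). majority_value is None when no value
    is reported by more than half of the replicas; in that case every replica
    is returned as a suspect because the fault cannot be located.
    """
    if not outputs:
        return None, []
    counts = Counter(outputs.values())
    value, n = counts.most_common(1)[0]
    if n * 2 <= len(outputs):
        # Detection succeeded (replicas disagree) but location failed.
        return None, sorted(outputs)
    # Location: replicas whose output differs from the majority value.
    suspects = sorted(name for name, out in outputs.items() if out != value)
    return value, suspects


if __name__ == "__main__":
    # Hypothetical outputs from three replicas of one federate.
    value, suspects = vote({"replica_a": 42, "replica_b": 42, "replica_c": 7})
    print(value, suspects)  # prints: 42 ['replica_c']
```

With a trusted comparator such as this, 2f + 1 replicas are enough to outvote f faulty ones, which is why the example uses three replicas to mask a single fault.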
Journal: Simulation Modelling Practice and Theory - Volume 60, January 2016, Pages 90–107