In: Proceedings of the 9th International Conference on Distributed Computing Systems; 1989, Newport Beach, CA, USA, pages 151-159. Washington: IEEE Comput. Soc. Press, 1989.
Abstract: Analytical models are presented that use extended Petri nets for fault-tolerant schemes used in distributed systems. Several different schemes are discussed in detail: rollback recovery with checkpointing, recovery blocks, N-version programming, and conversations. A methodology for evaluating a fault-tolerant scheme for a specific system configuration and the steps involved in building a Petri net model are described. The subnet primitives involved in building these models are identified and an algorithm for building the models automatically is described.
Keywords: evaluation (of) fault-tolerant technique; distributed systems; extended net; recovery (with) checkpointing; rollback recovery; subnet primitive.