Document Type

Technical Report

Publication Date

1993

Filename

WUCS-93-37.pdf

DOI:

10.7936/K7513WKN

Technical Report Number

WUCS-93-37

Abstract

This paper presents the analysis of an improved distributed checkpointing algorithm. It shows that the message volume of Koo and Toueg's distributed checkpointing algorithm approaches 3fN for large checkpoint intervals where N is the number of processes and processes randomly send messages to f other processes. Thus, the average mesage volume is O(n2). We show how Koo and Toueg's algorithm can be modified so as to avoid this O9n2) overhead and derive an accurate estimate of the message volume. The overhead is reduced by using dependency knowledge to substantially reduce the average message volume.

Comments

Permanent URL: http://dx.doi.org/10.7936/K7513WKN

Share

COinS