Computing Reviews
Today's Issue Hot Topics Search Browse Recommended My Account Log In
Review Help
Search
The Totem single-ring ordering and membership protocol
Amir Y., Moser L., Melliar-Smith P., Agarwal D., Ciarfella P. ACM Transactions on Computer Systems13 (4):311-342,1995.Type:Article
Date Reviewed: Feb 1 1997

Totem is a single-ring protocol for high-performance, fault-tolerant distributed systems that must continue to operate despite network partitioning and re-merging and despite processor failure and restart. Totem provides totally ordered message delivery with good performance using a logical token-passing ring imposed on a broadcast domain.

After an introductory section, the authors present related work and highlight the differences of the Totem protocol. Significant literature on the subject is analyzed. Section 3 is dedicated to the distributed system model used in the Totem protocol design. Several terms related to protocol functioning are defined.

The objective of Totem is to provide the application with reliable message delivery and membership services. These services are described in section 4 of the paper. Section 5 is devoted to the total ordering protocol with the assumptions that the token is never lost; processor failures do not occur; and the network does not become partitioned. In section 6, the conditions are relaxed, and the protocol to handle token loss, processor failure and restart, and network partitioning and re-merging is presented. The protocol is described using a finite-state machine model. Data structures used, as well as pseudocode for the work performed by processors during different states of the model, are also given.

Sections 7 and 8 present the recovery protocol that maintains extended virtual synchrony during recovery after a failure, and the flow control mechanism that avoids message loss due to buffer overflow. Section 9 addresses implementation and performance. Future work is mentioned at the end of the paper.

The paper is well structured, but the presentation is not uniform, some aspects being described in great detail, while others are quickly summarized. The important works on the subject are included as references.

Reviewer:  V. Cristea Review #: CR124511 (9702-0114)
Bookmark and Share
 
Protocol Architecture (C.2.2 ... )
 
 
Distributed Systems (D.4.7 ... )
 
 
Fault-Tolerance (D.4.5 ... )
 
 
Network Communication (D.4.4 ... )
 
 
Network Communications (C.2.1 ... )
 
 
Network Operating Systems (C.2.4 ... )
 
  more  
Would you recommend this review?
yes
no
Other reviews under "Protocol Architecture": Date
Efficient at-most-once messages based on synchronized clocks
Liskov B., Shrira L., Wroclawski J. ACM Transactions on Computer Systems 9(2): 125-142, 1991. Type: Article
May 1 1992
Communications for cooperating systems
Cypser R., Addison-Wesley Longman Publishing Co., Inc., Boston, MA, 1991. Type: Book (9780201507751)
Oct 1 1992
Data communications: the implications of communication systems for protocol design
Goldstein B., Jaffe J. IBM Systems Journal 26(1): 122-136, 1987. Type: Article
Feb 1 1988
more...

E-Mail This Printer-Friendly
Send Your Comments
Contact Us
Reproduction in whole or in part without permission is prohibited.   Copyright 1999-2024 ThinkLoud®
Terms of Use
| Privacy Policy