Computing Reviews
A Design Approach for Ultrareliable Real-Time Systems
Lala J., Harper R., Alger L. Computer 24(5): 12-22, 1991. Type: Article
Date Reviewed: Mar 1 1992

Ultrareliability has become important as designers have begun to incorporate digital computers into systems in which computer failures may involve loss of human life or loss of a critical mission. Examples include fly-by-wire aircraft, dynamically unstable aircraft, nuclear power plant controls, unmanned space vehicles, and autonomous undersea vehicles. Required failure probabilities are in the range of 10^-4 to 10^-10, depending on the nature of the mission and the likelihood that computer failure could result in loss of life. The authors describe their methodology for designing ultrareliable real-time computing systems and then describe an application of these techniques to the design of the Advanced Information Processing System (AIPS).
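For a rough sense of scale, the sketch below converts a target failure rate into the mean time to failure it implies. It assumes, purely for illustration, that the figures above are failure rates per operating hour (the review does not state the time base); the function name and the hours-per-year constant are likewise illustrative.

```python
HOURS_PER_YEAR = 24 * 365.25  # 8766 hours; illustrative constant


def mean_years_to_failure(failures_per_hour: float) -> float:
    """Mean time to failure, in years, for a constant failure rate.

    Assumes the rate is per operating hour, which is an assumption of
    this sketch rather than something stated in the review.
    """
    if failures_per_hour <= 0:
        raise ValueError("failure rate must be positive")
    return 1.0 / failures_per_hour / HOURS_PER_YEAR


# Print the implied mean time to failure at both ends of the quoted range.
for rate in (1e-4, 1e-10):
    years = mean_years_to_failure(rate)
    print(f"{rate:.0e} per hour -> about {years:,.1f} years")
```

Under that assumption, a rate of 10^-10 per hour corresponds to more than a million years of operation per failure on average, far beyond what any test campaign could observe directly.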

A dominant theme of the paper is that the system design must enable failure rates to be determined using analytical models, simulations, and proofs, because the required failure probabilities are too low to be demonstrated by life testing. Subjects covered include the tradeoffs between fault avoidance and fault tolerance, partitioning a design into fault containment regions, voting techniques for masking errors, and the use of fault-tolerant clocking in the presence of Byzantine faults. The authors compare approximate consensus and exact consensus in detecting erroneous outputs, concluding that only exact consensus is amenable to formal methods and analytical verification. The paper is a clear, concise description of the redundancy techniques used in ultrareliable real-time systems.
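The distinction between exact and approximate consensus can be illustrated with a minimal sketch of two voters over redundant channel outputs. This is not the AIPS voter or the authors' algorithm; the function names, the integer and floating-point representations, and the `tolerance` parameter are assumptions made only for illustration.

```python
from collections import Counter
from typing import Optional, Sequence


def exact_vote(outputs: Sequence[int]) -> Optional[int]:
    """Exact consensus: return the value held by a strict majority of
    channels, compared for exact equality, or None if there is none."""
    if not outputs:
        return None
    value, count = Counter(outputs).most_common(1)[0]
    return value if count > len(outputs) // 2 else None


def approximate_vote(outputs: Sequence[float],
                     tolerance: float) -> Optional[float]:
    """Approximate consensus (hypothetical variant): select the median
    and accept it only if a strict majority of channels lie within
    `tolerance` of it. The tolerance is an application-chosen value."""
    if not outputs:
        return None
    ordered = sorted(outputs)
    median = ordered[len(ordered) // 2]
    agreeing = sum(1 for x in ordered if abs(x - median) <= tolerance)
    return median if agreeing > len(ordered) // 2 else None


# One faulty channel out of three is masked by either voter.
masked_exact = exact_vote([7, 7, 9])
masked_approx = approximate_vote([1.00, 1.01, 5.0], tolerance=0.05)
```

In this sketch the exact voter depends only on equality, whereas the approximate voter's result depends on a chosen threshold, which hints at why the exact form is the easier one to reason about analytically.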

Reviewer: Martin W. Sachs
Review #: CR115427
Reliability, Availability, And Serviceability (C.4 ... )
Real Time (J.7 ... )
Real-Time And Embedded Systems (C.3 ... )
Redundant Design (B.4.5 ... )
Redundant Design (B.1.3 ... )
Control Structure Reliability, Testing, And Fault-Tolerance (B.1.3)
