Computing Reviews
On soft error reliability of virtualization infrastructure
Xu X., Huang H. IEEE Transactions on Computers 65 (12): 3727-3739, 2016. Type: Article
Date Reviewed: May 5 2017

Server rooms are often populated with powerful hardware running hypervisors, which in turn host many guest virtual machines. Here, a soft hardware error may propagate in surprising ways, affecting the hypervisor itself or one or more guests. In contrast, if a single operating system (OS) runs directly on hardware, a soft error affects only that single machine. This paper experimentally investigates soft error propagation in a virtualized server room. A soft error happens when central processing unit (CPU) registers are afflicted by random bit flips from error sources that are possible, though unwanted and unplanned, and survivable. A soft error is different from, for instance, a hard failure caused by overheating.

This work is divided into two major parts: an extensive study of the propagation of soft errors, and a shorter discussion of options for fault tolerance. The propagation study injects faults using a simulation environment, and draws measurements and observations from instruction traces, fault locations, and crash analysis. The discussion considers existing fault tolerance techniques in light of those experimental results.
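To make the idea of register-level fault injection concrete, the following is a minimal sketch of a single-bit flip applied to a simulated register file. It is not the authors' injection framework: the 64-bit register width, the function names (flip_bit, inject_fault), and the dictionary model of a register file are all assumptions made for illustration.

```python
import random

REGISTER_WIDTH = 64  # assumed register width in bits (illustrative only)


def flip_bit(value: int, bit: int, width: int = REGISTER_WIDTH) -> int:
    """Return value with one bit inverted, truncated to the register width."""
    if not 0 <= bit < width:
        raise ValueError("bit index outside register width")
    mask = (1 << width) - 1
    return (value ^ (1 << bit)) & mask


def inject_fault(registers, rng, width: int = REGISTER_WIDTH):
    """Flip one randomly chosen bit in one randomly chosen register.

    registers is a hypothetical model of a register file: a dict mapping
    register names to integer contents. The input is left unmodified; a
    faulty copy is returned together with the fault location.
    """
    name = rng.choice(sorted(registers))  # sorted for reproducibility
    bit = rng.randrange(width)
    faulty = dict(registers)
    faulty[name] = flip_bit(registers[name], bit, width)
    return faulty, name, bit


# Seeding the generator makes the injection repeatable, which is what
# allows a given fault to be replayed and its propagation traced.
regs = {"rax": 0, "rbx": 0xFF, "rsp": 0x7FFF0000}
faulty, name, bit = inject_fault(regs, random.Random(0))
```

Recording the returned register name and bit index alongside the outcome of each run is one simple way to relate fault locations to observed crashes.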

This work is a valuable contribution toward understanding virtualization behavior with regard to soft errors. However, it does not examine the real causes of random bit flipping; instead, it assumes occurrences as a given, emulating them by deterministic fault injection. Failing to ascertain how likely a failure mode may be does not compromise the study of propagation, but can be a shortcoming in the assessment of fault tolerance.

Reviewer:  A. Squassabia Review #: CR145243 (1707-0464)
Cloud Computing (C.2.4 ... )
Fault Tolerance (C.4 ... )
Software Development (K.6.3 ... )
Reproduction in whole or in part without permission is prohibited.   Copyright 1999-2024 ThinkLoud®