Computing Reviews, the leading online review service for computing literature.

Search

On soft error reliability of virtualization infrastructure
Xu X., Huang H. IEEE Transactions on Computers65 (12):3727-3739,2016.Type:Article

Date Reviewed: May 5 2017

Server rooms are often populated with powerful hardware running hypervisors, in turn hosting many guest virtual machines. Here, a soft hardware error may propagate in surprising ways affecting the hypervisor itself, or one or more guests. In contrast, if a single operating system (OS) runs directly on hardware, a soft error affects only that single machine. This paper investigates experimentally soft error propagation in a virtualized server room. A soft error happens when central processing unit (CPU) registers are afflicted by random bit flipping from possible--but unwanted and unplanned--survivable error sources. A soft error is different from, for instance, a hard failure caused by overheating. This work is divided into two major parts: an extensive study of the propagation of soft errors, and a shorter discussion of options for fault tolerance. The propagation study injects faults using a simulation environment, and draws measurements and observations from instruction traces, fault locations, and crash analysis. The discussion considers existing fault tolerance techniques in light of those experimental results. This work is a valuable contribution toward understanding virtualization behavior with regard to soft errors. However, it does not examine the real causes of random bit flipping; instead, it assumes occurrences as a given, emulating them by deterministic fault injection. Failing to ascertain how likely a failure mode may be does not compromise the study of propagation, but can be a shortcoming in the assessment of fault tolerance.

Reviewer: A. Squassabia	Review #: CR145243 (1707-0464)

Cloud Computing (C.2.4 ... )

Fault Tolerance (C.4 ... )

Software Development (K.6.3 ... )

Would you recommend this review?

yes

Other reviews under "Cloud Computing":	Date

Cloud security and privacy: an enterprise perspective on risks and compliance Mather T., Kumaraswamy S., Latif S., O’Reilly Media, Inc., Sebastopol, CA, 2009. 336, Type: Book (9780596802769), Reviews: (1 of 3)	Dec 14 2009

Cloud security and privacy: an enterprise perspective on risks and compliance Mather T., Kumaraswamy S., Latif S., O’Reilly Media, Inc., Sebastopol, CA, 2009. 336, Type: Book (9780596802769), Reviews: (2 of 3)	Jan 26 2010

Cloud security and privacy: an enterprise perspective on risks and compliance Mather T., Kumaraswamy S., Latif S., O’Reilly Media, Inc., Sebastopol, CA, 2009. 336, Type: Book (9780596802769), Reviews: (3 of 3)	Mar 18 2010

more...

Reproduction in whole or in part without permission is prohibited. Copyright 1999-2024 ThinkLoud^®
Terms of Use | Privacy Policy