Computing Reviews
Today's Issue Hot Topics Search Browse Recommended My Account Log In
Review Help
Search
A scalable data chunk similarity based compression approach for efficient big sensing data processing on cloud
Yang C., Chen J. IEEE Transactions on Knowledge and Data Engineering29 (6):1144-1157,2017.Type:Article
Date Reviewed: Jul 25 2017

Marrying sensors and innovative industries (for example, think aircraft engines instrumented for real-time preventive maintenance) leads to the creation of a huge time series of data samples. One appealing approach to efficiently managing this flood of data is to take advantage of cloud storage and computing platforms. Even though current infrastructures are rather large, the focus on efficient data handling remains paramount for both performance and cost matters.

The paper suggests a novel way to use lossy data compression to better manage these resources. More specifically, the idea is to cluster different but somewhat correlated data from all managed streams (for example, values from temperature gauges located close to one another or sequences of values from slow-varying sensors), create dedicated dictionaries for those chunks of data samples, and then use them to compress the stream clusters. The authors describe how the phases of data chunk generation and compression present in this new approach can be scaled to cloud-like proportion by leveraging the now-standard MapReduce paradigm. Experiments over terabyte-sized meteorological datasets show that significant cost, space, and time savings can be expected.

The main problem with this paper, as far as I can see, is that it has not been carefully prepared and edited: the approximate English makes the ideas and somewhat formal derivations difficult to understand, and the algorithms are at best glorified extracts of Java code, with too many inaccuracies. I cannot recommend reading this paper; skimming its abstract should be enough to get a feel for the ideas.

Reviewer:  P. Jouvelot Review #: CR145441 (1710-0663)
Bookmark and Share
 
Cloud Computing (C.2.4 ... )
 
 
Data Compaction And Compression (E.4 ... )
 
 
Data Models (H.2.1 ... )
 
 
Sensors (I.2.9 ... )
 
Would you recommend this review?
yes
no
Other reviews under "Cloud Computing": Date
Cloud security and privacy: an enterprise perspective on risks and compliance
Mather T., Kumaraswamy S., Latif S., O’Reilly Media, Inc., Sebastopol, CA, 2009.  336, Type: Book (9780596802769), Reviews: (1 of 3)
Dec 14 2009
Cloud security and privacy: an enterprise perspective on risks and compliance
Mather T., Kumaraswamy S., Latif S., O’Reilly Media, Inc., Sebastopol, CA, 2009.  336, Type: Book (9780596802769), Reviews: (2 of 3)
Jan 26 2010
Cloud security and privacy: an enterprise perspective on risks and compliance
Mather T., Kumaraswamy S., Latif S., O’Reilly Media, Inc., Sebastopol, CA, 2009.  336, Type: Book (9780596802769), Reviews: (3 of 3)
Mar 18 2010
more...

E-Mail This Printer-Friendly
Send Your Comments
Contact Us
Reproduction in whole or in part without permission is prohibited.   Copyright 1999-2024 ThinkLoud®
Terms of Use
| Privacy Policy