Computing Reviews
Today's Issue Hot Topics Search Browse Recommended My Account Log In
Review Help
Search
Maximizing benefits from crowdsourced data
Barbier G., Zafarani R., Gao H., Fung G., Liu H. Computational & Mathematical Organization Theory18 (3):257-279,2012.Type:Article
Date Reviewed: Feb 28 2013

Crowdsourcing involves the production of crowd-generated data. These might be data collected during crises, such as a large set of location-based tweets during a firestorm or severe flooding, or numerous SMS reports to a common system within a war zone. They also include marketplace tasks, such as those easily accessed on Amazon’s Mechanical Turk (MTurk) (http://mturk.com). Human intelligence is necessary to clean, label, or tabulate data in preparation for effective data mining. This involves human intelligence tasks (HITs), or as displayed on the MTurk logo, “artificial artificial intelligence.”

The authors provide a compelling and informative description of the processes involved in the collection and preparation of crowdsourced data for effective deployment. I like the focus on developing response systems to emergency or crisis situations. They also discuss the process for data mining crowd sentiment, for example, for commercial applications such as original t-shirt design (http://www.threadless.com/).

I was especially interested in the four sections on preparing data for mining, mining data from crowdsourcing, crowdsourced data for response coordination, and when not to crowdsource. These are central to the paper and clearly present the processes for using the data effectively to potentially save lives and respond with improved accuracy in disaster scenarios.

This paper would be of interest to those working on data projects or systems development where accurate categorization, tabulation, and noise reduction are required. It would also be useful to help determine if crowdsourcing is appropriate for a particular project. As the authors note, “The wisdom of the crowd allows for more accurate categorization than any other machine learning algorithm.”

Reviewer:  Alyx Macfadyen Review #: CR140967 (1306-0537)
Bookmark and Share
  Reviewer Selected
Featured Reviewer
 
 
Data Mining (H.2.8 ... )
 
 
Social Networking (H.3.4 ... )
 
 
Learning (I.2.6 )
 
Would you recommend this review?
yes
no
Other reviews under "Data Mining": Date
Feature selection and effective classifiers
Deogun J. (ed), Choubey S., Raghavan V. (ed), Sever H. (ed) Journal of the American Society for Information Science 49(5): 423-434, 1998. Type: Article
May 1 1999
Rule induction with extension matrices
Wu X. (ed) Journal of the American Society for Information Science 49(5): 435-454, 1998. Type: Article
Jul 1 1998
Predictive data mining
Weiss S., Indurkhya N., Morgan Kaufmann Publishers Inc., San Francisco, CA, 1998. Type: Book (9781558604032)
Feb 1 1999
more...

E-Mail This Printer-Friendly
Send Your Comments
Contact Us
Reproduction in whole or in part without permission is prohibited.   Copyright 1999-2024 ThinkLoud®
Terms of Use
| Privacy Policy