Computing Reviews, the leading online review service for computing literature.

Search

Maximizing benefits from crowdsourced data
Barbier G., Zafarani R., Gao H., Fung G., Liu H. Computational & Mathematical Organization Theory18 (3):257-279,2012.Type:Article

Date Reviewed: Feb 28 2013

Crowdsourcing involves the production of crowd-generated data. These might be data collected during crises, such as a large set of location-based tweets during a firestorm or severe flooding, or numerous SMS reports to a common system within a war zone. They also include marketplace tasks, such as those easily accessed on Amazon’s Mechanical Turk (MTurk) (http://mturk.com). Human intelligence is necessary to clean, label, or tabulate data in preparation for effective data mining. This involves human intelligence tasks (HITs), or as displayed on the MTurk logo, “artificial artificial intelligence.” The authors provide a compelling and informative description of the processes involved in the collection and preparation of crowdsourced data for effective deployment. I like the focus on developing response systems to emergency or crisis situations. They also discuss the process for data mining crowd sentiment, for example, for commercial applications such as original t-shirt design (http://www.threadless.com/). I was especially interested in the four sections on preparing data for mining, mining data from crowdsourcing, crowdsourced data for response coordination, and when not to crowdsource. These are central to the paper and clearly present the processes for using the data effectively to potentially save lives and respond with improved accuracy in disaster scenarios. This paper would be of interest to those working on data projects or systems development where accurate categorization, tabulation, and noise reduction are required. It would also be useful to help determine if crowdsourcing is appropriate for a particular project. As the authors note, “The wisdom of the crowd allows for more accurate categorization than any other machine learning algorithm.”

Reviewer: Alyx Macfadyen	Review #: CR140967 (1306-0537)

Data Mining (H.2.8 ... )

Social Networking (H.3.4 ... )

Learning (I.2.6 )

Would you recommend this review?

yes

Other reviews under "Data Mining":	Date

Feature selection and effective classifiers Deogun J. (ed), Choubey S., Raghavan V. (ed), Sever H. (ed) Journal of the American Society for Information Science 49(5): 423-434, 1998. Type: Article	May 1 1999

Rule induction with extension matrices Wu X. (ed) Journal of the American Society for Information Science 49(5): 435-454, 1998. Type: Article	Jul 1 1998

Predictive data mining Weiss S., Indurkhya N., Morgan Kaufmann Publishers Inc., San Francisco, CA, 1998. Type: Book (9781558604032)	Feb 1 1999

more...

Reproduction in whole or in part without permission is prohibited. Copyright 1999-2024 ThinkLoud^®
Terms of Use | Privacy Policy