Computing Reviews
Today's Issue Hot Topics Search Browse Recommended My Account Log In
Review Help
Search
Crowdsourced data management: a survey
Li G., Wang J., Zheng Y., Franklin M. IEEE Transactions on Knowledge and Data Engineering28 (9):2296-2319,2016.Type:Article
Date Reviewed: Apr 18 2017

Many data management and analytics tasks, notably entity resolution, sentiment analysis, and image recognition, cannot always be fulfilled through automated software processes alone, but also require the application of human cognition. Human computation capabilities can be harnessed using crowdsourced platforms. This paper “surveys and synthesizes a [broad range] of existing studies on crowdsourced data management” and then “outlines key factors that [should] be considered to improve crowdsourced data management.”

A major focus of the paper is on three key problems in crowdsourced data management, namely quality control, cost control, and latency control. Quality control covers how to prevent low-quality results, “such as eliminating low-quality workers.” Cost control addresses the issue of how to ensure that costs are not more than necessary to complete the crowdsourcing tasks. One way of doing this is using pruning algorithms to eliminate unnecessary tasks. Latency control discusses strategies for meeting established time constraints, such as pricing.

The paper gives considerable attention to crowdsourced operators that have been proposed to improve real-world applications, including filtering, find, and search operators. “Crowdsourcing systems that integrate [crowdsourced] relational database management systems ... to process computer-hard queries” are discussed. Two crowdsourcing platforms, Amazon Mechanical Turk and CrowdFlower, are examined.

The paper is very thorough, clear, and detailed. Those readers who follow crowdsourced data management should find this paper a very valuable reference.

Reviewer:  David G. Hill Review #: CR145203 (1707-0477)
Bookmark and Share
  Featured Reviewer  
 
Data Models (H.2.1 ... )
 
 
Electronic Mail (H.4.3 ... )
 
Would you recommend this review?
yes
no
Other reviews under "Data Models": Date
A transient hypergraph-based model for data access
Watters C., Shepherd M. ACM Transactions on Information Systems 8(2): 77-102, 2001. Type: Article
Jun 1 1991
Toward a unified framework for version modeling in engineering databases
Katz R. ACM Computing Surveys 22(4): 375-409, 2001. Type: Article
Feb 1 1993
Graph data model and its data language
Kunii H., Springer-Verlag New York, Inc., New York, NY, 1990. Type: Book (9780387700588)
Dec 1 1991
more...

E-Mail This Printer-Friendly
Send Your Comments
Contact Us
Reproduction in whole or in part without permission is prohibited.   Copyright 1999-2024 ThinkLoud®
Terms of Use
| Privacy Policy