Computing Reviews, the leading online review service for computing literature.

On truth discovery in social sensing: a maximum likelihood estimation approach
Wang D., Kaplan L., Le H., Abdelzaher T. IPSN 2012 (Proceedings of the 11th International Conference on Information Processing in Sensor Networks, Beijing, China, Apr 16-20, 2012), 233-244, 2012. Type: Proceedings

Date Reviewed: Jun 1 2012

Truth discovery in noisy social sensing data is a challenging task. This paper presents a maximum likelihood estimation algorithm that computes the probability that a given measurement is true, based on sensor data collected by people carrying out sensing tasks. The authors describe numerous application scenarios, such as geotagging campaigns in which people use sensors to collect data of mutual interest. The central focus of the paper is on assigning true or false values to observations given only the measurements that are sent, without any a priori knowledge about the sources sending the sensor data. The proposed expectation maximization (EM) algorithm finds the maximum likelihood estimate of the parameters of a statistical model with incomplete data. The EM algorithm takes an observation matrix of social sensing data as input, and yields the maximum likelihood estimate of each participant’s reliability, together with the correctness (true or false) of each observed variable. For the evaluation, the authors compare this approach with a Bayesian interpretation scheme and three other fact-finder schemes, varying the number of participants and the number of observations per participant. The estimation accuracy of the EM algorithm is the highest of all the approaches compared. Furthermore, a geotagging case study, in which participants visiting a park were asked to geotag and report the location of litter (possibly misidentifying the litter or its location), provides highly relevant evidence that EM finds litter patterns with the greatest accuracy. Another example considers ten events during Hurricane Irene reported by the media via Twitter. Here, the results demonstrate the value of the EM algorithm, which reports all ten events correctly from a large volume of noisy data (600,000 analyzed tweets). The main contribution of this paper is its demonstration of the accuracy of the EM algorithm for identifying reliable information from social sensing data.
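
To make the EM idea described above concrete, the following is a minimal sketch in Python, not the authors' implementation. It assumes a binary observation matrix (`obs[i][j]` is 1 if participant `i` reported claim `j`), a simple model in which each participant reports true claims with probability `a[i]` and false claims with probability `b[i]`, and a shared prior `d` that a claim is true. The function name, the initial values, and the clipping constant `eps` are all illustrative assumptions.

```python
import math


def _sigmoid(x):
    """Numerically stable logistic function."""
    if x >= 0:
        return 1.0 / (1.0 + math.exp(-x))
    e = math.exp(x)
    return e / (1.0 + e)


def em_truth_discovery(obs, iters=50, eps=1e-6):
    """Illustrative EM for binary truth discovery (hypothetical helper).

    obs[i][j] is 1 if participant i reported claim j, else 0.
    Returns (z, a, b, d): z[j] is the posterior probability that claim j
    is true, a[i] and b[i] are participant i's estimated report rates for
    true and false claims, and d is the estimated prior of a true claim.
    """
    m, n = len(obs), len(obs[0])

    def clip(p):
        # Keep probabilities away from 0 and 1 so the logs stay finite.
        return min(max(p, eps), 1.0 - eps)

    # Assumed starting point: participants report true claims more often
    # than false ones. This also fixes which label means "true".
    a = [0.6] * m
    b = [0.3] * m
    d = 0.5
    z = [0.5] * n

    for _ in range(iters):
        # E-step: posterior that each claim is true, computed in log space.
        for j in range(n):
            log_true = math.log(d)
            log_false = math.log(1.0 - d)
            for i in range(m):
                if obs[i][j]:
                    log_true += math.log(a[i])
                    log_false += math.log(b[i])
                else:
                    log_true += math.log(1.0 - a[i])
                    log_false += math.log(1.0 - b[i])
            z[j] = _sigmoid(log_true - log_false)

        # M-step: re-estimate each participant's rates and the prior.
        z_sum = sum(z)
        for i in range(m):
            reported_true = sum(z[j] for j in range(n) if obs[i][j])
            reports = sum(obs[i])
            a[i] = clip(reported_true / max(z_sum, eps))
            b[i] = clip((reports - reported_true) / max(n - z_sum, eps))
        d = clip(z_sum / n)

    return z, a, b, d


# Toy data (made up for illustration): three participants largely agree on
# claims 0-2, while a fourth reports only claims 3-5.
obs = [
    [1, 1, 1, 0, 0, 0],
    [1, 1, 0, 0, 0, 0],
    [1, 0, 1, 0, 0, 0],
    [0, 0, 0, 1, 1, 1],
]
z, a, b, d = em_truth_discovery(obs)
print([round(p, 3) for p in z])
```

On this toy matrix the posteriors for the corroborated claims should move toward 1 and those for the uncorroborated claims toward 0, and the fourth participant should end up with a low estimated rate of reporting true claims. Like any EM procedure, the sketch can converge to a local optimum, so the result depends on the assumed initialization.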

Reviewer: George Popescu	Review #: CR140220 (1210-1061)

Miscellaneous (H.4.m)

Reproduction in whole or in part without permission is prohibited. Copyright 1999-2024 ThinkLoud®