Publishing data for secondary analysis benefits applications ranging from medical research to policy making; however, it also risks invading individual privacy and enabling discrimination. This paper develops a data sanitization framework that offers both privacy preservation and discrimination prevention in data publishing.
The authors use k-anonymity to define privacy and the concept of α-protection to define discrimination, which intuitively measures the difference in sensitive outcomes (for example, benefits) between protected and unprotected social groups. Based on these definitions, the authors propose enhancing Incognito (a full-domain generalization framework) to support both k-anonymity and α-protection. An evaluation using both general and specific data analysis metrics is presented.
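To make the two definitions concrete, the following is a minimal sketch of checks a sanitization framework might run on a released table. The function names, attribute names, and the particular discrimination measure (a simple difference in benefit rates, bounded by α) are illustrative assumptions, not the paper's exact formulation, which may use other measures of group disparity.

```python
from collections import Counter

def is_k_anonymous(records, quasi_ids, k):
    """A table is k-anonymous if every combination of quasi-identifier
    values is shared by at least k records."""
    groups = Counter(tuple(r[a] for a in quasi_ids) for r in records)
    return all(count >= k for count in groups.values())

def is_alpha_protective(records, protected_attr, protected_value,
                        outcome_attr, benefit_value, alpha):
    """Illustrative alpha-protection check (assumed measure): the gap in
    benefit rates between the unprotected and protected groups must not
    exceed alpha."""
    prot = [r for r in records if r[protected_attr] == protected_value]
    unprot = [r for r in records if r[protected_attr] != protected_value]
    if not prot or not unprot:
        return True  # no comparison possible; treat as trivially protective
    rate = lambda grp: sum(r[outcome_attr] == benefit_value for r in grp) / len(grp)
    return rate(unprot) - rate(prot) <= alpha

# Hypothetical four-record table for illustration
records = [
    {"zip": "123", "age": "30-40", "sex": "F", "credit": "yes"},
    {"zip": "123", "age": "30-40", "sex": "F", "credit": "no"},
    {"zip": "123", "age": "30-40", "sex": "M", "credit": "yes"},
    {"zip": "123", "age": "30-40", "sex": "M", "credit": "yes"},
]
print(is_k_anonymous(records, ["zip", "age"], 4))                        # all 4 share one group
print(is_alpha_protective(records, "sex", "F", "credit", "yes", 0.4))    # rate gap is 0.5 > 0.4
```

An enhanced Incognito-style search would generalize quasi-identifier values (for example, coarsening ZIP codes or age ranges) until both predicates hold, which is the combined condition the authors target.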
In conclusion, the authors have developed a new data sanitization method that preserves privacy and prevents discrimination in data publishing. This work is among the first to address the two problems simultaneously. However, a set of key questions remains unanswered. For example, how may privacy preservation and discrimination prevention interfere with each other? It is critical to understand their interplay and how it may affect the utility of the anonymized data. Moreover, the proposed method considers only the most basic privacy definition, k-anonymity, which is known to offer limited privacy protection. How can more advanced definitions, such as l-diversity, t-closeness, and differential privacy, be incorporated?