Computing Reviews
Today's Issue Hot Topics Search Browse Recommended My Account Log In
Review Help
Search
Generalization-based privacy preservation and discrimination prevention in data publishing and mining
Hajian S., Domingo-Ferrer J., Farràs O. Data Mining and Knowledge Discovery28 (5-6):1158-1188,2014.Type:Article
Date Reviewed: Jan 20 2015

Publishing data for secondary analysis benefits applications ranging from medical research to policy making; however, it also incurs the risks of invasion of individual privacy and creation of discrimination. This paper develops a data sanitization framework to offer both privacy preservation and discrimination prevention in data publishing.

The authors use k-anonymity to define privacy and the concept of &agr;-protectiveness to define discrimination, which intuitively measures the difference of sensitive outcomes (for example, benefits) between protected and unprotected social groups. Based on these definitions, the authors propose enhancing Incognito (a full-domain generalization framework) to support both k-anonymity and alpha-protection. An evaluation using both general and specific data analysis metrics is presented.

In conclusion, the authors have developed a new data sanitization method that preserves privacy and prevents discrimination in data publishing. This work is among the first few attempting to address two problems simultaneously. However, a set of key questions remains unanswered. For example, how may privacy preservation and discrimination prevention interfere with each other? It is critical to understand their interplay and how that may affect the utility of anonymized data. The proposed method only considers the very basic privacy definition, k-anonymity, which is known to offer only very limited privacy protection. How can more advanced definitions such as l-diversity, t-closeness, and differential privacy be included?

Reviewer:  Ting Wang Review #: CR143097 (1505-0414)
Bookmark and Share
 
Data Mining (H.2.8 ... )
 
 
Privacy (K.4.1 ... )
 
 
Security, Integrity, And Protection (H.2.7 ... )
 
Would you recommend this review?
yes
no
Other reviews under "Data Mining": Date
Feature selection and effective classifiers
Deogun J. (ed), Choubey S., Raghavan V. (ed), Sever H. (ed) Journal of the American Society for Information Science 49(5): 423-434, 1998. Type: Article
May 1 1999
Rule induction with extension matrices
Wu X. (ed) Journal of the American Society for Information Science 49(5): 435-454, 1998. Type: Article
Jul 1 1998
Predictive data mining
Weiss S., Indurkhya N., Morgan Kaufmann Publishers Inc., San Francisco, CA, 1998. Type: Book (9781558604032)
Feb 1 1999
more...

E-Mail This Printer-Friendly
Send Your Comments
Contact Us
Reproduction in whole or in part without permission is prohibited.   Copyright 1999-2024 ThinkLoud®
Terms of Use
| Privacy Policy