Computing Reviews
Today's Issue Hot Topics Search Browse Recommended My Account Log In
Review Help
Search
Behavior-based clustering and analysis of interestingness measures for association rule mining
Tew C., Giraud-Carrier C., Tanner K., Burton S. Data Mining and Knowledge Discovery28 (4):1004-1045,2014.Type:Article
Date Reviewed: May 26 2015

When onions and tomatoes are in a basket, it is likely that lettuce will be found also. Data mining can discover a large number of association rules like this one. Through research in association rule mining, a variety of measures have been defined to determine how interesting a pattern is so that only strong patterns with high degrees of interestingness are identified.

This paper analyzes 61 known measures and uses 110 different datasets on each to provide a categorization. The number of attributes in these datasets (coming from the University of California at Irvine, the gene expression medical data repositories, and from multi-class protein folding) varies between four and 1559, and the attribute-value pairs between eight and 3121.

The contribution of this research is in the juxtaposition of theoretical definitions with empirical behaviors of the measures. Several past discrepancies are found, and equivalences between some existing measures have been determined, reducing their total number to 21. The consequences of this work are significant. Instead of using computationally complex measures, similar alternatives can be chosen. But eventually it is the knowledge domain that remains the single deal breaker in choosing an approach, thus making the research on these association rules relevant mostly to theoreticians.

The paper is well written, with a comprehensive study of other works, sound theory, and support from visual presentations in discussing the findings. It is these visual presentations that are easily remembered and can be used as quick references whenever rule mining alternatives might be considered.

Reviewer:  Goran Trajkovski Review #: CR143463 (1508-0719)
Bookmark and Share
  Reviewer Selected
Featured Reviewer
 
 
Clustering (H.3.3 ... )
 
 
Data Mining (H.2.8 ... )
 
Would you recommend this review?
yes
no
Other reviews under "Clustering": Date
Concepts and effectiveness of the cover-coefficient-based clustering methodology for text databases
Can F. (ed), Ozkarahan E. ACM Transactions on Database Systems 15(3): 483-517, 1990. Type: Article
Dec 1 1992
A parallel algorithm for record clustering
Omiecinski E., Scheuermann P. ACM Transactions on Database Systems 15(3): 599-624, 1990. Type: Article
Nov 1 1992
Organization of clustered files for consecutive retrieval
Deogun J., Raghavan V., Tsou T. ACM Transactions on Database Systems 9(4): 646-671, 1984. Type: Article
Jun 1 1985
more...

E-Mail This Printer-Friendly
Send Your Comments
Contact Us
Reproduction in whole or in part without permission is prohibited.   Copyright 1999-2024 ThinkLoud®
Terms of Use
| Privacy Policy