Computing Reviews
MGR: an information theory based hierarchical divisive clustering algorithm for categorical data
Qin H., Ma X., Herawan T., Zain J. Knowledge-Based Systems 67: 401-411, 2014. Type: Article
Date Reviewed: Nov 18 2014

Most of the clustering literature deals with numeric data. This paper presents a novel algorithm for clustering categorical data that follows an “old school” top-down procedure. The main idea is very similar to clustering trees [1], with one difference: the splitting criterion is based on the average information gain of the attributes, here called the mean gain ratio (MGR).
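
To make the splitting criterion concrete, here is a minimal Python sketch of one plausible reading of it. It is an illustration under stated assumptions, not the authors' definition: the review only says that the criterion averages information gain over the attributes, so the normalization by the entropy of the splitting attribute, the function names, and the toy data are all assumptions. The sketch only selects a splitting attribute; the rest of the top-down procedure is not shown.

```python
from collections import Counter
from math import log2


def entropy(values):
    """Shannon entropy (in bits) of a sequence of categorical values."""
    n = len(values)
    return -sum((c / n) * log2(c / n) for c in Counter(values).values())


def conditional_entropy(target, given):
    """Entropy of `target` remaining after partitioning the rows by `given`."""
    groups = {}
    for t, g in zip(target, given):
        groups.setdefault(g, []).append(t)
    n = len(target)
    return sum(len(ts) / n * entropy(ts) for ts in groups.values())


def gain_ratio(target, split):
    """Information gain on `target` from splitting on `split`,
    divided by the entropy of `split` (assumed normalization)."""
    h_split = entropy(split)
    if h_split == 0:
        return 0.0  # a constant attribute cannot split anything
    return (entropy(target) - conditional_entropy(target, split)) / h_split


def mean_gain_ratio(rows, i):
    """Average gain ratio that attribute i yields on every other attribute.
    Assumes every row has the same length and at least two attributes."""
    cols = list(zip(*rows))
    others = [j for j in range(len(cols)) if j != i]
    return sum(gain_ratio(cols[j], cols[i]) for j in others) / len(others)


def best_split_attribute(rows):
    """Index of the attribute with the highest mean gain ratio."""
    return max(range(len(rows[0])), key=lambda i: mean_gain_ratio(rows, i))


# Hypothetical toy data: attributes 0 and 1 determine each other,
# attribute 2 is independent of both.
rows = [("a", "x", "p"), ("a", "x", "q"), ("b", "y", "p"), ("b", "y", "q")]
split_index = best_split_attribute(rows)
```

On this toy data, attributes 0 and 1 share the highest score because each fully predicts the other, while attribute 2 carries no information about either; ties go to the lowest index.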

The contribution looks marginal, even though the experiments show that the proposal outperforms three previous approaches on nine University of California at Irvine (UCI) benchmarks and artificial datasets. Most of the references are about ten years old, and modern data mining issues (for example, numerous attributes or heterogeneous and linked data) seem to be outside the scope of MGR. Experiments on a real, recent case study would have helped convince the reader of the relevance of yet another clustering algorithm.

Reviewer:  Julien Velcin Review #: CR142944 (1502-0172)
1) De Raedt, L.; Blockeel, H. Using logical decision trees for clustering. In Proc. of the 7th International Workshop on Inductive Logic Programming. Springer, 1997, 133–140.
Clustering (H.3.3 ... )
 
Other reviews under "Clustering": Date
Concepts and effectiveness of the cover-coefficient-based clustering methodology for text databases
Can F. (ed), Ozkarahan E. ACM Transactions on Database Systems 15(3): 483-517, 1990. Type: Article
Dec 1 1992
A parallel algorithm for record clustering
Omiecinski E., Scheuermann P. ACM Transactions on Database Systems 15(3): 599-624, 1990. Type: Article
Nov 1 1992
Organization of clustered files for consecutive retrieval
Deogun J., Raghavan V., Tsou T. ACM Transactions on Database Systems 9(4): 646-671, 1984. Type: Article
Jun 1 1985