Computing Reviews, the leading online review service for computing literature.


MGR: an information theory based hierarchical divisive clustering algorithm for categorical data
Qin H., Ma X., Herawan T., Zain J. Knowledge-Based Systems 67: 401-411, 2014. Type: Article

Date Reviewed: Nov 18 2014

Most of the clustering literature deals with numeric data. This paper presents a novel algorithm for clustering categorical data that follows an “old school” top-down procedure. The main idea is very similar to clustering trees [1], with one difference: the splitting criterion is based on the average information gain of the attributes, here called the mean gain ratio (MGR). The contribution looks marginal, even though the experiments show the proposal outperforming three previous approaches on nine University of California at Irvine (UCI) benchmarks and on artificial datasets. Most of the references are about ten years old, and modern data mining issues (for example, numerous attributes, or heterogeneous and linked data) seem to be outside the scope of MGR. Experiments on a real, recent case study would have helped convince the reader of the relevance of yet another clustering algorithm.
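To make the splitting criterion concrete, here is a minimal sketch of how an averaged gain ratio could be used to pick the attribute on which to divide a categorical dataset. It is an illustration under stated assumptions, not the paper's definition: the review only says that the criterion averages the information gain of the attributes, so the normalization by the splitting attribute's entropy, the record layout (a list of dicts), and the function names are all hypothetical.

```python
from collections import Counter
from math import log2


def entropy(values):
    """Shannon entropy (in bits) of a sequence of categorical values."""
    n = len(values)
    return -sum((c / n) * log2(c / n) for c in Counter(values).values())


def conditional_entropy(target, given):
    """H(target | given) for two equal-length categorical columns."""
    n = len(target)
    groups = {}
    for t, g in zip(target, given):
        groups.setdefault(g, []).append(t)
    return sum(len(ts) / n * entropy(ts) for ts in groups.values())


def mean_gain_ratio(rows, split_attr):
    """Average gain ratio of split_attr over all other attributes.

    Assumption: gain ratio = information gain / entropy of split_attr,
    as in decision-tree learning; the paper may define it differently.
    """
    split_col = [r[split_attr] for r in rows]
    h_split = entropy(split_col)
    if h_split == 0:  # a constant attribute cannot split the data
        return 0.0
    ratios = []
    for attr in rows[0]:
        if attr == split_attr:
            continue
        col = [r[attr] for r in rows]
        gain = entropy(col) - conditional_entropy(col, split_col)
        ratios.append(gain / h_split)
    return sum(ratios) / len(ratios) if ratios else 0.0


def best_split_attribute(rows):
    """Attribute with the highest mean gain ratio (hypothetical helper)."""
    return max(rows[0], key=lambda a: mean_gain_ratio(rows, a))
```

In a top-down (divisive) procedure, the selected attribute would then be used to partition the records, and the same selection would be repeated on the resulting subsets. How the paper chooses the partition and when it stops splitting is not stated in this review, so the sketch leaves both out.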

Reviewer: Julien Velcin	Review #: CR142944 (1502-0172)

1)	De Raedt, L.; Blockeel, H. Using logical decision trees for clustering. In Proc. of the 7th International Workshop on Inductive Logic Programming. Springer, 1997, 133–140.

Clustering (H.3.3 ... )


