Computing Reviews
Methods for exploratory cluster analysis
Kaski S., Nikkilä J., Kohonen T. In Intelligent exploration of the web. Heidelberg, Germany,  Physica-Verlag GmbH,  2003. Type:Book Chapter
Date Reviewed: Nov 14 2003

This chapter reports on a study that used self-organizing maps to detect clusters, and the variables that dominate each cluster, in a high-dimensional data set. The approach suits preliminary data mining, when the hypothesis is still being formulated and there is no prior knowledge about the data.

Analyzing high-dimensional data sets is a nontrivial task. A wide variety of methods have been suggested, for example glyphs, scatter plots, parallel coordinate plots, and VERI [1]. This chapter uses a graphical map display in which proximity on the map represents similarity between data points. The display is produced by a self-organizing map (SOM) algorithm and consists of a grid of map units. Each unit has an associated model vector, which has the same dimensionality as the data set and represents the data points associated with that unit. The model vectors are computed by an iterative process that is either batch or stochastic.
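The stochastic form of this iterative process can be sketched in a few lines of Python. This is a minimal illustration under stated assumptions, not the chapter's implementation: the names `train_som` and `best_matching_unit`, the Gaussian neighborhood function, and the linear decay of the learning rate and radius are all choices made here for the example.

```python
import math
import random


def best_matching_unit(models, x):
    """Return the grid position of the unit whose model vector is closest to x."""
    return min(
        models,
        key=lambda unit: sum((m - v) ** 2 for m, v in zip(models[unit], x)),
    )


def train_som(data, rows, cols, steps=2000, lr0=0.5, radius0=None, seed=0):
    """Stochastic SOM training sketch (hypothetical helper, assumed schedules).

    data: list of equal-length numeric sequences.
    Returns a dict mapping (row, col) to a model vector (list of floats).
    """
    rng = random.Random(seed)
    dim = len(data[0])
    if radius0 is None:
        radius0 = max(rows, cols) / 2.0

    # One model vector per map unit, with the same dimensionality as the data.
    # Initialising from randomly chosen data points is an assumption.
    models = {
        (r, c): [float(v) for v in rng.choice(data)]
        for r in range(rows)
        for c in range(cols)
    }

    for t in range(steps):
        frac = 1.0 - t / steps             # decays linearly from 1 towards 0
        lr = lr0 * frac                    # learning rate
        radius = max(radius0 * frac, 0.5)  # neighborhood width on the grid

        x = rng.choice(data)               # stochastic: one sample per step
        bmu = best_matching_unit(models, x)

        for unit, model in models.items():
            # Squared distance on the map grid, not in data space.
            grid_d2 = (unit[0] - bmu[0]) ** 2 + (unit[1] - bmu[1]) ** 2
            h = math.exp(-grid_d2 / (2.0 * radius * radius))
            for i in range(dim):
                model[i] += lr * h * (x[i] - model[i])

    return models
```

After training, `best_matching_unit(models, x)` gives the map unit for a data point `x`, so points that land on the same or neighboring units are the ones the display treats as similar. A batch variant would instead update every model vector once per pass over the whole data set.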

Kaski, Nikkilä, and Kohonen developed a SOM system for textual databases, called WEBSOM, which was able to detect a new category in the patent data. The chapter suggests a new method for detecting and visualizing gaps between clusters. By means of a contribution profile, the contribution of the set of variables in a cluster can be evaluated. Areas where the contributions of all the variables are similar can be grouped together when characterizing the cluster.

The chapter is well written, with appropriate examples and figures. The only drawback is that the introduction does not mention what the later sections cover.

Reviewer:  Pragyansmita Nayak Review #: CR128571 (0403-0328)
[1] Osbourne, G. Visual Empirical Region of Influence (VERI). http://www.sandia.gov/imrl/XVisionScience/Xartvis.htm (accessed 11/6/03).
Clustering (H.3.3 ... )
 
 
Indexing Methods (H.3.1 ... )
 