Computing Reviews
Multivalued type proximity measure and concept of mutual similarity value useful for clustering symbolic patterns
Guru D., Kiranagi B., Nagabhushan P. Pattern Recognition Letters 25(10): 1203-1213, 2004. Type: Article
Date Reviewed: Jan 19 2005

Guru, Kiranagi, and Nagabhushan introduce a nonsymmetric similarity measure for objects (patterns) that are described by interval-type data features. In the computations, each feature is treated as a separate similarity dimension. The overall similarity between two objects (which the paper calls the “mutual similarity value”) is obtained by adding the two objects' similarities to each other, taken as two vectors, using the triangular law of addition. The resulting object-by-object similarity matrix is then used to implement a single-link-type agglomerative clustering algorithm.
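
The pipeline described above can be illustrated with a minimal Python sketch. It is not the authors' formulation: the per-feature similarity used here (overlap length divided by the width of the second object's interval), the reading of the mutual similarity value as the magnitude of the vector sum of the two directed similarity vectors, and all function names are assumptions made for illustration only. The sketch records the full single-link merge sequence rather than applying the paper's own stopping rule.

```python
from math import sqrt
from typing import List, Set, Tuple

Interval = Tuple[float, float]
Pattern = List[Interval]  # one (low, high) interval per feature


def directed_similarity(a: Pattern, b: Pattern) -> List[float]:
    """Per-feature similarity of a to b (ASSUMED form: overlap / width of b)."""
    sims = []
    for (a_lo, a_hi), (b_lo, b_hi) in zip(a, b):
        width = b_hi - b_lo
        if width > 0:
            overlap = max(0.0, min(a_hi, b_hi) - max(a_lo, b_lo))
            sims.append(overlap / width)
        else:
            # b is a single point: similar only if the point lies inside a
            sims.append(1.0 if a_lo <= b_lo <= a_hi else 0.0)
    return sims


def mutual_similarity(a: Pattern, b: Pattern) -> float:
    """ASSUMED reading: magnitude of the sum of the two directed vectors."""
    s_ab = directed_similarity(a, b)
    s_ba = directed_similarity(b, a)
    return sqrt(sum((x + y) ** 2 for x, y in zip(s_ab, s_ba)))


def single_link_merges(patterns: List[Pattern]) -> List[Tuple[Set[int], Set[int], float]]:
    """Merge the two most similar clusters until one remains; return the merges."""
    n = len(patterns)
    sim = [[mutual_similarity(patterns[i], patterns[j]) for j in range(n)]
           for i in range(n)]
    clusters = [frozenset([i]) for i in range(n)]
    merges = []
    while len(clusters) > 1:
        best = None
        for x in range(len(clusters)):
            for y in range(x + 1, len(clusters)):
                # single link: cluster similarity is the best pairwise similarity
                link = max(sim[i][j] for i in clusters[x] for j in clusters[y])
                if best is None or link > best[0]:
                    best = (link, x, y)
        link, x, y = best
        merges.append((set(clusters[x]), set(clusters[y]), link))
        merged = clusters[x] | clusters[y]
        clusters = [c for k, c in enumerate(clusters) if k not in (x, y)] + [merged]
    return merges


# Hypothetical toy data: three patterns, two interval features each
patterns = [
    [(0, 2), (0, 4)],
    [(1, 3), (1, 5)],
    [(10, 12), (20, 24)],
]
for left, right, link in single_link_merges(patterns):
    print(sorted(left), sorted(right), round(link, 3))
```

Under this assumed definition, the directed similarities differ whenever the two intervals have different widths, which is what makes the measure nonsymmetric; the mutual similarity value combines both directions into a single symmetric number that the clustering step can use.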

The authors assess their approach using three different data sets, and show that it is effective. They also compare their results with those of five other methods that use various similarity measures. Based on these experimental observations, the authors state that their approach is superior, for reasons such as computational efficiency and the absence of any user-provided input parameter. Note, however, that with small data sets (for example, the eight objects and five data features of the “fats and oils data” used in the experiments) and with the computational power of today’s computers, efficiency is not a real concern. Whether the approach remains effective on large data sets needs further investigation.

Another nonsymmetric similarity measure, the cover coefficient concept [1], was introduced for clustering text databases. It employs document word frequencies as the data features. Although the data feature types are different, the two similarity measures are comparable (but not identical): with some imagination, the computations of the measures can conceptually be mapped to each other.
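
For comparison, the cover coefficient can be sketched in a few lines of Python. The sketch assumes the usual two-stage form given in [1], in which the extent to which document i is covered by document j is c_ij = α_i Σ_k d_ik β_k d_jk, with α_i the reciprocal of the i-th row sum and β_k the reciprocal of the k-th column sum of the document-by-term matrix D. The function name and the toy matrix are illustrative, and every document is assumed to contain at least one term.

```python
from typing import List


def cover_coefficients(D: List[List[float]]) -> List[List[float]]:
    """Return the document-by-document cover coefficient matrix for D.

    D[i][k] is the frequency of term k in document i (rows must be nonzero).
    """
    m, n = len(D), len(D[0])
    row_sums = [sum(row) for row in D]
    col_sums = [sum(D[i][k] for i in range(m)) for k in range(n)]
    C = [[0.0] * m for _ in range(m)]
    for i in range(m):
        for j in range(m):
            # Sum over shared terms, skipping terms that occur in no document
            total = sum(D[i][k] * D[j][k] / col_sums[k]
                        for k in range(n) if col_sums[k] > 0)
            C[i][j] = total / row_sums[i]
    return C


# Hypothetical toy matrix: three documents, three terms
D = [
    [2, 1, 0],
    [1, 0, 0],
    [0, 0, 3],
]
C = cover_coefficients(D)
for row in C:
    print([round(c, 3) for c in row])
```

In this toy matrix, C[0][1] and C[1][0] differ, so the measure is nonsymmetric, and each row of C sums to 1. The parallel with the interval-based measure is that both compute a directed, per-feature contribution from one object to another before the results are aggregated.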

Reviewer: F. Can
Review #: CR130690 (0508-0956)

[1] Can, F.; Ozkarahan, E.A. Concepts and effectiveness of the cover coefficient-based clustering methodology for text databases. ACM Trans. Database Syst. 15, 4 (1990), 483–517.

Similarity Measures (I.5.3 ... )
Algorithms (I.5.3 ... )
Statistical (I.5.1 ... )