Computing Reviews, the leading online review service for computing literature.

Multivalued type proximity measure and concept of mutual similarity value useful for clustering symbolic patterns
Guru D., Kiranagi B., Nagabhushan P. Pattern Recognition Letters 25(10): 1203-1213, 2004. Type: Article

Date Reviewed: Jan 19 2005

Guru, Kiranagi, and Nagabhushan introduce a nonsymmetric similarity measure for objects (patterns) that are described by interval-type data features. In the computations, each feature is treated as a separate similarity dimension. The overall similarity between two objects (referred to in the paper as the “mutual similarity value”) is obtained by adding their similarities to each other (two vectors) using the triangle law of vector addition. The resulting object-by-object similarity matrix is then used to implement a single-link-type agglomerative clustering algorithm. The authors assess their approach using three different data sets, and show that it is effective. They also compare their results with those of five other methods that use various similarity measures. Based on these experimental observations, the authors state that their approach is superior, for reasons such as computational efficiency and the absence of any user-provided input parameter. Note, however, that with small data sets (for example, the eight objects and five data features of the “fats and oils data” used in the experiments) and with the computational power of today’s computers, efficiency is not a concern. Whether the effectiveness scales to large data sets needs further investigation. Another nonsymmetric similarity measure, the cover coefficient concept [1], was introduced for clustering textual databases. It employs document word frequencies as the data features. Although the data feature types are different, the two similarity measures are comparable (but not identical): with some imagination, the computations of the measures can conceptually be mapped to each other.
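The pipeline summarized above can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the authors' formulation: the review does not give the paper's formulas, so the per-feature directed similarity used here (overlap length divided by the width of the other object's interval) is a stand-in, and the mutual similarity is read as the magnitude of the sum of the two directed similarity vectors. The function names `directed_similarity`, `mutual_similarity`, and `single_link_merges` are hypothetical.

```python
from itertools import combinations
from math import sqrt


def directed_similarity(a, b):
    """Per-feature similarity of object a toward object b.

    Assumption (not taken from the paper): for each interval feature, the
    similarity is the overlap length divided by the width of b's interval.
    This makes the measure nonsymmetric, one value per feature.
    """
    sims = []
    for (a_lo, a_hi), (b_lo, b_hi) in zip(a, b):
        overlap = max(0.0, min(a_hi, b_hi) - max(a_lo, b_lo))
        width = b_hi - b_lo
        if width > 0:
            sims.append(overlap / width)
        else:
            # Degenerate (point) interval: similar only if the point lies in a.
            sims.append(1.0 if a_lo <= b_lo <= a_hi else 0.0)
    return sims


def mutual_similarity(a, b):
    """Magnitude of the vector sum of the two directed similarity vectors
    (one possible reading of the 'two vectors' addition described above)."""
    ab = directed_similarity(a, b)
    ba = directed_similarity(b, a)
    return sqrt(sum((x + y) ** 2 for x, y in zip(ab, ba)))


def single_link_merges(objects):
    """Single-link agglomeration on the mutual similarity matrix.

    Returns the full merge sequence as (cluster, cluster, similarity) tuples,
    so no stopping parameter is needed in this sketch.
    """
    n = len(objects)
    sim = {
        frozenset((i, j)): mutual_similarity(objects[i], objects[j])
        for i, j in combinations(range(n), 2)
    }

    def link(pair):
        # Single link: similarity of the closest (most similar) pair of members.
        c1, c2 = pair
        return max(sim[frozenset((i, j))] for i in c1 for j in c2)

    clusters = [frozenset([i]) for i in range(n)]
    merges = []
    while len(clusters) > 1:
        c1, c2 = max(combinations(clusters, 2), key=link)
        merges.append((sorted(c1), sorted(c2), link((c1, c2))))
        clusters = [c for c in clusters if c not in (c1, c2)] + [c1 | c2]
    return merges
```

Each object is a list of `(low, high)` intervals, one per feature. With the assumed measure, a wide interval fully covers a narrow one but not the reverse, which is where the asymmetry comes from. How the paper decides where to cut the merge sequence is not described in the review, so the sketch leaves that step out.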

Reviewer: F. Can	Review #: CR130690 (0508-0956)

1)	Can, F.; Ozkarahan, E.A. Concepts and effectiveness of the cover coefficient-based clustering methodology for text databases. ACM Trans. Database Syst. 15, 4 (1990), 483–517.

Similarity Measures (I.5.3 ... )

Algorithms (I.5.3 ... )

Statistical (I.5.1 ... )
