Computing Reviews

A novel probabilistic clustering model for heterogeneous networks
Deng Z., Xu X. Machine Learning104(1):1-24,2016.Type:Article
Date Reviewed: 08/31/16

Following up works tagged as link mining, in which we fully consider the links between objects in the data mining process [1], this paper addresses the clustering task when performed on heterogeneous networks.

The novelty of the model, called PROCESS (short for probabilistic clustering model for heterogeneous networks), lies in the handling of heterogeneity: in addition to the direct relations between objects (for example, friendship), the authors add the possibility of relating the objects on the basis of shared properties (for example, both objects are “red” or “married”) and taking relations between properties into account. The expectation-maximization (EM) framework is used in order to estimate the parameters with a variant of the message passing algorithm, which is not easy to follow if you are unfamiliar with this kind of optimization procedure. The comparison with other models, including a previous model proposed by the same authors [2] and spectral relational clustering [3], shows that PROCESS is better for cluster quality, measured with the normalized mutual information and the F-measure, with a linear runtime.

The experiments were performed on eight artificial datasets and one real dataset of a bibliographic network extracted from the ACM Digital Library. The latter is not so convincing, for it seems that no relation between the properties (here, terms) is considered. This weakens the demonstration of PROCESS superiority. Besides, neither the dataset nor the implementation of PROCESS is available to the community.


1)

Getoor, L.; Diehl, C. P. Link mining: a survey. ACM SIGKDD Explorations Newsletter 7, 2(2005), 3–12.


2)

Xu, X.; Deng, Z. H. BibClus: a clustering algorithm of bibliographic networks by message passing on center linkage structure. In Proc. of the 11th International Conference on Data Mining. IEEE, 2011, 864–873.


3)

Ng, A.; Jordan, M.; Weiss, Y. Advances in neural information processing systems 14, volume 2. MIT Press, Cambridge, MA, 2002.

Reviewer:  Julien Velcin Review #: CR144719 (1612-0930)

Reproduction in whole or in part without permission is prohibited.   Copyright 2024 ComputingReviews.com™
Terms of Use
| Privacy Policy