Computing Reviews
Today's Issue Hot Topics Search Browse Recommended My Account Log In
Review Help
Search
Integrating community matching and outlier detection for mining evolutionary community outliers
Gupta M., Gao J., Sun Y., Koutra D.  KDD 2012 (Proceedings of the 18th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, Beijing, China, Aug 12-16, 2012)859-867.2012.Type:Proceedings
Date Reviewed: May 28 2013

Several algorithms are able to identify certain so-called communities, regions of data points with more cohesion than the rest. But what happens if these communities and memberships evolve and both concepts are not necessarily preserved over time?

This easily accessible paper answers the question of how to detect outliers--data points whose behavior is different from the rest of the community they initially belonged to--in evolving datasets. Think of stockbrokers who deviate from investment trends or scientific authors who change their co-authorship networks.

Initial inputs to the proposed detection algorithm are P and Q, two partitions of a (constant) set of objects in a varying number of communities, corresponding to the two points in time to be compared. Obviously, a single comparison of community memberships P and Q in terms of a correspondence matrix S is too naive to discriminate between an outlier of a community and its core members. Therefore, the authors introduce an additional “outlierness” score, A, for a given object with regard to a community of Q. Because outlierness is not a crisp concept, the total outlierness score has to be constrained by a certain threshold to obtain a convergent algorithm.

Besides a rigorous exposition of the algorithm (sufficient to actually implement it), the authors also describe some theoretical properties, such as convergence and running time. Using both synthetic and real datasets (for example, subsets of data from the Internet Movie DataBase and the Digital Bibliography and Library Project), the authors convincingly demonstrate the applicability of their approach.

I definitely recommend this paper to researchers in theoretical or applied computer science with an interest in (statistical) communities and outlier detection.

Reviewer:  Christoph F. Strnadl Review #: CR141247 (1308-0731)
Bookmark and Share
  Featured Reviewer  
 
General (H.1.0 )
 
 
Miscellaneous (H.4.m )
 
Would you recommend this review?
yes
no
Other reviews under "General": Date
On models and modelling in human-computer co-operation
Oberquelle H.  Readings on cognitive ergonomics - mind and computers (, Gmunden, Austria,431984. Type: Proceedings
Oct 1 1985
Goal and plan knowledge representations: from stories to text editors and programs
Black J., Kay D., Soloway E., MIT Press, Cambridge, MA, 1987. Type: Book (9789780262031257)
Dec 1 1988
Enterprise architecture planning
Spewak S., Hill S., QED Information Sciences, Inc., Wellesley, MA, 1993. Type: Book (9780894354366)
Sep 1 1993
more...

E-Mail This Printer-Friendly
Send Your Comments
Contact Us
Reproduction in whole or in part without permission is prohibited.   Copyright 1999-2024 ThinkLoud®
Terms of Use
| Privacy Policy