Empirical comparison of algorithms for network community detection

Top Cited Papers

26 April 2010

conference paper
conference paper
Published by Association for Computing Machinery (ACM)

p. 631-640
https://doi.org/10.1145/1772690.1772755

Abstract

Detecting clusters or communities in large real-world graphs such as large social or information networks is a problem of considerable interest. In practice, one typically chooses an objective f unction that captures the intuition of a network cluster as set of nod es with better internal connectivity than external connectivity, and then one applies approximation algorithms or heuristics to extract sets of nodes that are related to the objective function and that "lo ok like" good communities for the application of interest. In this paper, we explore a range of network community detec- tion methods in order to compare them and to understand their rela- tive performance and the systematic biases in the clusters t hey iden- tify. We evaluate several common objective functions that are used to formalize the notion of a network community, and we examine several different classes of approximation algorithms tha t aim to optimize such objective functions. In addition, rather tha n simply fixing an objective and asking for an approximation to the bes t clus- ter of any size, we consider a size-resolved version of the optimiza- tion problem. Considering community quality as a function of its size provides a much finer lens with which to examine communit y detection algorithms, since objective functions and approximation algorithms often have non-obvious size-dependent behavior.

Keywords

This publication has 31 references indexed in Scilit:

Community Structure in Large Networks: Natural Cluster Sizes and the Absence of Large Well-Defined Clusters
Internet Mathematics, 2009
Resolution limit in community detection
Proceedings of the National Academy of Sciences, 2007
Modularity and community structure in networks
Proceedings of the National Academy of Sciences of the United States of America, 2006
Finding local community structure in networks
Physical Review E, 2005
Finding community structure in very large networks
Physical Review E, 2004
Modularity from fluctuations in random graphs and complex networks
Physical Review E, 2004
On clusterings
Journal of the ACM, 2004
A nonlinear programming algorithm for solving semidefinite programs via low-rank factorization
Mathematical Programming, 2003
Normalized cuts and image segmentation
Ieee Transactions On Pattern Analysis and Machine Intelligence, 2000
Multicommodity max-flow min-cut theorems and their use in designing approximation algorithms
Journal of the ACM, 1999

Cited by 601 articles