Finding and testing network communities by lumped Markov chains

Carlo Piccardi

doi:10.1371/journal.pone.0027028

PLoS One. 2011;6(11):e27028. doi: 10.1371/journal.pone.0027028. Epub 2011 Nov 3.

Author

Carlo Piccardi¹

Affiliation

¹ Department of Electronics and Information, Politecnico di Milano, Milano, Italy. carlo.piccardi@polimi.it

Abstract

Identifying communities (or clusters), namely groups of nodes with comparatively strong internal connectivity, is a fundamental task for understanding the structure and function of a network. Yet formal criteria for defining communities and for testing their significance are lacking. We propose a sharp definition based on a quality threshold. By means of a lumped Markov chain model of a random walker, a quality measure called "persistence probability" is associated with a cluster, which is then defined as an "α-community" if this probability is not smaller than α. Consistently, a partition composed of α-communities is an "α-partition." These definitions turn out to be very effective for finding and testing communities. If a set of candidate partitions is available, setting the desired α-level allows one to immediately select the α-partition with the finest decomposition. At the same time, the persistence probabilities quantify the quality of each individual community. Because it assesses each cluster individually, this approach can also reveal single well-defined communities even in networks that, overall, do not possess a clearly clustered structure.
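The abstract does not state a formula, so the following is only a minimal sketch under stated assumptions. It assumes an undirected, weighted network and a random walker at stationarity, in which case the diagonal entry of the lumped chain for a cluster (its persistence probability) reduces to the cluster's internal weight divided by the total strength of its nodes. The function names, the toy graph, and the interpretation of "finest" as "largest number of communities" are illustrative choices, not taken from the paper.

```python
def persistence_probabilities(weights, labels):
    """Persistence probability of each community (assumed undirected graph).

    weights: symmetric adjacency matrix as a list of lists
    labels:  labels[i] is the community of node i
    Returns {community: internal weight / total strength}.
    """
    internal = {}   # weight of links with both endpoints inside the community
    strength = {}   # total weight attached to the community's nodes
    for i, row in enumerate(weights):
        ci = labels[i]
        for j, w in enumerate(row):
            strength[ci] = strength.get(ci, 0.0) + w
            if labels[j] == ci:
                internal[ci] = internal.get(ci, 0.0) + w
    # Communities with zero strength are never visited by the walker
    # and are left out of the result (an assumption of this sketch).
    return {c: internal.get(c, 0.0) / s for c, s in strength.items() if s > 0}


def is_alpha_partition(weights, labels, alpha):
    """True if every community has persistence probability >= alpha."""
    probs = persistence_probabilities(weights, labels)
    return all(u >= alpha for u in probs.values())


def finest_alpha_partition(weights, candidates, alpha):
    """Among candidate partitions, return the alpha-partition with the most
    communities, or None if no candidate qualifies."""
    valid = [p for p in candidates if is_alpha_partition(weights, p, alpha)]
    return max(valid, key=lambda p: len(set(p)), default=None)


# Toy graph (hypothetical): two triangles joined by a single link 2-3.
edges = [(0, 1), (0, 2), (1, 2), (3, 4), (3, 5), (4, 5), (2, 3)]
W = [[0.0] * 6 for _ in range(6)]
for a, b in edges:
    W[a][b] = W[b][a] = 1.0

one_block = [0, 0, 0, 0, 0, 0]
two_triangles = [0, 0, 0, 1, 1, 1]
singletons = [0, 1, 2, 3, 4, 5]
candidates = [one_block, two_triangles, singletons]

probs = persistence_probabilities(W, two_triangles)
best = finest_alpha_partition(W, candidates, alpha=0.8)
```

In this toy case each triangle has internal weight 6 (three links counted in both directions) and total strength 7, so its persistence probability is 6/7. With α = 0.8 the two-triangle partition is the finest α-partition among the candidates, whereas raising α to 0.9 leaves only the trivial one-block partition.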

MeSH terms

Cluster Analysis
Markov Chains*
Probability