On the spectral characterization and scalable mining of network communities

Yang Bo*, Jiming Liu, Jianfeng Feng

*Corresponding author for this work

Research output: Contribution to journalJournal articlepeer-review

51 Citations (Scopus)

Abstract

Network communities refer to groups of vertices within which their connecting links are dense but between which they are sparse. A network community mining problem (or NCMP for short) is concerned with the problem of finding all such communities from a given network. A wide variety of applications can be formulated as NCMPs, ranging from social and/or biological network analysis to web mining and searching. So far, many algorithms addressing NCMPs have been developed and most of them fall into the categories of either optimization based or heuristic methods. Distinct from the existing studies, the work presented in this paper explores the notion of network communities and their properties based on the dynamics of a stochastic model naturally introduced. In the paper, a relationship between the hierarchical community structure of a network and the local mixing properties of such a stochastic model has been established with the large-deviation theory. Topological information regarding to the community structures hidden in networks can be inferred from their spectral signatures. Based on the above-mentioned relationship, this work proposes a general framework for characterizing, analyzing, and mining network communities. Utilizing the two basic properties of metastability, i.e., being locally uniform and temporarily fixed, an efficient implementation of the framework, called the LM algorithm, has been developed that can scalably mine communities hidden in large-scale networks. The effectiveness and efficiency of the LM algorithm have been theoretically analyzed as well as experimentally validated.

Original languageEnglish
Article number5645622
Pages (from-to)326-337
Number of pages12
JournalIEEE Transactions on Knowledge and Data Engineering
Volume24
Issue number2
DOIs
Publication statusPublished - Feb 2012

Scopus Subject Areas

  • Information Systems
  • Computer Science Applications
  • Computational Theory and Mathematics

User-Defined Keywords

  • community structure
  • large-deviation theory
  • local mixing
  • Markov chain
  • Social network

Fingerprint

Dive into the research topics of 'On the spectral characterization and scalable mining of network communities'. Together they form a unique fingerprint.

Cite this