Adaptive Local Clustering over Attributed Graphs

Haoran Zheng, Renchi Yang*, Jianliang Xu

*Corresponding author for this work

Research output: Chapter in book/report/conference proceedingConference proceedingpeer-review

Abstract

Given a graph G and a seed node vs, the objective of local graph clustering (LGC) is to identify a subgraph Cs∈G (a.k.a. local cluster) surrounding vs in time roughly linear with the size of Cs. This approach yields personalized clusters without needing to access the entire graph, which makes it highly suitable for numerous applications involving large graphs. However, most existing solutions merely rely on the topological connectivity between nodes in G, rendering them vulnerable to missing or noisy links that are commonly present in real-world graphs.

To address this issue, this paper resorts to leveraging the complementary nature of graph topology and node attributes to enhance local clustering quality. To effectively exploit the attribute information, we first formulate the LGC as an estimation of the bidirectional diffusion distribution (BDD), which is specialized for capturing the multi-hop affinity between nodes in the presence of attributes. Furthermore, we propose LACA, an efficient and effective approach for LGC that achieves superb empirical performance on multiple real datasets while maintaining strong locality. The core components of LACA include (i) a fast and theoretically-grounded preprocessing technique for node attributes, (ii) an adaptive algorithm for diffusing any vectors over G with rigorous theoretical guarantees and expedited convergence, and (iii) an effective three-step scheme for BDD approximation. Extensive experiments, comparing 17 competitors on 8 real datasets, show that LACA outperforms all competitors in terms of result quality measured against ground truth local clusters, while also being up to orders of magnitude faster.
Original languageEnglish
Title of host publicationProceedings of the 41st IEEE International Conference on Data Engineering, ICDE 2025
Place of PublicationHong Kong
PublisherIEEE
Number of pages21
Publication statusPublished - 21 May 2025
Event41st IEEE International Conference on Data Engineering, ICDE 2025 - The Hong Kong Polytechnic University, Hong Kong, China
Duration: 19 May 202523 May 2025
https://ieee-icde.org/2025/
https://ieee-icde.org/2025/research-papers/

Publication series

NameInternational Conference on Data Engineering

Conference

Conference41st IEEE International Conference on Data Engineering, ICDE 2025
Abbreviated titleICDE 2025
Country/TerritoryChina
CityHong Kong
Period19/05/2523/05/25
Internet address

User-Defined Keywords

  • local cluster
  • attributes
  • graph diffusion
  • adaptive algorithm

Fingerprint

Dive into the research topics of 'Adaptive Local Clustering over Attributed Graphs'. Together they form a unique fingerprint.

Cite this