Efficient Maximal Motif-Clique Enumeration over Large Heterogeneous Information Networks

Yingli Zhou, Yixiang Fang*, Chenhao Ma, Tianci Hou, Xin Huang

*Corresponding author for this work

Research output: Contribution to journalConference articlepeer-review

Abstract

In the heterogeneous information network (HIN), a motif-clique is a "complete graph" for a given motif (or a small connected graph) that could capture the desired relationship in the motif. The maximal motif-cliques of HINs have found various applications in community discovery, recommendation, and biological network analysis. The state-of-the-art algorithm for enumerating maximal motif-cliques may have to explore all possible subgraphs of a maximal motif-clique and check whether a maximal motif-clique has been enumerated at each recursive step, which is very time-consuming. To improve the efficiency of enumeration, in this paper, we develop efficient algorithms for maximal motif-clique enumeration over large HINs. We first introduce an order-based framework to avoid duplicated enumeration, which results in lower time complexity compared to the existing algorithm. We then propose a pivot-based pruning strategy, which significantly reduces the search space. We further optimize the process of identifying the candidate sets and locating the subgraphs containing the maximal motif-cliques. Extensive experiments on five real-world HINs demonstrate that our proposed algorithm achieves high efficiency and is up to three orders of magnitude faster than the state-of-the-art algorithm.

Original languageEnglish
Pages (from-to)2946-2959
Number of pages14
JournalProceedings of the VLDB Endowment
Volume17
Issue number11
DOIs
Publication statusPublished - 30 Aug 2024
Event50th International Conference on Very Large Data Bases, VLDB 2024 - Guangzhou, China
Duration: 26 Aug 202430 Aug 2024
https://vldb.org/2024/ (Conference website)
https://dl.acm.org/loi/pvldb/group/d2020.y2024 (Conference proceedings)

Scopus Subject Areas

  • Computer Science (miscellaneous)
  • General Computer Science

Fingerprint

Dive into the research topics of 'Efficient Maximal Motif-Clique Enumeration over Large Heterogeneous Information Networks'. Together they form a unique fingerprint.

Cite this