TY - JOUR
T1 - Butterfly-core community search over labeled graphs
AU - Dong, Zheng
AU - Huang, Xin
AU - Yuan, Guorui
AU - Zhu, Hengshu
AU - Xiong, Hui
N1 - Funding Information:
This work was supported by the Natural Science Foundation of China under Grant No. 61836013 and HK RGC Grant No. 22200320.
PY - 2021/7
Y1 - 2021/7
N2 - Community search aims at finding densely connected subgraphs for query vertices in a graph. While this task has been studied widely in the literature, most of the existing works only focus on finding homogeneous communities rather than heterogeneous communities with different labels. In this paper, we motivate a new problem of cross-group community search, namely Butterfly-Core Community (BCC), over a labeled graph, where each vertex has a label indicating its properties and an edge between two vertices indicates their cross relationship. Specifically, for two query vertices with different labels, we aim to find a densely connected cross community that contains two query vertices and consists of butterfly networks, where each wing of the butterflies is induced by a k-core search based on one query vertex and two wings are connected by these butterflies. We first develop a heuristic algorithm achieving 2-approximation to the optimal solution. Furthermore, we design fast techniques of query distance computations, leader pair identifications, and index-based BCC local explorations. Extensive experiments on seven real datasets and four useful case studies validate the effectiveness and efficiency of our BCC and its multi-labeled extension models.
AB - Community search aims at finding densely connected subgraphs for query vertices in a graph. While this task has been studied widely in the literature, most of the existing works only focus on finding homogeneous communities rather than heterogeneous communities with different labels. In this paper, we motivate a new problem of cross-group community search, namely Butterfly-Core Community (BCC), over a labeled graph, where each vertex has a label indicating its properties and an edge between two vertices indicates their cross relationship. Specifically, for two query vertices with different labels, we aim to find a densely connected cross community that contains two query vertices and consists of butterfly networks, where each wing of the butterflies is induced by a k-core search based on one query vertex and two wings are connected by these butterflies. We first develop a heuristic algorithm achieving 2-approximation to the optimal solution. Furthermore, we design fast techniques of query distance computations, leader pair identifications, and index-based BCC local explorations. Extensive experiments on seven real datasets and four useful case studies validate the effectiveness and efficiency of our BCC and its multi-labeled extension models.
UR - http://vldb.org/pvldb/volumes/14/paper/Butterfly-Core%20Community%20Search%20over%20Labeled%20Graphs
UR - http://www.scopus.com/inward/record.url?scp=85119694105&partnerID=8YFLogxK
U2 - 10.14778/3476249.3476258
DO - 10.14778/3476249.3476258
M3 - Conference article
AN - SCOPUS:85119694105
SN - 2150-8097
VL - 14
SP - 2006
EP - 2018
JO - Proceedings of the VLDB Endowment
JF - Proceedings of the VLDB Endowment
IS - 11
T2 - 47th International Conference on Very Large Data Bases, VLDB 2021
Y2 - 16 August 2021 through 20 August 2021
ER -