TY - GEN
T1 - PP-DBLP
T2 - 18th IEEE International Conference on Data Mining Workshops, ICDMW 2018
AU - HUANG, Xin
AU - Jiang, Jiaxin
AU - CHOI, Koon Kau
AU - XU, Jianliang
AU - ZHANG, Zhiwei
AU - SONG, Celine
N1 - Funding Information:
This work is supported by the Hong Kong General Research Fund (GRF) Project Nos. HKBU 12200917, 12232716, 12258116, 12632816, and National Natural Science Foundation of China (NSFC) Project Nos. 61702435, 61602395.
PY - 2018/7/2
Y1 - 2018/7/2
N2 - In many online social networks (e.g., Facebook, Google+, Twitter, and Instagram), users prefer to hide her/his partial or all relationships, which makes such private relationships not visible to public users or even friends. This leads to a new graph model called public-private networks, where each user has her/his own perspective of the network including the private connections. Recently, public-private network analysis has attracted significant research interest in the literature. A great deal of important graph computing problems (e.g., shortest paths, centrality, PageRank, and reachability tree) has been studied. However, due to the limited data sources and privacy concerns, proposed approaches are not tested on real-world datasets, but on synthetic datasets by randomly selecting vertices as private ones. Therefore, real-world datasets of public-private networks are essential and urgently needed for such algorithms in the evaluation of efficiency and effectiveness. In this paper, we generate four public-private networks from real-world DBLP records, called PP-DBLP. We take published articles as public information and regard ongoing collaborations as the hidden information, which is only known by the authors. Our released datasets of PP-DBLP offer the prospects for verifying various kinds of efficient public-private analysis algorithms in a fair way. In addition, motivated by widely existing attributed graphs, we propose an advanced model of attributed public-private graphs where vertices have not only private edges but also private attributes. We also discuss open problems on attributed public-private graphs. Preliminary experimental results on our generated real-world datasets verify the effectiveness and efficiency of public-private models and state-of-the-art algorithms.
AB - In many online social networks (e.g., Facebook, Google+, Twitter, and Instagram), users prefer to hide her/his partial or all relationships, which makes such private relationships not visible to public users or even friends. This leads to a new graph model called public-private networks, where each user has her/his own perspective of the network including the private connections. Recently, public-private network analysis has attracted significant research interest in the literature. A great deal of important graph computing problems (e.g., shortest paths, centrality, PageRank, and reachability tree) has been studied. However, due to the limited data sources and privacy concerns, proposed approaches are not tested on real-world datasets, but on synthetic datasets by randomly selecting vertices as private ones. Therefore, real-world datasets of public-private networks are essential and urgently needed for such algorithms in the evaluation of efficiency and effectiveness. In this paper, we generate four public-private networks from real-world DBLP records, called PP-DBLP. We take published articles as public information and regard ongoing collaborations as the hidden information, which is only known by the authors. Our released datasets of PP-DBLP offer the prospects for verifying various kinds of efficient public-private analysis algorithms in a fair way. In addition, motivated by widely existing attributed graphs, we propose an advanced model of attributed public-private graphs where vertices have not only private edges but also private attributes. We also discuss open problems on attributed public-private graphs. Preliminary experimental results on our generated real-world datasets verify the effectiveness and efficiency of public-private models and state-of-the-art algorithms.
KW - Attributed Networks
KW - PP DBLP
KW - Public Private Graphs
KW - Real world Datasets
KW - Social Network Analysis
UR - http://www.scopus.com/inward/record.url?scp=85062862027&partnerID=8YFLogxK
U2 - 10.1109/ICDMW.2018.00142
DO - 10.1109/ICDMW.2018.00142
M3 - Conference proceeding
AN - SCOPUS:85062862027
T3 - IEEE International Conference on Data Mining Workshops, ICDMW
SP - 986
EP - 989
BT - Proceedings - 18th IEEE International Conference on Data Mining Workshops, ICDMW 2018
A2 - Tong, Hanghang
A2 - Li, Zhenhui
A2 - Zhu, Feida
A2 - Yu, Jeffrey
PB - IEEE Computer Society
Y2 - 17 November 2018 through 20 November 2018
ER -