TY - JOUR
T1 - Multidimensional Balance-Based Cluster Boundary Detection for High-Dimensional Data
AU - Cao, Xiaofeng
AU - Qiu, Baozhi
AU - Li, Xiangli
AU - Shi, Zenglin
AU - Xu, Guandong
AU - Xu, Jianliang
N1 - Funding Information:
Manuscript received December 6, 2016; revised September 17, 2017, February 27, 2018, and July 31, 2018; accepted October 3, 2018. Date of publication October 31, 2018; date of current version May 23, 2019. The work of G. Xu was supported by the Australian Research Council Linkage Project Scheme under Grant LP170100891. The work of J. Xu was supported by HK-RGC under Project HKBU12200817 and Project C1008-16G. (Corresponding authors: Guandong Xu; Jianliang Xu.) X. Cao is with the School of Information Engineering, Zhengzhou University, Zhengzhou 450001, China, and also with the Advanced Analytics Institute, University of Technology Sydney, Sydney, NSW 2008, Australia (e-mail: [email protected]).
PY - 2019/6
Y1 - 2019/6
N2 - The balance of neighborhood space around a central point is an important concept in cluster analysis. It can be used to effectively detect cluster boundary objects. The existing neighborhood analysis methods focus on the distribution of data, i.e., analyzing the characteristic of the neighborhood space from a single perspective, and could not obtain rich data characteristics. In this paper, we analyze the high-dimensional neighborhood space from multiple perspectives. By simulating each dimension of a data point's k nearest neighbors space (k NNs) as a lever, we apply the lever principle to compute the balance fulcrum of each dimension after proving its inevitability and uniqueness. Then, we model the distance between the projected coordinate of the data point and the balance fulcrum on each dimension and construct the DHBlan coefficient to measure the balance of the neighborhood space. Based on this theoretical model, we propose a simple yet effective cluster boundary detection algorithm called Lever. Experiments on both low- and high-dimensional data sets validate the effectiveness and efficiency of our proposed algorithm.
AB - The balance of neighborhood space around a central point is an important concept in cluster analysis. It can be used to effectively detect cluster boundary objects. The existing neighborhood analysis methods focus on the distribution of data, i.e., analyzing the characteristic of the neighborhood space from a single perspective, and could not obtain rich data characteristics. In this paper, we analyze the high-dimensional neighborhood space from multiple perspectives. By simulating each dimension of a data point's k nearest neighbors space (k NNs) as a lever, we apply the lever principle to compute the balance fulcrum of each dimension after proving its inevitability and uniqueness. Then, we model the distance between the projected coordinate of the data point and the balance fulcrum on each dimension and construct the DHBlan coefficient to measure the balance of the neighborhood space. Based on this theoretical model, we propose a simple yet effective cluster boundary detection algorithm called Lever. Experiments on both low- and high-dimensional data sets validate the effectiveness and efficiency of our proposed algorithm.
KW - Balance principle
KW - cluster boundary
KW - high-dimensional space
KW - unlimited lever
UR - http://www.scopus.com/inward/record.url?scp=85055865040&partnerID=8YFLogxK
U2 - 10.1109/TNNLS.2018.2874458
DO - 10.1109/TNNLS.2018.2874458
M3 - Journal article
C2 - 30387747
AN - SCOPUS:85055865040
SN - 2162-237X
VL - 30
SP - 1867
EP - 1880
JO - IEEE Transactions on Neural Networks and Learning Systems
JF - IEEE Transactions on Neural Networks and Learning Systems
IS - 6
M1 - 8517163
ER -