TY - JOUR
T1 - Label distribution feature selection based on hierarchical structure and neighborhood granularity
AU - Lu, Xiwen
AU - Qian, Wenbin
AU - Dai, Shiming
AU - Huang, Jintao
N1 - This work is supported by the National Natural Science Foundation of China (No. 62366019 and No. 61966016), the Jiangxi Provincial Natural Science Foundation, China (No. 20224BAB202020), and the National Key Research and Development Program of China (No. 2022YFD1600202).
Publisher Copyright:
© 2024 Elsevier B.V.
PY - 2024/12
Y1 - 2024/12
N2 - Label Distribution Learning (LDL) addresses label ambiguity in datasets but struggles with high-dimensional data due to irrelevant features. Label Distribution Feature Selection (LDFS) methods can effectively unravel the issues, but they often overlook the advantages of utilizing hierarchical relationships among data, which can improve feature discriminability. Furthermore, these methods inadequately consider the granulation process, directly affecting the important features’ identification. To overcome these challenges, this study proposes a novel LDFS approach incorporating hierarchical structures and neighborhood granularity. Our algorithm proceeds in three stages: initially, it forms a multi-granular representation of data to reveal hierarchical relationships; subsequently, in the granulation process, it employs a variable precision rough set model, leveraging neighborhood granularity for a nuanced feature relevance assessment; and finally, it synthesizes these findings via a fusion strategy, culminating in a hierarchical feature ranking. Extensive experiments are conducted on thirteen benchmark datasets against five different algorithms in terms of six evaluation metrics. The results show that our method outperforms competitors in about 80% of the cases, demonstrating its effectiveness and generalization.
AB - Label Distribution Learning (LDL) addresses label ambiguity in datasets but struggles with high-dimensional data due to irrelevant features. Label Distribution Feature Selection (LDFS) methods can effectively unravel the issues, but they often overlook the advantages of utilizing hierarchical relationships among data, which can improve feature discriminability. Furthermore, these methods inadequately consider the granulation process, directly affecting the important features’ identification. To overcome these challenges, this study proposes a novel LDFS approach incorporating hierarchical structures and neighborhood granularity. Our algorithm proceeds in three stages: initially, it forms a multi-granular representation of data to reveal hierarchical relationships; subsequently, in the granulation process, it employs a variable precision rough set model, leveraging neighborhood granularity for a nuanced feature relevance assessment; and finally, it synthesizes these findings via a fusion strategy, culminating in a hierarchical feature ranking. Extensive experiments are conducted on thirteen benchmark datasets against five different algorithms in terms of six evaluation metrics. The results show that our method outperforms competitors in about 80% of the cases, demonstrating its effectiveness and generalization.
KW - Feature selection
KW - Granular computing
KW - Hierarchical structure
KW - Label distribution learning
KW - Rough sets
UR - http://www.scopus.com/inward/record.url?scp=85199776318&partnerID=8YFLogxK
U2 - 10.1016/j.inffus.2024.102588
DO - 10.1016/j.inffus.2024.102588
M3 - Journal article
AN - SCOPUS:85199776318
SN - 1566-2535
VL - 112
JO - Information Fusion
JF - Information Fusion
M1 - 102588
ER -