TY - JOUR
T1 - Fuzzy neighborhood-based partial label feature selection via label iterative disambiguation
AU - Li, Junqi
AU - Qian, Wenbin
AU - Yang, Wenji
AU - Liu, Suxuan
AU - Huang, Jintao
N1 - Funding Information:
This work is supported by the National Natural Science Foundation of China (No.62366019, No.62366018 and No.61966016), Jiangxi Provincial Natural Science Foundation, China (No.20242BAB23014, No.20224BAB202020, and No.20224BAB202015), and Scientific Research Project of State Grid Corporation of Jiangxi Province, China (No.521820240023).
Publisher Copyright:
© 2025 Elsevier Inc. All rights are reserved, including those for text and data mining, AI training, and similar technologies.
PY - 2025/4
Y1 - 2025/4
N2 - Partial label learning is a specific weakly supervised learning framework in which each training sample is associated with a candidate label set in which the ground-truth label is concealed. Feature selection can remove redundant and irrelevant features to improve the generalization performance of the classification model. However, the impact of ambiguous labels is an essential challenge when adopting feature selection for partial label data. In this paper, a novel two-stage feature selection method is proposed, called fuzzy neighborhood-based partial label feature selection with label iterative disambiguation. In the first stage, the proposed method addresses the issue of noise labels by employing a neighborhood-based iterative strategy to enlarge the gap between ground-truth labels and noisy labels. Subsequently, the labeling confidence induced by label disambiguation is utilized to enhance the robustness of feature selection. In the second stage, feature significance is evaluated using three metrics based on fuzzy neighborhoods. Specifically, fuzzy dependency is obtained using fuzzy relations and labeling confidence. Fuzzy neighborhood entropy-based information gain is proposed as an uncertainty measure. Furthermore, the similarity between samples in the same fuzzy neighborhood is used to estimate neighborhood consistency. The fusion of the above metrics can select more discriminative features for partial label learning. Finally, experimental results on eight controlled UCI datasets and five real-world datasets demonstrate that the proposed method achieves superior performance than other compared methods.
AB - Partial label learning is a specific weakly supervised learning framework in which each training sample is associated with a candidate label set in which the ground-truth label is concealed. Feature selection can remove redundant and irrelevant features to improve the generalization performance of the classification model. However, the impact of ambiguous labels is an essential challenge when adopting feature selection for partial label data. In this paper, a novel two-stage feature selection method is proposed, called fuzzy neighborhood-based partial label feature selection with label iterative disambiguation. In the first stage, the proposed method addresses the issue of noise labels by employing a neighborhood-based iterative strategy to enlarge the gap between ground-truth labels and noisy labels. Subsequently, the labeling confidence induced by label disambiguation is utilized to enhance the robustness of feature selection. In the second stage, feature significance is evaluated using three metrics based on fuzzy neighborhoods. Specifically, fuzzy dependency is obtained using fuzzy relations and labeling confidence. Fuzzy neighborhood entropy-based information gain is proposed as an uncertainty measure. Furthermore, the similarity between samples in the same fuzzy neighborhood is used to estimate neighborhood consistency. The fusion of the above metrics can select more discriminative features for partial label learning. Finally, experimental results on eight controlled UCI datasets and five real-world datasets demonstrate that the proposed method achieves superior performance than other compared methods.
KW - Feature selection
KW - Fuzzy neighborhood rough sets
KW - Granular computing
KW - Partial label learning
KW - Uncertainty measure
UR - http://www.scopus.com/inward/record.url?scp=85214342030&partnerID=8YFLogxK
U2 - 10.1016/j.ijar.2024.109358
DO - 10.1016/j.ijar.2024.109358
M3 - Journal article
AN - SCOPUS:85214342030
SN - 0888-613X
VL - 179
JO - International Journal of Approximate Reasoning
JF - International Journal of Approximate Reasoning
M1 - 109358
ER -