TY - JOUR
T1 - Granular Ball-Guided Disambiguation for Partial Multilabel Feature Selection via Maximum Consistency Minimum Uncertainty
AU - Xu, Fankang
AU - Qian, Wenbin
AU - Shu, Wenhao
AU - Huang, Jintao
AU - Ding, Weiping
AU - Xia, Shuyin
N1 - Funding information:
This work was supported in part by the National Natural Science Foundation of China under Grant 62366019, Grant 62266018, and Grant 61966016; in part by Jiangxi Provincial Natural Science Foundation, China, under Grant 20242BAB23014; and in part by the National Key Research and Development Program of China under Grant 2024YFF1307305. (Corresponding author: Wenbin Qian.)
Publisher Copyright:
© 2025 IEEE
PY - 2025/10/23
Y1 - 2025/10/23
N2 - Partial multilabel feature selection (PMLFS) is a prevalent subject that aims to enhance the performance of multilabel learning (MLL) in the context of noisy labels. In PMLFS, a crucial aspect is handling the false positive labels hidden in the candidate label set, as the imprecise annotations could mislead the feature selection process. However, many existing approaches for partial label disambiguation rely on topology information and tend to be error-prone. Besides, feature selection frameworks are often built upon a linear regression model, leading to a reliance on the classifier and a deficiency in exploring local structures. Focusing on the issues above, this article proposes a novel two-stage PMLFS method, resorting to the ideology of granular computing. In the first stage, a label disambiguation method is developed using label-specific information. Specifically, a specific granular ball computing model is designed to characterize the distribution of datapoints labeled differently, and therefore, using the affinity relationships among samples and balls, the label-specific information concealed in the data distribution can be captured for label disambiguation. In the second stage, a filter-based feature selection method that explores the local structure of samples is presented. This method relies on a devised fuzzy decision neighborhood rough set (FDNRS) to capture more detailed membership information by maximizing the neighborhood consistency of samples' related labels. Simultaneously, the feature selection method minimizes the uncertainty derived from unrelated labels. Extensive experiments on 12 datasets in terms of four evaluation metrics demonstrated the effectiveness of the proposed approach.
AB - Partial multilabel feature selection (PMLFS) is a prevalent subject that aims to enhance the performance of multilabel learning (MLL) in the context of noisy labels. In PMLFS, a crucial aspect is handling the false positive labels hidden in the candidate label set, as the imprecise annotations could mislead the feature selection process. However, many existing approaches for partial label disambiguation rely on topology information and tend to be error-prone. Besides, feature selection frameworks are often built upon a linear regression model, leading to a reliance on the classifier and a deficiency in exploring local structures. Focusing on the issues above, this article proposes a novel two-stage PMLFS method, resorting to the ideology of granular computing. In the first stage, a label disambiguation method is developed using label-specific information. Specifically, a specific granular ball computing model is designed to characterize the distribution of datapoints labeled differently, and therefore, using the affinity relationships among samples and balls, the label-specific information concealed in the data distribution can be captured for label disambiguation. In the second stage, a filter-based feature selection method that explores the local structure of samples is presented. This method relies on a devised fuzzy decision neighborhood rough set (FDNRS) to capture more detailed membership information by maximizing the neighborhood consistency of samples' related labels. Simultaneously, the feature selection method minimizes the uncertainty derived from unrelated labels. Extensive experiments on 12 datasets in terms of four evaluation metrics demonstrated the effectiveness of the proposed approach.
KW - Feature selection
KW - granular ball computing
KW - multilabel learning (MLL)
KW - neighborhood rough set
UR - https://www.scopus.com/pages/publications/105019673798
U2 - 10.1109/TNNLS.2025.3610795
DO - 10.1109/TNNLS.2025.3610795
M3 - Journal article
AN - SCOPUS:105019673798
SN - 2162-237X
JO - IEEE Transactions on Neural Networks and Learning Systems
JF - IEEE Transactions on Neural Networks and Learning Systems
ER -