TY - JOUR
T1 - Discriminative Suprasphere Embedding for Fine-Grained Visual Categorization
AU - Ye, Shuo
AU - Peng, Qinmu
AU - Sun, Wenju
AU - Xu, Jiamiao
AU - Wang, Yu
AU - You, Xinge
AU - Cheung, Yiu-Ming
N1 - Publisher Copyright:
© 2022 IEEE
Funding Information:
This work was supported in part by the Natural Science Foundation of China (NSFC) under Grant 61772220 and in part by the Key Research and Development Plan of Hubei Province under Grant 2020BAB027.
Publisher Copyright:
IEEE
PY - 2024/4
Y1 - 2024/4
N2 - Despite the great success of the existing work in fine-grained visual categorization (FGVC), there are still several unsolved challenges, e.g., poor interpretation and vagueness contribution. To circumvent this drawback, motivated by the hypersphere embedding method, we propose a discriminative suprasphere embedding (DSE) framework, which can provide intuitive geometric interpretation and effectively extract discriminative features. Specifically, DSE consists of three modules. The first module is a suprasphere embedding (SE) block, which learns discriminative information by emphasizing weight and phase. The second module is a phase activation map (PAM) used to analyze the contribution of local descriptors to the suprasphere feature representation, which uniformly highlights the object region and exhibits remarkable object localization capability. The last module is a class contribution map (CCM), which quantitatively analyzes the network classification decision and provides insight into the domain knowledge about classified objects. Comprehensive experiments on three benchmark datasets demonstrate the effectiveness of our proposed method in comparison with state-of-the-art methods.
AB - Despite the great success of the existing work in fine-grained visual categorization (FGVC), there are still several unsolved challenges, e.g., poor interpretation and vagueness contribution. To circumvent this drawback, motivated by the hypersphere embedding method, we propose a discriminative suprasphere embedding (DSE) framework, which can provide intuitive geometric interpretation and effectively extract discriminative features. Specifically, DSE consists of three modules. The first module is a suprasphere embedding (SE) block, which learns discriminative information by emphasizing weight and phase. The second module is a phase activation map (PAM) used to analyze the contribution of local descriptors to the suprasphere feature representation, which uniformly highlights the object region and exhibits remarkable object localization capability. The last module is a class contribution map (CCM), which quantitatively analyzes the network classification decision and provides insight into the domain knowledge about classified objects. Comprehensive experiments on three benchmark datasets demonstrate the effectiveness of our proposed method in comparison with state-of-the-art methods.
KW - Deep hypersphere embedding
KW - discriminative localization
KW - fine-grained visual categorization (FGVC)
KW - weakly supervised learning
UR - http://www.scopus.com/inward/record.url?scp=85139453249&partnerID=8YFLogxK
U2 - 10.1109/TNNLS.2022.3202534
DO - 10.1109/TNNLS.2022.3202534
M3 - Journal article
SN - 2162-237X
VL - 35
SP - 5092
EP - 5102
JO - IEEE Transactions on Neural Networks and Learning Systems
JF - IEEE Transactions on Neural Networks and Learning Systems
IS - 4
ER -