TY - JOUR
T1 - Joint Semantic Preserving Sparse Hashing for Cross-Modal Retrieval
AU - Hu, Zhikai
AU - Cheung, Yiu-ming
AU - Li, Mengke
AU - Lan, Weichao
AU - Zhang, Donglin
AU - Liu, Qiang
N1 - This work was supported in part by the NSFC/Research Grants Council (RGC) Joint Research Scheme under Grant N_HKBU214/21, in part by the General Research Fund of RGC under Grant 12201321 and Grant 12202622, in part by the National Natural Science Foundation of China under Grant 61991401, Grant U20A20189, and Grant 62161160338, in part by NSFC under Grant 62202204, and in part by the Fundamental Research Funds for the Central Universities under Grant JUSRP123032.
Publisher copyright:
© 2023 The Authors.
PY - 2024/4
Y1 - 2024/4
N2 - Supervised cross-modal hashing has received wide attention in recent years. However, existing methods primarily rely on sample-wise semantic relationships to evaluate the semantic similarity between samples, overlooking the impact of label distribution on enhancing retrieval performance. Moreover, the limited representation capability of traditional dense hash codes hinders the preservation of semantic relationship. To overcome these challenges, we propose a new method, Joint Semantic Preserving Sparse Hashing (JSPSH). Specifically, we introduce a new concept of cluster-wise semantic relationship, which leverages label distribution to indicate which samples are more suitable for clustering. Then, we jointly utilize sample-wise and cluster-wise semantic relationships to supervise the learning of hash codes. In this way, JSPSH preserves both kinds of semantic relationships to ensure that more samples with similar semantics are clustered together, thereby achieving better retrieval results. Furthermore, we utilize high-dimensional sparse hash codes that offer stronger representation capability to preserve such more complex semantics. Finally, an interaction term is introduced in hash functions learning stage to further narrow the gap between modalities. Experimental results on three large-scale datasets demonstrate the effectiveness of JSPSH in achieving superior retrieval performance. Codes are available at https://github.com/hutt94/JSPSH.
AB - Supervised cross-modal hashing has received wide attention in recent years. However, existing methods primarily rely on sample-wise semantic relationships to evaluate the semantic similarity between samples, overlooking the impact of label distribution on enhancing retrieval performance. Moreover, the limited representation capability of traditional dense hash codes hinders the preservation of semantic relationship. To overcome these challenges, we propose a new method, Joint Semantic Preserving Sparse Hashing (JSPSH). Specifically, we introduce a new concept of cluster-wise semantic relationship, which leverages label distribution to indicate which samples are more suitable for clustering. Then, we jointly utilize sample-wise and cluster-wise semantic relationships to supervise the learning of hash codes. In this way, JSPSH preserves both kinds of semantic relationships to ensure that more samples with similar semantics are clustered together, thereby achieving better retrieval results. Furthermore, we utilize high-dimensional sparse hash codes that offer stronger representation capability to preserve such more complex semantics. Finally, an interaction term is introduced in hash functions learning stage to further narrow the gap between modalities. Experimental results on three large-scale datasets demonstrate the effectiveness of JSPSH in achieving superior retrieval performance. Codes are available at https://github.com/hutt94/JSPSH.
KW - Codes
KW - Encoding
KW - Hash functions
KW - Quantization (signal)
KW - Semantics
KW - Sparse matrices
KW - Task analysis
KW - Cross-modal retrieval
KW - hashing
KW - cluster-wise semantics
KW - sample-wise semantics
KW - discrete optimization
KW - clustering
UR - http://www.scopus.com/inward/record.url?scp=85168697161&partnerID=8YFLogxK
U2 - 10.1109/TCSVT.2023.3307608
DO - 10.1109/TCSVT.2023.3307608
M3 - Journal article
SN - 1051-8215
VL - 34
SP - 2989
EP - 3002
JO - IEEE Transactions on Circuits and Systems for Video Technology
JF - IEEE Transactions on Circuits and Systems for Video Technology
IS - 4
ER -