Discrete Elective Hashing with Incomplete Labels for Efficient Cross-Modal Retrieval

Donglin Zhang, Chang-Xing Li, Mengke Li, Zhikai Hu*

*Corresponding author for this work

Research output: Contribution to journalJournal articlepeer-review

Abstract

Recently, supervised cross-modal hashing methods have gained considerable attention due to their ability to mine credible semantic relationships between multi-modal data. These methods typically rely on labels to explore semantic relationships provided that labels are always reliable, which, however, may not be true from the practical perspective. In fact, labels may be incomplete, i.e., true-label incomplete and fine-grained incomplete, which makes the performance of the existing methods deteriorated. To this end, we propose a method called Discrete Elective Hashing with Incomplete Labels (DEH-IL), which is designed to alleviate the impact of incomplete labels. Specifically, we introduce a relaxed label scheme that allows the algorithm to automatically mine potential missing information from incomplete labels, which is beneficial for exploring intra-class relationships. Moreover, we propose a novel elective loss that aggregates all estimations from incomplete labels to mine inter-class relationships. Since elective loss does not rely on any single estimation, it can effectively mitigate estimation errors arising from incomplete labels. By combining these two components, DEH-IL can effectively explore both intra-class and inter-class relationships through incomplete labels. Experimental results on benchmark datasets demonstrate the effectiveness of the proposed method.
Original languageEnglish
Article number160
Number of pages20
JournalACM Transactions on Multimedia Computing, Communications and Applications
Volume21
Issue number6
Early online date20 May 2025
DOIs
Publication statusPublished - Jun 2025

User-Defined Keywords

  • cross-modal retrieval
  • hashing
  • incomplete label
  • multi-label

Fingerprint

Dive into the research topics of 'Discrete Elective Hashing with Incomplete Labels for Efficient Cross-Modal Retrieval'. Together they form a unique fingerprint.

Cite this