TsCA: On the Semantic Consistency Alignment via Conditional Transport for Compositional Zero-Shot Learning

  • Miaoge Li
  • , Jingcai Guo*
  • , Richard Yi Da Xu
  • , Dongsheng Wang
  • , Xiaofeng Cao
  • , Zhijie Rao
  • , Song Guo
  • *Corresponding author for this work

Research output: Chapter in book/report/conference proceedingConference proceedingpeer-review

Abstract

Compositional Zero-Shot Learning (CZSL) aims to recognize novel state-object compositions by leveraging the shared knowledge of their primitive components. Despite considerable progress, effectively calibrating the bias between semantically similar multimodal representations, as well as generalizing pre-trained knowledge to novel compositional contexts, remains an enduring challenge. In this paper, our interest is to revisit the conditional transport (CT) theory and its homology to the visual-semantics interaction in CZSL and further, propose a novel Trisets Consistency Alignment framework (dubbed TsCA) that well-addresses these issues. Concretely, we utilize three distinct yet semantically homologous sets, i.e., patches, primitives, and compositions, to construct pairwise CT costs to minimize their semantic discrepancies. To further ensure the consistency transfer within these sets, we implement a cycle-consistency constraint that refines the learning by guaranteeing the feature consistency of the self-mapping during transport flow, regardless of modality. Moreover, we extend the CT plans to an open-world setting, which enables the model to effectively filter out unfeasible pairs, thereby speeding up the inference as well as increasing the accuracy. Extensive experiments are conducted to verify the effectiveness of the proposed method. The code is available at https://github.com/keepgoingjkg/TsCA.

Original languageEnglish
Title of host publicationProceedings of the 34th International Joint Conference on Artificial Intelligence, IJCAI 2025
EditorsJames Kwok
PublisherInternational Joint Conferences on Artificial Intelligence
Pages5607-5615
Number of pages9
ISBN (Electronic)9781956792065
DOIs
Publication statusPublished - 16 Aug 2025
Event34th International Joint Conference on Artificial Intelligence, IJCAI 2025 - Montreal, Canada
Duration: 16 Aug 202522 Aug 2025
https://www.ijcai.org/proceedings/2025/ (Conference proceedings)
https://2025.ijcai.org/ (Conference website)
https://2025.ijcai.org/montreal-at-a-glance/ (Conference program)

Publication series

NameIJCAI International Joint Conference on Artificial Intelligence
PublisherInternational Joint Conferences on Artificial Intelligence
ISSN (Print)1045-0823

Conference

Conference34th International Joint Conference on Artificial Intelligence, IJCAI 2025
Country/TerritoryCanada
CityMontreal
Period16/08/2522/08/25
Internet address

User-Defined Keywords

  • Classification
  • Transfer
  • low-shot
  • semi-supervised learning
  • un-supervised learning
  • Cost-sensitive learning
  • Few-shot learning

Fingerprint

Dive into the research topics of 'TsCA: On the Semantic Consistency Alignment via Conditional Transport for Compositional Zero-Shot Learning'. Together they form a unique fingerprint.

Cite this