Abstract
Multi-modal learning has shown advantages in improving weakly-supervised medical image segmentation (WS- MIS). However, most current works are based on paired data, which is infeasible to collect in certain scenarios. Although modal translation can be used to generate paired data, it often leads to low-quality translations, such as local deformations or irrational textures, without prior knowledge. This paper proposes a discriminative-aware image translation method, which introduces class activation maps (CAMs) to localize discriminative areas, thus overcoming the lack of pixel-wise annotations in WS-MIS. In addition, we design a CAM-correlation constraint that facilitates multi-modal complementary information exchange to enhance the consistency between CAMs generated from different modalities. Experimental results show that our method outperforms recent weakly-supervised segmentation works when using unpaired multi-modal data.
Original language | English |
---|---|
Title of host publication | 2024 IEEE International Conference on Multimedia and Expo, ICME 2024 |
Publisher | IEEE |
Number of pages | 6 |
ISBN (Electronic) | 9798350390155 |
ISBN (Print) | 9798350390162 |
DOIs | |
Publication status | Published - 16 Jul 2024 |
Event | 2024 IEEE International Conference on Multimedia and Expo, ICME 2024 - Niagara Falls Marriott, Niagra Falls, Canada Duration: 15 Jul 2024 → 19 Jul 2024 https://ieeexplore.ieee.org/xpl/conhome/10685847/proceeding (Conference proceeding) https://2024.ieeeicme.org/ (Conference website) https://2024.ieeeicme.org/program/ (Conference schedule) |
Publication series
Name | Proceedings - IEEE International Conference on Multimedia and Expo |
---|---|
ISSN (Print) | 1945-7871 |
ISSN (Electronic) | 1945-788X |
Conference
Conference | 2024 IEEE International Conference on Multimedia and Expo, ICME 2024 |
---|---|
Country/Territory | Canada |
City | Niagra Falls |
Period | 15/07/24 → 19/07/24 |
Internet address |
|
Scopus Subject Areas
- Computer Networks and Communications
- Computer Science Applications
User-Defined Keywords
- CAM
- Image translation
- Multi-modal medical images
- Weakly-supervised learning