Temporal Neighboring Multi-modal Transformer with Missingness-Aware Prompt for Hepatocellular Carcinoma Prediction

Jingwen Xu, Ye Zhu, Fei Lyu, Grace Lai-Hung Wong, Pong C. Yuen*

*Corresponding author for this work

Research output: Chapter in book/report/conference proceedingConference proceedingpeer-review

Abstract

Early prediction of hepatocellular carcinoma (HCC) is necessary to facilitate appropriate surveillance strategy and reduce cancer mortality. Incorporating CT scans and clinical time series can greatly increase the accuracy of predictive models. However, there are two challenges to effective multi-modal learning: (a) CT scans and clinical time series suffer from temporal misalignment. (b) CT scans can be missing compared with clinical time series. To tackle the above challenges, we propose a Temporal Neighboring Multi-modal Transformer with Missingness Aware Prompt (TNformer-MP) to integrate clinical time series and available CT scans for HCC prediction. To explore the inter-modality temporal correspondence, a Temporal Neighboring Multi-modal Tokenizer (TN-MT) is exploited to fuse CT embedding into neighboring clinical time series tokens across multiple scales. To mitigate the performance drop caused by missing CT modality, TNformer-MP exploits a Missingness-aware Prompt-driven Multi-modal Tokenizer (MP-MT) that adjusts the encoding of clinical time series tokens with learnable prompts. Experiments conducted on large-scale multi-modal datasets of 36,353 patients show that our method achieves superior performance compared to existing methods.
Original languageEnglish
Title of host publicationMedical Image Computing and Computer Assisted Intervention – MICCAI 2024
Subtitle of host publication27th International Conference, Marrakesh, Morocco, October 6–10, 2024, Proceedings, Part I
EditorsMarius George Linguraru, Qi Dou, Aasa Feragen, Stamatia Giannarou, Ben Glocker, Karim Lekadir, Julia A. Schnabel
Place of PublicationCham
PublisherSpringer
Pages79-88
Number of pages10
Edition1st
ISBN (Electronic)9783031723780
ISBN (Print)9783031723773
DOIs
Publication statusPublished - 2 Oct 2024
Event27th International Conference on Medical Image Computing and Computer-Assisted Intervention, MICCAI 2024 - Marrakesh, Morocco
Duration: 6 Oct 202410 Oct 2024
https://link.springer.com/book/10.1007/978-3-031-72083-3 (Conference Proceedings (Part IV))
https://link.springer.com/book/10.1007/978-3-031-72111-3 (Conference Proceedings (Part VIII))
https://conferences.miccai.org/2024/en/default.asp (Conference website)

Publication series

NameLecture Notes in Computer Science
PublisherSpringer
Volume15001
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349
NameMICCAI: International Conference on Medical Image Computing and Computer-Assisted Intervention

Conference

Conference27th International Conference on Medical Image Computing and Computer-Assisted Intervention, MICCAI 2024
Country/TerritoryMorocco
CityMarrakesh
Period6/10/2410/10/24
Internet address

User-Defined Keywords

  • Hepatocellular carcinoma
  • Multi-modal learning
  • Temporal neighboring
  • Prompt

Fingerprint

Dive into the research topics of 'Temporal Neighboring Multi-modal Transformer with Missingness-Aware Prompt for Hepatocellular Carcinoma Prediction'. Together they form a unique fingerprint.

Cite this