AHIVE: Anatomy-aware Hierarchical Vision Encoding for Interactive Radiology Report Retrieval

Sixing Yan*, William K. Cheung, Ivor W. Tsang, Keith Chiu, Terence M. Tong, Ka Chun Cheung, Simon See

*Corresponding author for this work

Research output: Chapter in book/report/conference proceedingConference proceedingpeer-review

Abstract

Automatic radiology report generation using deep learning models has been recently explored and found promising. Neural decoders are commonly used for the report generation where irrelevant and unfaithful contents are unavoidable. The retrieval-based approach alleviates the limitation by identifying reports which are relevant to the input to assist the generation. To achieve clinically accurate report retrieval we make reference to clinicians' diagnostic steps of examining a radiology image where anatomical and diagnostic details are typically focused and propose a novel hierarchical visual concept representation called anatomy-aware hierarchical vision encoding (AHIVE). To learn AHIVE we first derive a methodology to extract hierarchical diagnostic descriptions from radiology reports and develop a CLIP-based framework for the model training. Also the hierarchical architecture of AHIVE is designed to support interactive report retrieval so that report revision made at one layer can be propagated to the subsequent ones to trigger other necessary revisions. We conduct extensive experiments and show that AHIVE can outperform the SOTA vision-language retrieval methods in terms of clinical accuracy by a large margin. We provide also a case study to illustrate how it enables interactive report retrieval.
Original languageEnglish
Title of host publicationProceedings of 2024 IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2024
PublisherIEEE
Pages14324-14333
Number of pages10
Publication statusPublished - Jun 2024
EventThe IEEE / CVF Computer Vision and Pattern Recognition Conference, CVPR 2024 - Seattle Convention Center, Seattle, United States
Duration: 17 Jun 202421 Jun 2024
https://cvpr.thecvf.com/ (conference website)
https://cvpr2023.thecvf.com/virtual/2023/calendar (Link to conference schedule)
https://media.eventhosts.cc/Conferences/CVPR2024/CVPR_main_conf_2024.pdf (Link to conference booklet)
https://openaccess.thecvf.com/CVPR2024 (Conference proceedings)

Publication series

NameProceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition

Conference

ConferenceThe IEEE / CVF Computer Vision and Pattern Recognition Conference, CVPR 2024
Abbreviated titleCVPR 2024
Country/TerritoryUnited States
CitySeattle
Period17/06/2421/06/24
Internet address

Scopus Subject Areas

  • Computer Vision and Pattern Recognition
  • Artificial Intelligence
  • Health Informatics

Fingerprint

Dive into the research topics of 'AHIVE: Anatomy-aware Hierarchical Vision Encoding for Interactive Radiology Report Retrieval'. Together they form a unique fingerprint.

Cite this