Skip to main navigation Skip to search Skip to main content

CURV: Coherent Uncertainty-Aware Reasoning in Vision-Language Models for X-Ray Report Generation

Research output: Chapter in book/report/conference proceedingConference proceedingpeer-review

Abstract

Vision-language models have been explored for radiology report generation with promising results. Yet, uncertainty elaborated in findings and the reasoning process for reaching clinical impressions are seldom explicitly modeled, reducing the clinical accuracy and trustworthiness of the generated reports. We present CURV, a novel framework that alleviates the limitations through integrated awareness of uncertainty and explicit reasoning capabilities. Our approach consists of three key components: (1) an uncertainty modeling mechanism that teaches the model to recognize and express appropriate levels of diagnostic confidence, (2) a structured reasoning framework that generates intermediate explanatory steps connecting visual findings to clinical impressions, and (3) a reasoning coherence reward that ensures logical consistency among findings, reasoning, and impressions. We implement CURV through a three-stage training pipeline that combines uncertainty-aware fine-tuning, reasoning initialization, and reinforcement learning. In particular, we adopt a comprehensive reward function addresses multiple aspects of report quality, incorporating medical term matching, uncertainty expression evaluation, and semantic coherence evaluation. Experimental results demonstrate that CURV generates clinically relevant reports with appropriate uncertainty expressions and transparent reasoning traces, significantly outperforming previous methods. CURV represents a substantial advancement toward interpretable and trustworthy AI-generated radiology reports, with broader implications for the deployment of vision-language models in high-stakes clinical environments where uncertainty awareness and reasoning transparency are essential.
Original languageEnglish
Title of host publication39th Conference on Neural Information Processing Systems, NeurIPS 2025
PublisherNeural Information Processing Systems Foundation
Pages1-28
Number of pages28
Publication statusPublished - 2 Dec 2025
Event39th Conference on Neural Information Processing Systems, NeurIPS 2025 - San Diego, United States
Duration: 2 Dec 20257 Dec 2025
https://neurips.cc/Conferences/2025 (Conference website)
https://neurips.cc/virtual/2025/loc/san-diego/papers.html (Conference schedule)
https://proceedings.neurips.cc/paper_files/paper/2025 (Conference proceedings)

Conference

Conference39th Conference on Neural Information Processing Systems, NeurIPS 2025
Abbreviated titleNeurIPS 2025
Country/TerritoryUnited States
CitySan Diego
Period2/12/257/12/25
Internet address

UN SDGs

This output contributes to the following UN Sustainable Development Goals (SDGs)

  1. SDG 3 - Good Health and Well-being
    SDG 3 Good Health and Well-being

Fingerprint

Dive into the research topics of 'CURV: Coherent Uncertainty-Aware Reasoning in Vision-Language Models for X-Ray Report Generation'. Together they form a unique fingerprint.

Cite this