Abstract
Vision-language models have been explored for radiology report generation with promising results. Yet the uncertainty expressed in findings and the reasoning process for reaching clinical impressions are seldom explicitly modeled, which reduces the clinical accuracy and trustworthiness of the generated reports. We present CURV, a novel framework that addresses these limitations through integrated uncertainty awareness and explicit reasoning capabilities. Our approach consists of three key components: (1) an uncertainty modeling mechanism that teaches the model to recognize and express appropriate levels of diagnostic confidence, (2) a structured reasoning framework that generates intermediate explanatory steps connecting visual findings to clinical impressions, and (3) a reasoning coherence reward that ensures logical consistency among findings, reasoning, and impressions. We implement CURV through a three-stage training pipeline that combines uncertainty-aware fine-tuning, reasoning initialization, and reinforcement learning. In particular, we adopt a comprehensive reward function that addresses multiple aspects of report quality, incorporating medical term matching, uncertainty expression evaluation, and semantic coherence evaluation. Experimental results demonstrate that CURV generates clinically relevant reports with appropriate uncertainty expressions and transparent reasoning traces, significantly outperforming previous methods. CURV represents a substantial advance toward interpretable and trustworthy AI-generated radiology reports, with broader implications for the deployment of vision-language models in high-stakes clinical environments where uncertainty awareness and reasoning transparency are essential.
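As an illustration only, the three reward aspects named in the abstract (medical term matching, uncertainty expression evaluation, and coherence) could be combined as a weighted sum. The sketch below is not the authors' implementation: the vocabularies `MEDICAL_TERMS` and `HEDGE_CUES`, the function `composite_reward`, the weights, and the use of set overlap as a stand-in for semantic coherence are all assumptions made for this example.

```python
import re

# Hypothetical vocabularies for illustration; not taken from the paper.
MEDICAL_TERMS = {"effusion", "pneumothorax", "consolidation", "cardiomegaly", "atelectasis"}
HEDGE_CUES = {"possible", "possibly", "likely", "suggests", "may", "cannot"}


def tokens(text):
    """Lowercase a string and return its set of alphabetic tokens."""
    return set(re.findall(r"[a-z]+", text.lower()))


def f1(pred, ref):
    """F1 overlap between two sets; two empty sets count as a perfect match."""
    if not pred and not ref:
        return 1.0
    overlap = len(pred & ref)
    if overlap == 0:
        return 0.0
    precision = overlap / len(pred)
    recall = overlap / len(ref)
    return 2 * precision * recall / (precision + recall)


def jaccard(a, b):
    """Jaccard similarity between two sets; two empty sets count as identical."""
    union = a | b
    return len(a & b) / len(union) if union else 1.0


def composite_reward(generated, reference, findings, impression,
                     weights=(0.4, 0.3, 0.3)):
    """Weighted sum of three illustrative reward terms (weights are assumed)."""
    gen, ref = tokens(generated), tokens(reference)
    # 1) Medical term matching between generated and reference reports.
    term_score = f1(gen & MEDICAL_TERMS, ref & MEDICAL_TERMS)
    # 2) Agreement of hedging cues as a crude proxy for uncertainty expression.
    uncertainty_score = f1(gen & HEDGE_CUES, ref & HEDGE_CUES)
    # 3) Overlap of medical terms in findings and impression,
    #    a lexical stand-in for semantic coherence.
    coherence_score = jaccard(tokens(findings) & MEDICAL_TERMS,
                              tokens(impression) & MEDICAL_TERMS)
    w_term, w_unc, w_coh = weights
    return w_term * term_score + w_unc * uncertainty_score + w_coh * coherence_score
```

A real system would likely replace the lexical overlaps with learned or clinically validated scorers; the weighted-sum structure is the only point this sketch is meant to convey.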
| Original language | English |
|---|---|
| Title of host publication | 39th Conference on Neural Information Processing Systems, NeurIPS 2025 |
| Publisher | Neural Information Processing Systems Foundation |
| Pages | 1-28 |
| Number of pages | 28 |
| Publication status | Published - 2 Dec 2025 |
| Event | 39th Conference on Neural Information Processing Systems, NeurIPS 2025 - San Diego, United States. Duration: 2 Dec 2025 → 7 Dec 2025. Conference website: https://neurips.cc/Conferences/2025; conference schedule: https://neurips.cc/virtual/2025/loc/san-diego/papers.html; conference proceedings: https://proceedings.neurips.cc/paper_files/paper/2025 |
Conference
| Conference | 39th Conference on Neural Information Processing Systems, NeurIPS 2025 |
|---|---|
| Abbreviated title | NeurIPS 2025 |
| Country/Territory | United States |
| City | San Diego |
| Period | 2/12/25 → 7/12/25 |
UN SDGs
This output contributes to the following UN Sustainable Development Goals (SDGs)
- SDG 3 Good Health and Well-being
Fingerprint
Dive into the research topics of 'CURV: Coherent Uncertainty-Aware Reasoning in Vision-Language Models for X-Ray Report Generation'. Together they form a unique fingerprint.