Large Model driven Radiology Report Generation with Clinical Quality Reinforcement Learning

Zhou, Zijian; Shi, Miaojing; Wei, Meng; Alabi, Oluwatosin; Yue, Zijie; Vercauteren, Tom

Computer Science > Computer Vision and Pattern Recognition

arXiv:2403.06728 (cs)

[Submitted on 11 Mar 2024]

Title:Large Model driven Radiology Report Generation with Clinical Quality Reinforcement Learning

Authors:Zijian Zhou, Miaojing Shi, Meng Wei, Oluwatosin Alabi, Zijie Yue, Tom Vercauteren

View PDF HTML (experimental)

Abstract:Radiology report generation (RRG) has attracted significant attention due to its potential to reduce the workload of radiologists. Current RRG approaches are still unsatisfactory against clinical standards. This paper introduces a novel RRG method, \textbf{LM-RRG}, that integrates large models (LMs) with clinical quality reinforcement learning to generate accurate and comprehensive chest X-ray radiology reports. Our method first designs a large language model driven feature extractor to analyze and interpret different regions of the chest X-ray image, emphasizing specific regions with medical significance. Next, based on the large model's decoder, we develop a multimodal report generator that leverages multimodal prompts from visual features and textual instruction to produce the radiology report in an auto-regressive way. Finally, to better reflect the clinical significant and insignificant errors that radiologists would normally assign in the report, we introduce a novel clinical quality reinforcement learning strategy. It utilizes the radiology report clinical quality (RadCliQ) metric as a reward function in the learning process. Extensive experiments on the MIMIC-CXR and IU-Xray datasets demonstrate the superiority of our method over the state of the art.

Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2403.06728 [cs.CV]
	(or arXiv:2403.06728v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2403.06728

Submission history

From: Zijian Zhou [view email]
[v1] Mon, 11 Mar 2024 13:47:11 UTC (558 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:Large Model driven Radiology Report Generation with Clinical Quality Reinforcement Learning

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:Large Model driven Radiology Report Generation with Clinical Quality Reinforcement Learning

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators