Published October 30, 2025 | Version v1
Conference paper Open

GRPO-RAD: GROUP RELATIVE POLICY OPTIMIZATION FOR RADIOLOGY REPORT SUMMARIZATION

  • 1. Dalhousie University
  • 2. Vector Institute

Description

Abstract:

We investigate Group Relative Policy Optimization (GRPO) for radiology report summarization using Qwen 3.0 models. GRPO enables optimization of composite reward functions that combine syntactic and semantic measures, addressing limitations of traditional supervised fine-tuning. Our evaluation on MIMIC-III shows that GRPO consistently outperforms both the baseline and supervised fine-tuning approaches across multiple metrics, including ROUGE-L and F1-RadGraph.
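
To make the idea of a composite reward concrete, the following is a minimal sketch in Python, not the authors' implementation. It assumes a weighted sum of a syntactic score (a simple whitespace-token ROUGE-L F-measure) and a semantic score, and then applies the group-relative normalization commonly used in GRPO (each reward minus the group mean, divided by the group standard deviation). The function names, the equal weights, and the example sentences are illustrative assumptions. In particular, `token_set_f1` is only a placeholder for a semantic metric such as F1-RadGraph, which requires a trained entity and relation extractor. The clipped policy-ratio objective and KL regularization of GRPO are not shown.

```python
from statistics import mean, pstdev


def lcs_length(a, b):
    # Longest common subsequence length over two token lists (dynamic programming).
    prev = [0] * (len(b) + 1)
    for x in a:
        curr = [0]
        for j, y in enumerate(b, start=1):
            curr.append(prev[j - 1] + 1 if x == y else max(prev[j], curr[j - 1]))
        prev = curr
    return prev[-1]


def f_measure(overlap, n_candidate, n_reference):
    # Harmonic mean of precision and recall; 0.0 when there is no overlap.
    if overlap == 0 or n_candidate == 0 or n_reference == 0:
        return 0.0
    precision, recall = overlap / n_candidate, overlap / n_reference
    return 2 * precision * recall / (precision + recall)


def rouge_l_f1(candidate, reference):
    # Syntactic score: simplified ROUGE-L on lowercased whitespace tokens.
    c, r = candidate.lower().split(), reference.lower().split()
    return f_measure(lcs_length(c, r), len(c), len(r))


def token_set_f1(candidate, reference):
    # ASSUMPTION: placeholder for a semantic metric such as F1-RadGraph.
    c, r = set(candidate.lower().split()), set(reference.lower().split())
    return f_measure(len(c & r), len(c), len(r))


def composite_reward(candidate, reference, w_syntactic=0.5, w_semantic=0.5):
    # ASSUMPTION: equal weights; the weights used in the paper are not given here.
    return (w_syntactic * rouge_l_f1(candidate, reference)
            + w_semantic * token_set_f1(candidate, reference))


def group_relative_advantages(rewards, eps=1e-8):
    # Normalize each reward against the other samples drawn for the same prompt.
    mu, sigma = mean(rewards), pstdev(rewards)
    return [(r - mu) / (sigma + eps) for r in rewards]


# Hypothetical reference summary and a group of sampled candidate summaries.
reference = "no acute cardiopulmonary process"
group = [
    "no acute cardiopulmonary process",
    "no acute process",
    "heart size is normal",
]
rewards = [composite_reward(c, reference) for c in group]
advantages = group_relative_advantages(rewards)
```

Because the advantages are centred on the group mean, a candidate is reinforced only when it scores better than its siblings for the same report, which is what lets non-differentiable metrics act as a training signal without a separate value model.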

Files

GRPO_Rad-SMASH 2025.pdf

Size: 187.5 kB
md5:e5fde18dc97fe3e3be69378d5ba06dd2

Additional details

Software

Repository URL
https://github.com/FargolN/grpo_rad
Programming language
Python