Skip to content

mk-runner/Awesome-Radiology-Report-Generation

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 
 
 
 
 

Repository files navigation

Awesome-Radiology-Report-Generation

Awesome PRs Welcome

We collect existing papers on radiology report generation published in prominent conferences and journals. If you find this helpful, we kindly ask that you consider citing the following reference.

@misc{cvpr-2025-mlrg,
      title={Enhanced Contrastive Learning with Multi-view Longitudinal Data for Chest X-ray Report Generation}, 
      author={Kang Liu and Zhuoqi Ma and Xiaolu Kang and Yunan Li and Kun Xie and Zhicheng Jiao and Qiguang Miao},
      year={2025},
      eprint={2502.20056},
      archivePrefix={arXiv},
      primaryClass={cs.CV},
      url={https://arxiv.org/abs/2502.20056}, 
}

Table of Contents

Foundation Models for Medicine

  • CheXagent: Towards a Foundation Model for Chest X-Ray Interpretation (arXiv'2401) [paper][code]
  • XrayGPT: Chest Radiographs Summarization using Large Medical Vision-Language Models (ACLW'24)[paper][code]
  • Unlocking the Power of Spatial and Temporal Information in Medical Multimodal Pre-training (ICML'24) [paper][code]
  • A generalist vision--language foundation model for diverse biomedical tasks (Nature Medicine'24)[paper][code]
  • ECAMP: Entity-centered Context-aware Medical Vision Language Pre-training (arXiv'2311)[paper][code]
  • CXR-CLIP: Toward Large Scale Chest X-ray Language-Image Pre-training (MICCAI'23)[paper][code]
  • GLoRIA: A Multimodal Global-Local Representation Learning Framework for Label-efficient Medical Image Recognition (ICCV'21)[paper][code]
  • CXR-LLAVA: a multimodal large language model for interpreting chest X-ray images (arXiv'2310)[paper][code]
  • LLaVA-OneVision: Easy Visual Task Transfer (arXiv'2408)[paper][code]
  • Advancing Medical Radiograph Representation Learning: A Hybrid Pre-training Paradigm with Multilevel Semantic Granularity (arXiv'2410) [paper]
  • MedImageInsight: An Open-Source Embedding Model for General Domain Medical Imaging (arXiv'2410)[paper]
  • BioBridge: Bridging Biomedical Foundation Models via Knowledge Graphs (ICLR'24)[paper][code]
  • Eye-gaze Guided Multi-modal Alignment for Medical Representation Learning (NIPS'24) [paper]

Papers

2025

Nature Medicine'25

  • A generalist medical language model for disease diagnosis assistance [paper][code]

Nature Communications'25

  • Towards a holistic framework for multimodal LLM in 3D brain CT radiology report generation [paper]
  • A clinically accessible small multimodal radiology model and evaluation metric for chest X-ray findings [paper][code]

TPAMI'25

  • Diagnostic Captioning by Cooperative Task Interactions and Sample-graph Consistency [paper]

Nature Computational Science'25

  • Evaluating and Mitigating Bias in AI-Based Medical Text Generation [paper] [code]
  • Toward fair AI-driven medical text generation [paper]

CVPR'25

  • Enhanced Contrastive Learning with Multi-view Longitudinal Data for Chest X-ray Report Generation [paper] [code]
  • FactCheXcker: Mitigating Measurement Hallucinations in Chest X-ray Report Generation Models [paper]
  • DART: Disease-aware Image-Text Alignment and Self-correcting Re-alignment for Trustworthy Radiology Report Generation [paper]
  • CXPMRG-Bench: Pre-training and Benchmarking for X-ray Medical Report Generation on CheXpert Plus Dataset [paper][code]
  • VILA-M3: Enhancing Vision-Language Models with Medical Expert Knowledge [paper][code]
  • Multi-Resolution Pathology-Language Pre-training Model with Text-Guided Visual Representation [paper][code]

AAAI'25

  • Radiology Report Generation via Multi-objective Preference Optimization [paper]
  • HC-LLM: Historical-Constrained Large Language Models for Radiology Report Generation [paper][code]
  • LLM-RG4: Flexible and Factual Radiology Report Generation across Diverse Input Contexts [paper][code]
  • MEPNet: Medical Entity-balanced Prompting Network for Brain CT Report Generation [paper][code]
  • Overcoming Heterogeneous Data in Federated Medical Vision-Language Pre-training: A Triple-Embedding Model Selector Approach [paper][code]
  • DAMPER: A Dual-Stage Medical Report Generation Framework with Coarse-Grained MeSH Alignment and Fine-Grained Hypergraph Matching [paper]

ICML'25

  • MedRAX: Medical Reasoning Agent for Chest X-ray [paper][code]

ACL'25

  • RADAR: Enhancing Radiology Report Generation with Supplementary Knowledge Injection [paper][code]
  • Libra: Leveraging Temporal Images for Biomedical Radiology Analysis [paper][code]

COLING'25

  • KIA: Knowledge-Guided Implicit Vision-Language Alignment for Chest X-Ray Report Generation [paper]
  • CmEAA: Cross-modal Enhancement and Alignment Adapter for Radiology Report Generation [paper]

NAACL'25

  • DDGIP: Radiology Report Generation Through Disease Description Graph and Informed Prompting [paper]
  • VividMed: Vision Language Model with Versatile Visual Grounding for Medicine [paper][code]
  • Fact-aware multimodal retrieval augmentation for accurate medical radiology report generation [paper][code]

ICASSP'25

  • A Novel Single Continuous Shot Multiple Lesions Endoscopy Report Generation [paper]
  • CoMT: Chain-of-Medical-Thought Reduces Hallucination in Medical Report Generation [paper][code]

ISBI'25

  • R2Gen-Mamba: A Selective State Space Model for Radiology Report Generation [paper][code]
  • Prompt-Guided Radiology Report Generation Utilizing SAM [paper]

Radiology'25

  • Privacy-ensuring Open-weights Large Language Models Are Competitive with Closed-weights GPT-4o in Extracting Chest Radiography Findings from Free-Text Reports [paper]
  • Diagnostic accuracy and clinical value of a domain-specific multimodal generative AI model for chest radiograph report generation [paper]
  • RadSearch, a Semantic Search Model for Accurate Radiology Report Retrieval with Large Language Model Integration [paper]

TMM'25

  • Adaptive Medical Topic Learning for Enhanced Fine-grained Cross-modal Alignment in Medical Report Generation[paper]

TIP'25

  • Cross-Modal Causal Representation Learning for Radiology Report Generation [paper][code]

TMI'25

  • Spatio-Temporal and Retrieval-Augmented Modeling for Chest X-Ray Report Generation [paper][code]
  • Unlocking the Potential of Weakly Labeled Data: A Co-Evolutionary Learning Framework for Abnormality Detection and Report Generation [paper][code]
  • Large Language Model with Region-guided Referring and Grounding for CT Report Generation [[paper][code]

MedIA'25

  • Report is a mixture of topics: Topic-guided radiology report generation [paper][code]

Information Fusion'25

  • Enhancing discriminative ability in multimodal LLMs: A contrastive learning approach for CT report generation [paper]

ESWA'25

  • Recalibrated cross-modal alignment network for radiology report generation with weakly supervised contrastive learning [paper]
  • HKRG: Hierarchical knowledge integration for radiology report generation [paper][code]
  • RRGMambaFormer: A hybrid Transformer-Mamba architecture for radiology report generation [paper][code]

KBS'25

  • Context-enhanced framework for medical image report generation using multimodal contexts [paper][code]
  • Abnormal-region-aware Multi-modal Feature Fusion for medical report generation [paper]

JBHI'25

  • Adapter-Enhanced Hierarchical Cross-Modal Pre-training for Lightweight Medical Report Generation [paper][code]
  • Benchmarking Radiology Report Generation from Noisy Free-Texts [paper]

Pattern Recognition Letters'25

  • Integrating clinical knowledge and imaging for medical report generation [paper]

WWW Companion'25

  • Diversity-Augmented Diffusion Network With LLM Assistance For Radiology Report Generation [paper]

arXiv'25

  • GIT-CXR: End-to-End Transformer for Chest X-Ray Report Generation [paper]
  • Activating Associative Disease-Aware Vision Token Memory for LLM-Based X-ray Report Generation [paper][code]
  • RadAlign: Advancing Radiology Report Generation with Vision-Language Concept Alignment [paper]
  • MedRAX: Medical Reasoning Agent for Chest X-ray [paper][code]
  • Libra: Leveraging Temporal Images for Biomedical Radiology Analysis [paper][code]
  • On the Importance of Text Preprocessing for Multimodal Representation Learning and Pathology Report Generation [paper]
  • CoCa-CXR: Contrastive Captioners Learn Strong Temporal Structures for Chest X-Ray Vision-Language Understanding [paper]
  • CheXalign: Preference fine-tuning in chest X-ray interpretation models without human feedback [paper]
  • GEMA-Score: Granular Explainable Multi-Agent Score for Radiology Report Evaluation [paper]
  • DAgent: A Relational Database-Driven Data Analysis Report Generation Agent [paper]
  • LVMedR2: Perception and Reflection-driven Complex Reasoning for Medical Report Generation [paper]
  • MedM-VL: What Makes a Good Medical LVLM? [paper][code]
  • Leveraging LLMs for Multimodal Retrieval-Augmented Radiology Report Generation via Key Phrase Extraction [paper]
  • DualPrompt-MedCap: A Dual-Prompt Enhanced Approach for Medical Image Captioning [paper]
  • DART: Disease-aware Image-Text Alignment and Self-correcting Re-alignment for Trustworthy Radiology Report Generation [paper]
  • CRG Score: A Distribution-Aware Clinical Metric for Radiology Report Generation [paper]
  • Reason Like a Radiologist: Chain-of-Thought and Reinforcement Learning for Verifiable Report Generation [paper]
  • MedPlan: A Two-Stage RAG-Based System for Personalized Medical Plan Generation [paper]
  • CBM-RAG: Demonstrating Enhanced Interpretability in Radiology Report Generation with Multi-Agent RAG and Concept Bottleneck Models [paper]
  • ChestX-Reasoner: Advancing Radiology Foundation Models with Reasoning through Step-by-Step Verification [paper]
  • MicarVLMoE: A Modern Gated Cross-Aligned Vision-Language Mixture of Experts Model for Medical Image Captioning and Report Generation [paper]
  • Large-scale Chest Disease Diagnosis Enabled by Multimodal Large Language Models with Self-Supervised Fine-Tuning [paper]
  • RadRevise: A Benchmark Dataset for Instruction-Based Radiology Report Editing [paper]
  • Evaluating Vision Language Model Adaptations for Radiology Report Generation in Low-Resource Languages [paper]
  • AOR: Anatomical Ontology-Guided Reasoning for Medical Large Multimodal Model in Chest X-Ray Interpretation [paper][code]
  • DDaTR: Dynamic Difference-aware Temporal Residual Network for Longitudinal Radiology Report Generation [paper][code]
  • CheXLearner: Text-Guided Fine-Grained Representation Learning for Progression Detection [paper]
  • Describe Anything in Medical Images [paper]
  • A Multimodal Multi-Agent Framework for Radiology Report Generation [paper]
  • Ultrasound Report Generation with Multimodal Large Language Models for Standardized Texts [paper]
  • CorBenchX: Large-Scale Chest X-Ray Error Dataset and Vision–Language Model Benchmark for Report Error Correction [paper][code]
  • Online Iterative Self-Alignment for Radiology Report Generation [paper]
  • CXReasonBench: A Benchmark for Evaluating Structured Diagnostic Reasoning in Chest X-rays [paper][code]
  • CLEAR: A Clinically-Grounded Tabular Framework for Radiology Report Evaluation [paper]
  • Grounding Chest X-Ray Visual Question Answering with Generated Radiology Reports [paper]

2024

Nature Medicine'24

  • Collaboration between clinicians and vision–language models in radiology report generation [paper]
  • A Generalist Vision-Language Foundation Model for Diverse Biomedical Tasks [paper][code]

NEJM AI'24

  • Towards Generalist Biomedical AI [paper][code]

AAAI'24

  • Automatic Radiology Reports Generation via Memory Alignment Network [paper]
  • PromptMRG: Diagnosis-Driven Prompts for Medical Report Generation [paper][code]
  • Bootstrapping Large Language Models for Radiology Report Generation [paper] [code]

CVPR'24

  • Instance-level Expert Knowledge and Aggregate Discriminative Attention for Radiology Report Generation [paper] [code]
  • AHIVE: Anatomy-aware Hierarchical Vision Encoding for Interactive Radiology Report Retrieval [paper] [[code]]
  • InVERGe: Intelligent Visual Encoder for Bridging Modalities in Report Generation (Workshop) [paper][code]
  • MedM2G: Unifying Medical Multi-Modal Generation via Cross-Guided Diffusion with Visual Invariant [paper]

ACL'24

  • DocLens: Multi-aspect Fine-grained Evaluation for Medical Text Generation [paper][code]
  • SICAR at RRG2024: GPU Poor’s Guide to Radiology Report Generation [paper]
  • BiCAL: Bi-directional Contrastive Active Learning for Clinical Report Generation [paper]
  • CID at RRG24: Attempting in a Conditionally Initiated Decoding of Radiology Report Generation with Clinical Entities [paper]
  • RadGraph-XL: A Large-Scale Expert-Annotated Dataset for Entity and Relation Extraction from Radiology Reports [paper][code]
  • MLeVLM: Improve Multi-level Progressive Capabilities based on Multimodal Large Language Model for Medical Visual Question Answering [paper][code]
  • Fine-Grained Image-Text Alignment in Medical Imaging Enables Explainable Cyclic Image-Report Generation [paper]

ICLR'24

  • LLM-CXR: Instruction-Finetuned LLM for CXR Image Understanding and Generation [paper][code]

NIPS'24

  • BenchX: A Unified Benchmark Framework for Medical Vision-Language Pretraining on Chest X-Rays [paper][code]
  • JRadiEvo: A Japanese Radiology Report Generation Model Enhanced by Evolutionary Optimization of Model Merging (NIPS Workshop)[paper]
  • Electrocardiogram Report Generation and Question Answering via Retrieval-Augmented Self-Supervised Modeling (NIPS Workshop)[paper]
  • Uni-Med: A Unified Medical Generalist Foundation Model For Multi-Task Learning Via Connector-MoE [paper][code]
  • MediQ: Question-Asking LLMs for Adaptive and Reliable Clinical Reasoning [paper][code]

ACM MM'24

  • Medical Report Generation via Multimodal Spatio-Temporal Fusion [paper]
  • Diffusion Networks with Task-Specific Noise Control for Radiology Report Generation [paper]
  • Divide and Conquer: Isolating Normal-Abnormal Attributes in Knowledge Graph-Enhanced Radiology Report Generation [paper][code]
  • In-context Learning for Zero-shot Medical Report Generation [paper]

ECCV'24

  • HERGen: Elevating Radiology Report Generation with Longitudinal Data [paper] [code]
  • Contrastive Learning with Counterfactual Explanations for Radiology Report Generation [paper]
  • ChEX: Interactive Localization and Region Description in Chest X-rays[paper][code]
  • MedRAT: Unpaired Medical Report Generation via Auxiliary Tasks[paper][code]

EMNLP'24

  • ICON: Improving Inter-Report Consistency of Radiology Report Generation via Lesion-aware Mix-up Augmentation [paper] [code]
  • Divide and Conquer Radiology Report Generation via Observation Level Fine-grained Pretraining and Prompt Tuning [paper]

MICCAI'24

  • Textual Inversion and Self-supervised Refinement for Radiology Report Generation [paper] [[code]]
  • Structural Entities Extraction and Patient Indications Incorporation for Chest X-ray Report Generation [paper] [code]
  • CT2Rep: Automated Radiology Report Generation for 3D Medical Imaging [paper] [code]
  • WsiCaption: Multiple Instance Generation of Pathology Reports for Gigapixel Whole Slide Images [paper][code]
  • HistGen: Histopathology Report Generation via Local-Global Feature Encoding and Cross-modal Context Interaction [paper][data].
  • Multivariate Cooperative Game for Image-Report Pairs: Hierarchical Semantic Alignment for Medical Report Generation [paper]
  • MRScore: Evaluating Medical Report with LLM-Based Reward System [paper]
  • Energy-Based Controllable Radiology Report Generation with Medical Knowledge [paper]
  • GMoD: Graph-driven Momentum Distillation Framework with Active Perception of Disease Severity for Radiology Report Generation [paper][code]
  • TiBiX: Leveraging Temporal Information for Bidirectional X-ray and Report Generation (MICCAI Workshop)[paper][code]
  • Multivariate Cooperative Game for Image-Report Pairs: Hierarchical Semantic Alignment for Medical Report Generation [paper]
  • KARGEN: Knowledge-Enhanced Automated Radiology Report Generation Using Large Language Models [paper]
  • Continually Tuning a Large Language Model for Multi-domain Radiology Report Generation [paper]

CIKM'24

  • CLR2G: Cross-modal Contrastive Learning on Radiology Report [paper]

ICASSP'24

  • Improving Radiology Report Generation with D2-Net: When Diffusion Meets Discriminator [paper]

WACV'24

  • Complex Organ Mask Guided Radiology Report Generation [paper][code]
  • CXR-IRGen: An Integrated Vision and Language Model for the Generation of Clinically Accurate Chest X-Ray Image-Report Pairs [paper][code]

ACCV'24

  • FG-CXR: A Radiologist-Aligned Gaze Dataset for Enhancing Interpretability in Chest X-Ray Report Generation [paper][code]

ML4H'24

  • MedAutoCorrect: Image-Conditioned Autocorrection in Medical Reporting [paper]

MedIA'24

  • From Vision to Text: A Comprehensive Review of Natural Image Captioning in Medical Diagnosis and Radiology Report Generation [paper]
  • Enhancing the vision–language foundation model with key semantic knowledge-emphasized report refinement [paper]
  • DACG: Dual Attention and Context Guidance Model for Radiology Report Generation [paper][code]
  • Dual-Modality Visual Feature Flow for Medical Report Generation [paper]

TMI'24

  • Multi-grained Radiology Report Generation with Sentence-level Image-language Contrastive Learning [paper] [[code]]
  • SGT++: Improved Scene Graph-Guided Transformer for Surgical Report Generation [paper][[code]]
  • PhraseAug: An Augmented Medical Report Generation Model with Phrasebook [paper] [[code]]
  • Token-Mixer: Bind Image and Text in One Embedding Space for Medical Image Reporting [paper] [code]
  • An Organ-aware Diagnosis Framework for Radiology Report Generation [paper]
  • Attribute Prototype-guided Iterative Scene Graph for Explainable Radiology Report Generation [paper]
  • A New Benchmark: Clinical Uncertainty and Severity Aware Labeled Chest X-Ray Images with Multi-Relationship Graph Learning [paper]
  • LHR-RFL: Linear Hybrid-Reward based Reinforced Focal Learning for Automatic Radiology Report Generation [paper]
  • Unlocking the Potential of Weakly Labeled Data: A Co-Evolutionary Learning Framework for Abnormality Detection and Report Generation [paper][code]

TMM'24

  • Semi-Supervised Medical Report Generation via Graph-Guided Hybrid Feature Consistency [paper][[code]]
  • Multi-Level Objective Alignment Transformer for Fine-Grained Oral Panoramic X-Ray Report Generation [paper][[code]]
    • Knowledge-guided Cross-modal Alignment and Progressive Fusion for Chest X-ray Report Generation [paper]

JBHI'24

  • CAMANet: Class Activation Map Guided Attention Network for Radiology Report Generation [paper] [code]
  • TSGET: Two-Stage Global Enhanced Transformer for Automatic Radiology Report Generation [paper] [code]
  • Eye Gaze Guided Cross-Modal Alignment Network for Radiology Report Generation [paper]

Expert Systems with Applications'24

  • CheXReport: A transformer-based architecture to generate chest X-ray reports suggestions [paper][code]
  • ChatGPT based contrastive learning for radiology report summarization [paper][code]

Knowledge-Based Systems'24

  • Automatic medical report generation combining contrastive learning and feature difference [paper]
  • Context-enhanced framework for medical image report generation using multimodal contexts [paper][[code]

Neurocomputing'24

  • Improving radiology report generation with multi-grained abnormality prediction [paper]
  • An open chest X-ray dataset with benchmarks for automatic radiology report generation in French [paper][data]
  • Trust it or not: Confidence-guided automatic radiology report generation [paper]
  • VG-CALF: A vision-guided cross-attention and late-fusion network for radiology images in medical visual question answering [paper]

Academic Radiology'24

  • Practical Evaluation of ChatGPT Performance for Radiology Report Generation [paper]

Radiology'24

  • Constructing a Large Language Model to Generate Impressions from Findings in Radiology Reports [paper]
  • Comparing Commercial and Open-Source Large Language Models for Labeling Chest Radiograph Reports [paper]

IEEE Transactions on Emerging Topics in Computational Intelligence'24

  • End-to-End Clustering Enhanced Contrastive Learning for Radiology Reports Generation [paper]

arXiv papers'24

  • Factual Serialization Enhancement: A Key Innovation for Chest X-ray Report Generation [paper] [code]
  • FITA: Fine-grained Image-Text Aligner for Radiology Report Generation [paper] [[code]]
  • GREEN: Generative Radiology Report Evaluation and Error Notation [paper] [[code]]
  • CheXpert Plus: Hundreds of Thousands of Aligned Radiology Texts, Images and Patients [paper] [code]
  • Topicwise Separable Sentence Retrieval for Medical Report Generation [paper] [[code]]
  • Dia-LLaMA: Towards Large Language Model-driven CT Report Generation [paper] [[code]]
  • MAIRA-2: Grounded Radiology Report Generation [paper][[code]]
  • Benchmarking and Boosting Radiology Report Generation for 3D High-Resolution Medical Images [paper]
  • The Impact of Auxiliary Patient Data on Automated Chest X-Ray Report Generation and How to Incorporate It [paper][code]
  • Improving Expert Radiology Report Summarization by Prompting Large Language Models with a Layperson Summary [paper]
  • Fact-Aware Multimodal Retrieval Augmentation for Accurate Medical Radiology Report Generation [paper]
  • X-ray Made Simple: Radiology Report Generation and Evaluation with Layman's Terms [paper]
  • Multi-modal vision-language model for generalizable annotation-free pathology localization and clinical diagnosis [paper][code]
  • Direct Preference Optimization for Suppressing Hallucinated Prior Exams in Radiology Report Generation [paper]]
  • R2GenCSR: Retrieving Context Samples for Large Language Model based X-ray Medical Report Generation [paper][code]
  • Direct Preference Optimization for Suppressing Hallucinated Prior Exams in Radiology Report Generation [paper]
  • M4CXR: Exploring Multi-task Potentials of Multi-modal Large Language Models for Chest X-ray Interpretation [paper]
  • Medical Report Generation Is A Multi-label Classification Problem [paper]
  • KARGEN: Knowledge-enhanced Automated Radiology Report Generation Using Large Language Models [paper]
  • Democratizing MLLMs in Healthcare: TinyLLaVA-Med for Efficient Healthcare Diagnostics in Resource-Constrained Settings [paper]
  • SLaVA-CXR: Small Language and Vision Assistant for Chest X-ray Report Automation [paper]
  • Expert-level vision-language foundation model for real-world radiology and comprehensive evaluation [paper]
  • CXPMRG-Bench: Pre-training and Benchmarking for X-ray Medical Report Generation on CheXpert Plus Dataset [paper][code]
  • 3D-CT-GPT: Generating 3D Radiology Reports through Integration of Large Vision-Language Models [paper]
  • Image-aware Evaluation of Generated Medical Reports [paper]
  • Text-Enhanced Medical Visual Question Answering [paper]
  • MMed-RAG: Versatile Multimodal RAG System for Medical Vision Language Models [paper][code]
  • R2GEN-MAMBA:ASELECTIVESTATESPACEMODELFORRADIOLOGYREPORT GENERATION [paper][code]
  • Uncovering Knowledge Gaps in Radiology Report Generation Models through Knowledge Graphs[paper][code]
  • Diff-CXR: Report-to-CXR generation through a disease-knowledge enhanced diffusion model [paper]
  • FINE-GRAINED VERIFIERS: PREFERENCE MODELING AS NEXT-TOKEN PREDICTION IN VISION-LANGUAGE ALIGNMENT [paper]
  • Decoding Report Generators: A Cyclic Vision-Language Adapter for Counterfactual Explanations [paper]
  • MCL: Multi-view Enhanced Contrastive Learning for Chest X-ray Report Generation [paper][code]
  • Anatomy-Guided Radiology Report Generation with Pathology-Aware Regional Prompts [paper]
  • MAIRA-Seg: Enhancing Radiology Report Generation with Segmentation-Aware Multimodal Large Language Models [paper]
  • TRRG: Towards Truthful Radiology Report Generation With Cross-modal Disease Clue Enhanced Large Language Model [paper]
  • ORID: Organ-Regional Information Driven Framework for Radiology Report Generation [paper]
  • ReXrank: A Public Leaderboard for AI-Powered Radiology Report Generation [paper][code]
  • Uncovering Knowledge Gaps in Radiology Report Generation Models through Knowledge Graphs [paper][code]
  • LaB-RAG: Label Boosted Retrieval Augmented Generation for Radiology Report Generation [paper]
  • Large Language Model with Region-guided Referring and Grounding for CT Report Generation [[paper]
  • Improving Factuality of 3D Brain MRI Report Generation with Paired Image-domain Retrieval and Text-domain Augmentation [paper]
  • MvKeTR: Chest CT Report Generation with Multi-View Perception and Knowledge Enhancement [paper]
  • MMedPO: Aligning Medical Vision-Language Models with Clinical-Aware Multimodal Preference Optimization [paper][code]
  • Semantic Consistency-Based Uncertainty Quantification for Factuality in Radiology Report Generation [paper]
  • Foundation Models in Radiology: What, How, When, Why and Why Not [paper]
  • A Generalist Learner for Multifaceted Medical Image Interpretation [paper]
  • M4CXR: Exploring Multi-task Potentials of Multi-modal Large Language Models for Chest X-ray Interpretation [paper]
  • The Impact of AI Assistance on Radiology Reporting: A Pilot Study Using Simulated AI Draft Reports [paper]
  • DAMPER: A Dual-Stage Medical Report Generation Framework with Coarse-Grained MeSH Alignment and Fine-Grained Hypergraph Matching [paper]
  • X-ray Made Simple: Radiology Report Generation and Evaluation with Layman’s Terms [paper][code]

2023

ICLR'23

  • Advancing radiograph representation learning with masked record modeling [paper][code]

AAAI'23

CVPR'23

  • KiUT: Knowledge-injected U-Transformer for Radiology Report Generation [paper] [[code]]
  • METransformer: Radiology report generation by transformer with multiple learnable expert tokens [paper][[code]]
  • Dynamic Graph Enhanced Contrastive Learning for Chest X-Ray Report Generation [paper] [code]
  • Interactive and Explainable Region-guided Radiology Report Generation [paper][code]

ICCV'23

  • Unify, Align and Refine: Multi-Level Semantic Alignment for Radiology Report Generation [paper]

ACL'23

  • ORGAN: Observation-Guided Radiology Report Generation via Tree Reasoning [paper] [code]

EMNLP'23

  • RECAP: Towards Precise Radiology Report Generation via Dynamic Disease Progression Reasoning [paper] [code]
  • Normal-Abnormal Decoupling Memory for Medical Report Generation [paper] [code]
  • Style-Aware Radiology Report Generation with RadGraph and Few-Shot Prompting [paper] [[code]]
  • PhenotypeCLIP: Phenotype-based Contrastive Learning for Medical Imaging Report Generation [paper]

MICCAI'23

  • Utilizing Longitudinal Chest X-Rays and Reports to Pre-Fill Radiology Reports [paper] [code]

BIBM'23

ML4H'23

  • Pragmatic Radiology Report Generation [paper] [code]

ICASSP'23

  • Improving Radiology Report Generation with D 2-Net: When Diffusion Meets Discriminator [paper] [[code]]

MedIA'23

  • Radiology report generation with a learned knowledge base and multi-modal alignment [paper] [code]

TMI'23

  • Attributed Abnormality Graph Embedding for Clinically Accurate X-Ray Report Generation [paper][[code]]

Patterns'23

  • Evaluating progress in automatic chest X-ray radiology report generation[paper][code]

TMM'23

  • From Observation to Concept: A Flexible Multi-view Paradigm for Medical Report Generation [paper] [[code]]
  • Joint Embedding of Deep Visual and Semantic Features for Medical Image Report Generation [paper] [[code]]

Radiology'23

  • Leveraging GPT-4 for Post Hoc Transformation of Free-text Radiology Reports into Structured Reporting: A Multilingual Feasibility Study [paper]

Meta-Radiology'23

  • R2gengpt: Radiology report generation with frozen llms [paper][code]

arXiv papers'23

  • MAIRA-1: A specialised large multimodal model for radiology report generation [paper] [[code]]
  • Longitudinal Data and a Semantic Similarity Reward for Chest X-Ray Report Generation [paper][code]

2022

AAAI'22

  • Clinical-BERT: Vision-Language Pre-training for Radiograph Diagnosis and Reports Generation [paper] [[code]]

ACL'22

  • Reinforced Cross-modal Alignment for Radiology Report Generation [paper] [code]

MICCAI'22

  • A Medical Semantic-Assisted Transformer for Radiographic Report Generation [paper] [code]
  • CheXRelNet An Anatomy-Aware Model for Tracking Longitudinal Relationships Between Chest X-Rays [paper][code]

Nature Machine Intelligence'22

  • Generalized radiograph representation learning via cross-supervision between images and free-text radiology reports [paper][code]

MedIA'22

  • Knowledge matters: Chest radiology report generation with general and specific knowledge [paper] [code]

TMI'22

  • Automated Radiographic Report Generation Purely on Transformer: A Multicriteria Supervised Approach [paper] [[code]]

2021

ACL'21

  • Cross-modal Memory Networks for Radiology Report Generation [paper] [code]

EMNLP'21

  • Progressive Transformer-Based Generation of Radiology Reports [paper] [code]

NAACL'21

  • Improving Factual Completeness and Consistency of Image-to-Text Radiology Report Generation [paper] [code]

2020

AAAI'20

  • When Radiology Report Generation Meets Knowledge Graph [paper] [code]

EMNLP'20

  • Generating Radiology Reports via Memory-driven Transformer [paper] [code]

Survey

  • A Systematic Review of Deep Learning-based Research on Radiology Report Generation (arXiv 2311) [paper]
  • A Survey of Deep Learning-based Radiology Report Generation Using Multimodal Data (arXiv 2405) [paper]
  • Automated Radiology Report Generation: A Review of Recent Advances (IEEE Reviews in Biomedical Engineering'24) [paper]
  • From Vision to Text: A Comprehensive Review of Natural Image Captioning in Medical Diagnosis and Radiology Report Generation (Medical Image Analysis'24)[paper]
  • Automatic Medical Report Generation: Methods and Applications (arXiv'2408) [paper]
  • Automatic medical report generation based on deep learning: A state of the art survey (Computerized Medical Imaging and Graphics'25)[paper]
  • A survey of deep-learning-based radiology report generation using multimodal inputs (Medical Image Analysis'25)[paper]

Dataset

  • MIMIC-CXR-JPG, a large publicly available database of labeled chest radiographs (MIMIC-CXR) [paper][data].
  • Preparing a collection of radiology examinations for distribution and retrieval (IU X-ray) [paper][data].
  • Learning Visual-Semantic Embeddings for Reporting Abnormal Findings on Chest X-rays (MIMIC-ABN) [paper][code]
  • An efficient but effective writer: Diffusion-based semi-autoregressive transformer for automated radiology report generation (XRG-COVID-19) [paper][data].
  • HistGen: Histopathology Report Generation via Local-Global Feature Encoding and Cross-modal Context Interaction (HistGen WSI) [paper][data].
  • CheXpert Plus: Hundreds of Thousands of Aligned Radiology Texts, Images and Patients (CheXpert Plus) [paper] [data]
  • CXR-PRO: MIMIC-CXR with Prior References Omitted (CXR-PRO) [data]
  • MS-CXR: Making the Most of Text Semantics to Improve Biomedical Vision-Language Processing (MS-CXR) [data]
  • EHRXQA: A Multi-Modal Question Answering Dataset for Electronic Health Records with Chest X-ray Images (EHRXQA)[paper][code][data]
  • MIMIC-Ext-MIMIC-CXR-VQA: A Complex, Diverse, And Large-Scale Visual Question Answering Dataset for Chest X-ray Images (MIMIC-Ext-MIMIC-CXR-VQA)[code][data]
  • MS-CXR-T: Learning to Exploit Temporal Structure for Biomedical Vision-Language Processing (MS-CXR-T)[data]
  • CAD-Chest: Comprehensive Annotation of Diseases based on MIMIC-CXR Radiology Report (CAD-Chest)[data][paper][code]
  • VinDr-CXR: An open dataset of chest X-rays with radiologist annotations (VinDr-CXR)[data]
  • Chest ImaGenome Dataset (ImaGenome) [data]
  • Interpretable medical image Visual Question Answering via multi-modal relationship graph learning (Medical-CXR-VQA) [MedIA'24][code]
  • Medical-Diff-VQA: A Large-Scale Medical Dataset for Difference Visual Question Answering on Chest X-Ray Images (Medical-Diff-VQA) [data][code]
  • ReXPref-Prior: A MIMIC-CXR Preference Dataset for Reducing Hallucinated Prior Exams in Radiology Report Generation (ReXPref-Prior)[data]
  • An open chest X-ray dataset with benchmarks for automatic radiology report generation in French (CASIA-CXR) [Neurocomputing'24] [data][paper]
  • PathMMU: A Massive Multimodal Expert-Level Benchmark for Understanding and Reasoning in Pathology (WSI-VQA)[arXiv'2401][paper][data]
  • MIMIC-Eye: Integrating MIMIC Datasets with REFLACX and Eye Gaze for Multimodal Deep Learning Applications (MIMIC-Eye)[data][code]
  • PadChest-GR: A Bilingual Chest X-ray Dataset for Grounded Radiology Report Generation (PadChest-GR)[data][paper]
  • GEMeX: A Large-Scale, Groundable, and Explainable Medical VQA Benchmark for Chest X-ray Diagnosis (GEMeX)[paper][project]
  • Computed-Tomography-Report-Generation-Datasets (CTRG), including CTRG-Brain-263K and CTRG-Chest-548K [data]
  • Multi-view CXR: A large-scale multi-view benchmark for chest X-ray report generation (Multi-view CXR) [data][paper]
  • CheXmask Database: a large-scale dataset of anatomical segmentation masks for chest x-ray images (CheXmask)[data]
  • LLaVA-Rad MIMIC-CXR [data]
  • RaDialog Instruct Dataset [data][paper]
  • M3D-Cap [data]
  • ReXErr-v1: Clinically Meaningful Chest X-Ray Report Errors Derived from MIMIC-CXR [data]
  • ReXGradient-160K: A Large-Scale Publicly Available Dataset of Chest Radiographs with Free-text Reports [paper][data]

Metrics

  • FineRadScore: A Radiology Report Line-by-Line Evaluation Technique Generating Corrections with Severity Scores (arXiv'2405) [paper][code]
  • FActScore: Fine-grained Atomic Evaluation of Factual Precision in Long Form Text Generation (EMNLP'23) [paper][code]
  • DocLens: Multi-aspect Fine-grained Evaluation for Medical Text Generation (ACL'24) [paper][code]
  • RaTEScore: A Metric for Radiology Report Generation (EMNLP'24) [paper][code][PyPI]
  • GREEN: Generative Radiology Report Evaluation and Error Notation [paper][code]
  • When Radiology Report Generation Meets Knowledge Graph (MIRQI) [paper][code]
  • Evaluating progress in automatic chest X-ray radiology report generation (RadCliQ)[paper][code]
  • Evaluating GPT-4 on Impressions Generation in Radiology Reports (Radiology)[paper]
  • ReXamine-Global: A Framework for Uncovering Inconsistencies in Radiology Report Generation Metrics (arXiv'2408)[paper]
  • MRScore: Evaluating Medical Report with LLM-Based Reward System (MICAAI'24) [paper]
  • ER2Score: LLM-based Explainable and Customizable Metric for Assessing Radiology Reports with Reward-Control Loss (arXiv'2411)[paper]
  • FactCheXcker: Mitigating Measurement Hallucinations in Chest X-ray Report Generation Models (arXiv'2411)[paper]
  • A clinically accessible small multimodal radiology model and evaluation metric for chest X-ray findings (CheXprompt)[paper][code]

Other Resources

  • Learning to Exploit Temporal Structure for Biomedical Vision–Language Processing (CVPR'23) [paper[code]
  • Investigating and Mitigating Object Hallucinations in Pretrained Vision-Language (CLIP) Models [paper][code]

Tools

Feel free to reach out to me if you find any interesting papers missing.

email: kangliu422@gmail.com WeChat: kangliu422

About

paper list, dataset, and tools for radiology report generation

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published