A curated list of medical LLMs, multimodal systems, datasets, benchmarks, and more. 🏥
-
Updated
Mar 24, 2026
A curated list of medical LLMs, multimodal systems, datasets, benchmarks, and more. 🏥
Reasoning as the Engine: The Evolution from Medical LLMs to Versatile Medical Agents
Official Codebase for "ER-Reason: A Benchmark Dataset for LLM-Based Clinical Reasoning in the Emergency Room"
CURE-Med: Curriculum-Informed Reinforcement Learning for Multilingual Medical Reasoning
Evaluating the reasoning ability of LLMs specifically within the biomedical field - analyzing the reasoning and factual information generation of medical LLMs
The source code of paper: A vision-language pretrained transformer for versatile clinical respiratory disease applications
Foundation Models for Genomics & Transcriptomics
MedEvalKit: A Unified Medical Evaluation Framework
整理开源的中文大语言模型,以规模较小、可私有化部署、训练成本较低的模型为主,包括底座模型,垂直领域微调及应用,数据集与教程等。
🎯 Benchmark retrieval systems across video, image, audio, and documents with standardized datasets and queries for regulated domains.
Add a description, image, and links to the medical-llms topic page so that developers can more easily learn about it.
To associate your repository with the medical-llms topic, visit your repo's landing page and select "manage topics."