# verifiable-rewards

Here are 4 public repositories matching this topic...

DeepGym / deepgym

RL training environments with verifiable rewards for coding agents. Works with TRL, Unsloth, verl, OpenRLHF.

python machine-learning reinforcement-learning deep-learning sandbox evaluation rl code-execution ai-agents daytona llm unsloth coding-agents grpo verifiable-rewards openrlhf reward-function grpo-training

Updated Apr 24, 2026
Python

AlaaLab / Dr-LLaVA

[ NeurIPS MAR 2024 ] Official Codebase for "Dr-LLaVA: Visual Instruction Tuning with Symbolic Clinical Grounding"

post-training reinforcement-learning-from-human-feedback multimodal-large-language-models verifiable-rewards medical-vlms

Updated Dec 4, 2024
Python

Think-a-Tron / zeno

Verifiable RL rewards for LLMs

reinforcement-learning rewards verifiable-rewards

Updated May 20, 2025
Python

mgkim1976-spec / theme_radar

Finance-domain LLM-Wiki: a system that automatically extracts, validates, and compounds methodologies from YouTube investing channels. Karpathy LLM-Wiki + Verifiable Rewards (forward returns).

finance methodology quantitative-finance knowledge-base obsidian karpathy youtube-analysis verifiable-rewards korean-stocks llm-wiki compounding-knowledge forward-validation

Updated Apr 30, 2026
Python
