# grpo-training

Here are 4 public repositories matching this topic...

vivoCameraResearch / SmartPhotoCrafter

Official GitHub code for "SmartPhotoCrafter: Unified Reasoning, Generation and Optimization for Automatic Photographic Image Editing"

Updated Apr 27, 2026
Python

winstonsmith1897 / GTPO

Group-relative Trajectory-based Policy Optimization: Increasing Quality and Training Stability

reinforcement-learning reinforcement-learning-algorithms train fine post-training llm rlhf grpo-training

Updated Feb 23, 2026
Jupyter Notebook

DeepGym / deepgym

RL training environments with verifiable rewards for coding agents. Works with TRL, Unsloth, verl, OpenRLHF.

python machine-learning reinforcement-learning deep-learning sandbox evaluation rl code-execution ai-agents daytona llm unsloth coding-agents grpo verifiable-rewards openrlhf reward-function grpo-training

Updated Apr 24, 2026
Python

Vidit-Ostwal / price-negotiation-rl-OpenEnv

python machine-learning reinforcement-learning rl rl-environment openenv grpo-training price-negotiator openenv-environment

Updated Apr 12, 2026
Python
