official github code for "SmartPhotoCrafter: Unified Reasoning, Generation and Optimization for Automatic Photographic Image Editing"
-
Updated
Apr 27, 2026 - Python
official github code for "SmartPhotoCrafter: Unified Reasoning, Generation and Optimization for Automatic Photographic Image Editing"
Group-relative Trajectory-based Policy Optimization: Increasing Quality and Training Stability
RL training environments with verifiable rewards for coding agents. Works with TRL, Unsloth, verl, OpenRLHF.
Add a description, image, and links to the grpo-training topic page so that developers can more easily learn about it.
To associate your repository with the grpo-training topic, visit your repo's landing page and select "manage topics."