Skip to content

[docs] add CRD references to README and workflow guides#155

Open
yuanzhi-zhu wants to merge 1 commit intoX-GenGroup:mainfrom
yuanzhi-zhu:docs/add-crd-references
Open

[docs] add CRD references to README and workflow guides#155
yuanzhi-zhu wants to merge 1 commit intoX-GenGroup:mainfrom
yuanzhi-zhu:docs/add-crd-references

Conversation

@yuanzhi-zhu
Copy link
Copy Markdown
Contributor

CRD was merged in #121 but the algorithm wasn't listed in the user-facing docs alongside the other algorithms. This commit:

  • README.md: add CRD row to the supported-algorithms table; add CRD to the Algorithms doc-pointer description.
  • examples/README.md: add 'crd' to the algorithm enum.
  • guidance/workflow.md: add CRD rows to the trajectory-policy and optimization-strategy tables; add CRD to the decoupled-sampling and fresh-timestep prose bullets.

CRD was merged in X-GenGroup#121 but the algorithm wasn't listed in the user-facing
docs alongside the other algorithms. This commit:

- README.md: add CRD row to the supported-algorithms table; add CRD to
  the Algorithms doc-pointer description.
- examples/README.md: add 'crd' to the algorithm enum.
- guidance/workflow.md: add CRD rows to the trajectory-policy and
  optimization-strategy tables; add CRD to the decoupled-sampling and
  fresh-timestep prose bullets.

Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>
Copilot AI review requested due to automatic review settings May 2, 2026 07:20
Copy link
Copy Markdown

Copilot AI left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Pull request overview

Updates user-facing documentation to include the previously-merged CRD (Centered Reward Distillation) algorithm alongside existing trainers, ensuring the README, examples guide, and workflow guide are consistent with the codebase’s supported algorithms.

Changes:

  • Add CRD to the top-level README “Supported Algorithms” table and the guidance document pointer description.
  • Add crd to the examples/ directory-structure “algorithm” enumeration.
  • Add CRD to the workflow guide’s sampling/trajectory and optimization strategy tables + related explanatory bullets.

Reviewed changes

Copilot reviewed 3 out of 3 changed files in this pull request and generated 2 comments.

File Description
README.md Adds CRD to supported algorithms and to the Algorithms guidance description.
examples/README.md Updates the documented examples/{algorithm}/... enum to include crd.
guidance/workflow.md Documents CRD behavior in sampling/trajectory and optimization sections.

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

Comment thread README.md
| AWM | awm | [Advantage Weighted Matching](https://arxiv.org/abs/2509.25050) |
| DGPO | dgpo | [DGPO](https://arxiv.org/abs/2510.08425) |
| GRPO-Guard | grpo-guard | [GRPO-Guard](https://arxiv.org/abs/2510.22319) |
| CRD | crd | [Centered Reward Distillation](https://arxiv.org/abs/2603.14128) ([Blog (chinese)](https://mp.weixin.qq.com/s/fpTi7PPi3APSNJQ2kXN3Dw))|
Comment thread README.md
| AWM | awm | [Advantage Weighted Matching](https://arxiv.org/abs/2509.25050) |
| DGPO | dgpo | [DGPO](https://arxiv.org/abs/2510.08425) |
| GRPO-Guard | grpo-guard | [GRPO-Guard](https://arxiv.org/abs/2510.22319) |
| CRD | crd | [Centered Reward Distillation](https://arxiv.org/abs/2603.14128) ([Blog (chinese)](https://mp.weixin.qq.com/s/fpTi7PPi3APSNJQ2kXN3Dw))|
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants