Pull requests: verl-project/verl
[rollout] feat: automatically resume generation on abort
#5071
opened Jan 27, 2026 by
wuxibin89
[doc] fix: fix ci and move docs to npu best practice dir
#5069
opened Jan 27, 2026 by
wucong25
8 tasks
[ray, single_controller, trainer, utils] feat: Add distributed backend selection support (Ray/openYuanrong)
#5067
opened Jan 27, 2026 by
mianmianboom
3 of 8 tasks
[sglang,ci,doc] feat: Update Ascend Dockerfile and docker build workflow to 8.3.RC1 version for VeRL + Sglang
#5065
opened Jan 27, 2026 by
xiazhahe
8 tasks
[sglang] feat: add NPU GRPO training scripts for Qwen2.5-32B (FSDP/SGLang backends)
#5062
opened Jan 27, 2026 by
xiazhahe
8 tasks
[doc] chore: update the links in docker file after move to verl-project
#5061
opened Jan 27, 2026 by
yyyy2000
8 tasks
[sglang, doc] feat: add NPU GRPO training scripts for Qwen3-30B (Megaton/SGLang backends) and update doc
#5060
opened Jan 27, 2026 by
hustmf
8 tasks
[reward] fix: fix reward computation in _validate when use_reward_loop=True and reward_model.enable=True
#5054
opened Jan 27, 2026 by
none0663
8 tasks done
[megatron,trainer,algo] feat: On-Policy Distillation
#5041
opened Jan 26, 2026 by
JacobHelwig
Draft
8 tasks
[ci] feat: add npu workflow,e2e_sft_llm&model&reward_model_vllm
#5039
opened Jan 25, 2026 by
yyyy2000
8 tasks
[megatron, training_utils] fix: router replay R3 align router replay data with global layer indices
#5037
opened Jan 24, 2026 by
HollowMan6
8 tasks done
[trainer] fix: resolve dataset config in agent loop
#5034
opened Jan 24, 2026 by
yyDing1
8 tasks
[fsdp, megatron] Refactor fully-async training to support multiple checkpoint engine backends
#5029
opened Jan 23, 2026 by
Shangwei-Li
Draft
8 tasks
[feat] Atropos integration with GRPO (#1782)
#5026
opened Jan 22, 2026 by
vyomakesh0728
4 of 6 tasks
[megatron] fix: megatron async save ckpt fix
#5016
opened Jan 22, 2026 by
Leem-Li
8 tasks done
[rollout] feat: support filter for fully_async_policy
#5014
opened Jan 22, 2026 by
sl-1314
8 tasks
Bug Report: filter_overlong_prompts Fails for Multimodal Data
#5004
opened Jan 21, 2026 by
bizhongan414
[ray] feat: use get_device_name() for automatic device detection in RayWorkerGroup instead of by parameter passing
#5000
opened Jan 21, 2026 by
jianjunzhong
4 of 8 tasks
[WIP][data] feat: TransferQueue - integrate TransferQueue into main codebase
#4987
opened Jan 20, 2026 by
0oshowero0
Draft
6 of 8 tasks
[reward] fix: support RemoteRewardManager in load_reward_manager when use_reward_loop=True for fix math-verify issue #3407
#4985
opened Jan 19, 2026 by
DtYXs
2 of 8 tasks
[model,doc] feat: add NPU GRPO training scripts for Qwen2.5-32B/Qwen3-30B (Megaton/vLLM backends)
#4984
opened Jan 19, 2026 by
psyloy
[megatron] feat: Support MTP training in SFT
#4981
opened Jan 19, 2026 by
arvyanh
8 tasks done