Nemotron Nano Omni V3 RL assets by vinhngx · Pull Request #168 · NVIDIA-NeMo/Nemotron

vinhngx · 2026-04-28T02:10:51Z

Add Nemotron-3-Nano-Omni GRPO training cookbooks
Step-by-step guides for GRPO/GSPO-style RL fine-tuning of Nemotron Nano V3 Omni on the MMPR-Tiny visual reasoning dataset using NeMo-RL on a single DGX H100 node (8× H100 80 GB).

grpo/ — native NeMo-RL path (run_vlm_grpo.py) with built-in MMPR-Tiny loader; includes training config vlm_grpo_nanov3omni_mmpr_tiny_1node.yaml (TP=8, EP=8).
grpo_nemo_gym/ — NeMo-Gym rollout path (run_grpo_nemo_gym.py) with async vLLM HTTP server and rule-based MCQA + numeric verifiers; includes JSONL data converter and config grpo_mmpr_tiny_nanov3omni_gym.yaml.

vinhngx added 3 commits April 27, 2026 18:35

adding omni-v3 RL assets

88af90e

fix bash cells

9a4de4c

remove bash magic

a8dfeb1

marcromeyn approved these changes Apr 28, 2026

View reviewed changes

marcromeyn merged commit 34633f7 into NVIDIA-NeMo:main Apr 28, 2026
3 of 4 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Nemotron Nano Omni V3 RL assets#168

Nemotron Nano Omni V3 RL assets#168
marcromeyn merged 3 commits intoNVIDIA-NeMo:mainfrom
vinhngx:vinhn-omni-v3-rl

vinhngx commented Apr 28, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

vinhngx commented Apr 28, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants