Skip to content

Nemotron Nano Omni V3 RL assets#168

Merged
marcromeyn merged 3 commits intoNVIDIA-NeMo:mainfrom
vinhngx:vinhn-omni-v3-rl
Apr 28, 2026
Merged

Nemotron Nano Omni V3 RL assets#168
marcromeyn merged 3 commits intoNVIDIA-NeMo:mainfrom
vinhngx:vinhn-omni-v3-rl

Conversation

@vinhngx
Copy link
Copy Markdown
Contributor

@vinhngx vinhngx commented Apr 28, 2026

Add Nemotron-3-Nano-Omni GRPO training cookbooks
Step-by-step guides for GRPO/GSPO-style RL fine-tuning of Nemotron Nano V3 Omni on the MMPR-Tiny visual reasoning dataset using NeMo-RL on a single DGX H100 node (8× H100 80 GB).

grpo/ — native NeMo-RL path (run_vlm_grpo.py) with built-in MMPR-Tiny loader; includes training config vlm_grpo_nanov3omni_mmpr_tiny_1node.yaml (TP=8, EP=8).
grpo_nemo_gym/ — NeMo-Gym rollout path (run_grpo_nemo_gym.py) with async vLLM HTTP server and rule-based MCQA + numeric verifiers; includes JSONL data converter and config grpo_mmpr_tiny_nanov3omni_gym.yaml.

@marcromeyn marcromeyn merged commit 34633f7 into NVIDIA-NeMo:main Apr 28, 2026
3 of 4 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants