
feat(embed): launch finetune with torchrun for multi-GPU support #166

Draft
shan-nvidia wants to merge 1 commit into `main` from `steve/embed_finetune_multi_gpu`

Conversation

@shan-nvidia
Contributor

Use `torch.distributed.run` with `--nproc_per_node=gpu` so training automatically uses all available GPUs (works correctly with 1 GPU too).


Mirrors the rerank recipe change in 756e4f2.
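
For reference, passing `gpu` to `--nproc_per_node` makes the launcher resolve the worker count to the number of visible GPUs, so a single-GPU machine still launches exactly one worker. A minimal sketch of the launch change, assuming a hypothetical `finetune.py` entry point and `--config` flag (the actual script and arguments in the recipe may differ):

```sh
# Before: single-process launch, trains on one GPU only
python finetune.py --config embed_finetune.yaml

# After: torchrun spawns one worker process per visible GPU;
# "gpu" resolves nproc_per_node to the local GPU count
python -m torch.distributed.run --nproc_per_node=gpu finetune.py --config embed_finetune.yaml
```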

Signed-off-by: Steve Han <sthan@nvidia.com>
Made-with: Cursor

