Add gradient accumulation and AMP to training scaffold #8

Draft
Co-authored-by: thinksyncs <42225585+thinksyncs@users.noreply.github.com>
Copilot AI changed the title from [WIP] Add gradient clipping and accumulation with AMP to Add gradient accumulation and AMP to training scaffold on Feb 6, 2026.

thinksyncs approved these changes on Feb 6, 2026.

Copilot AI added a commit that referenced this pull request on Feb 6, 2026.
Co-authored-by: thinksyncs <42225585+thinksyncs@users.noreply.github.com>
Implements gradient accumulation and automatic mixed precision (AMP) for the RT-DETR pose trainer. Gradient clipping already existed via `--clip-grad-norm`.

Changes

- Gradient accumulation (`--gradient-accumulation-steps N`): the optimizer steps only when `(steps + 1) % accum_steps == 0`.
- AMP (`--use-amp`): uses `torch.cuda.amp.GradScaler` on CUDA devices, warns and disables on CPU; the forward pass runs inside an `autocast()` context.

Interaction

With both AMP and clipping enabled, gradients are unscaled before clipping: `scaler.unscale_(optim)` → `clip_grad_norm_()` → `scaler.step()`. A sketch of the combined loop follows below.
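Below is a minimal, hypothetical sketch of how these pieces could fit together in a PyTorch training loop, based only on the flags and call order described above. The `model`, `loader`, `optim`, and `criterion` names are placeholders (the actual trainer lives in `rtdetr_pose/tools/train_minimal.py`), and dividing the loss by `accum_steps` is a common convention rather than something this PR confirms.

```python
# Hypothetical sketch of gradient accumulation + AMP + clipping; not the
# actual train_minimal.py code. model/loader/optim/criterion are placeholders.
import torch


def train_one_epoch(model, loader, optim, criterion, device,
                    accum_steps=1, use_amp=False, clip_grad_norm=None):
    # AMP only makes sense on CUDA; warn and fall back on CPU, as the PR describes.
    amp_enabled = use_amp and device.type == "cuda"
    if use_amp and not amp_enabled:
        print("warning: --use-amp requested on a non-CUDA device; disabling AMP")
    scaler = torch.cuda.amp.GradScaler(enabled=amp_enabled)

    optim.zero_grad(set_to_none=True)
    for step, (images, targets) in enumerate(loader):
        images = images.to(device)  # target handling depends on the dataset; omitted here
        with torch.cuda.amp.autocast(enabled=amp_enabled):
            outputs = model(images)
            loss = criterion(outputs, targets)

        # Averaging over accum_steps keeps the accumulated gradient comparable to a
        # single large batch (a common convention; the PR only documents the gating).
        scaler.scale(loss / accum_steps).backward()

        # Step only every accum_steps micro-batches: (steps + 1) % accum_steps == 0.
        if (step + 1) % accum_steps == 0:
            if clip_grad_norm is not None:
                # Unscale before clipping so the norm is measured on real (fp32) gradients.
                scaler.unscale_(optim)
                torch.nn.utils.clip_grad_norm_(model.parameters(), clip_grad_norm)
            scaler.step(optim)
            scaler.update()
            optim.zero_grad(set_to_none=True)
```

With `accum_steps=1` and `use_amp=False` this reduces to a plain clipped training loop, which is consistent with the claim below that the features are backward compatible and independent.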
Usage

```bash
# Effective batch size 16 with AMP and clipping
python rtdetr_pose/tools/train_minimal.py \
  --device cuda:0 \
  --batch-size 4 \
  --gradient-accumulation-steps 4 \
  --clip-grad-norm 1.0 \
  --use-amp
```

All features are backward compatible (disabled by default) and can be used independently.
Warning
Firewall rules blocked me from connecting to one or more addresses.

I tried to connect to the following addresses, but was blocked by firewall rules:

- images.cocodataset.org (dns block), triggered by: /usr/bin/python3 python3 /home/REDACTED/work/YOLOZU/YOLOZU/tools/fetch_coco128_official.py --out /home/REDACTED/work/YOLOZU/YOLOZU/data/coco128 --insecure

If you need me to access, download, or install something from one of these locations, you can either:
Original prompt