Skip to content

Conversation

@xmfan
Copy link
Member

@xmfan xmfan commented Nov 14, 2025

stack-info: PR: #246, branch: xmfan/stack/20
stack-info: PR: #251, branch: xmfan/stack/21
@meta-cla meta-cla bot added the CLA Signed This label is managed by the Meta Open Source bot. label Nov 14, 2025
if grad_to_accumulate is not None:
if unsharded_grad is None:
unsharded_grad = grad_to_accumulate
for i in range(len(unsharded_grads)):
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

what's the difference?

Copy link
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

no longer needed after rebase

n_microbatches=n_microbatches,
loss_fn=loss_fn,
backward_requires_autograd=backward_requires_autograd,
scale_grads=False,
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

are you proposing to land this change, or just using it while doing numerics tests?

Copy link
Member Author

@xmfan xmfan Nov 19, 2025

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

i'll change this to only be for numerics test mode maybe? it's either turn this off or emulate the scaling for non-pp

@xmfan xmfan force-pushed the xmfan/stack/20 branch 2 times, most recently from 6e72707 to b8546e1 Compare November 20, 2025 00:43
@xmfan
Copy link
Member Author

xmfan commented Nov 20, 2025

folded into previous pr

@xmfan xmfan closed this Nov 20, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

CLA Signed This label is managed by the Meta Open Source bot.

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants