Skip to content

tokens_per_batch fixed to take into account DP and micro-batch accumu…#13

Open
amithrm wants to merge 1 commit into70b_drop2_p1from
LR
Open

tokens_per_batch fixed to take into account DP and micro-batch accumu…#13
amithrm wants to merge 1 commit into70b_drop2_p1from
LR

Conversation

@amithrm
Copy link
Copy Markdown
Collaborator

@amithrm amithrm commented Oct 25, 2024

max steps uses the corrected tokens_per_batch to compute LR schedule

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant