
Fix batch collator padding for training with batch size > 1#36

Open
stepan-omelka wants to merge 2 commits into antoniorv6:master from stepan-omelka:my-selected-changes

Conversation

@stepan-omelka
Contributor

@stepan-omelka stepan-omelka commented Mar 10, 2026

[bug]

When running the training script with a batch size greater than 1,
the process crashed due to mismatched tensor lengths between the decoder
input and the ground-truth targets.

This change ensures that all sequence tensors within a batch are dynamically
padded to the batch's maximum sequence length using the dataset's padding token.
As a result, the model can safely process batches larger than 1 without
tensor dimension conflicts during training or validation.

  • Integrates BatchCollator for dynamic sequence padding.
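The padding behavior described above can be sketched roughly as follows. This is a minimal illustration, not the PR's actual code: the class name `BatchCollator` comes from the PR description, but the constructor signature, `pad_token_id` parameter, and internals are assumptions.

```python
import torch


class BatchCollator:
    """Illustrative sketch: pad variable-length token sequences to the
    longest sequence in the batch, using the dataset's padding token."""

    def __init__(self, pad_token_id: int):
        self.pad_token_id = pad_token_id

    def __call__(self, sequences):
        # sequences: list of 1-D LongTensors of differing lengths
        max_len = max(seq.size(0) for seq in sequences)
        # Start from a tensor filled entirely with the padding token
        out = torch.full(
            (len(sequences), max_len), self.pad_token_id, dtype=torch.long
        )
        # Copy each sequence into its row; the tail stays padded
        for i, seq in enumerate(sequences):
            out[i, : seq.size(0)] = seq
        return out
```

In a typical PyTorch setup, an object like this would be passed to the `DataLoader` via `collate_fn=BatchCollator(pad_token_id)`, so every batch comes out rectangular regardless of the individual sequence lengths.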

Pretraining run with batch size 4 == orange
[image: training curve comparison]

@stepan-omelka stepan-omelka changed the title fix: batching and validation batching Fix batch collator padding for training with batch size > 1 Mar 10, 2026
@stepan-omelka stepan-omelka marked this pull request as ready for review March 10, 2026 11:52
@antoniorv6 antoniorv6 self-assigned this Mar 11, 2026
@antoniorv6 antoniorv6 added the enhancement New feature or request label Mar 11, 2026
@antoniorv6 antoniorv6 self-requested a review March 12, 2026 12:33
@antoniorv6
Owner

All the changes look good to me. However, have you tested the full-page scenario? Note that this program currently covers both cases (system-level and full-page). Could you send results for these other scenarios so we can merge?

Owner

@antoniorv6 antoniorv6 left a comment


Waiting until full-page results are presented.

@antoniorv6 antoniorv6 requested a review from eric-ayllon March 12, 2026 12:38
@stepan-omelka
Contributor Author

stepan-omelka commented Mar 24, 2026

Hi, I tried to run the fine-tuning, but I repeatedly run into an error (I created an issue for it). I also tried running the pretraining and fine-tuning on the master branch and hit the same error, so it is most likely not caused by the changes in this PR.

Until the issue is solved, I am unable to actually test the fine-tuning on increased batch size.


Labels

enhancement New feature or request
