Trainer: Pass num_items_in_batch to compute_loss in prediction_step #41183
Conversation
I think the test case could be better; I'm open to hearing suggestions to add tests (for multi-GPU?) or to modify the existing one.
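For illustration, one hypothetical shape such a test could take (this is not the PR's actual test; the tiny model name, the RecordingTrainer helper, and the dataset below are placeholders, and it assumes compute_loss already exposes the num_items_in_batch keyword as in recent transformers releases):

```python
import torch
from torch.utils.data import Dataset
from transformers import AutoModelForCausalLM, AutoTokenizer, Trainer, TrainingArguments


class TinyTextDataset(Dataset):
    """A few identical short samples, enough to drive Trainer.evaluate()."""

    def __init__(self, tokenizer, n=8):
        enc = tokenizer(["hello world"] * n, return_tensors="pt", padding=True)
        self.input_ids = enc["input_ids"]
        self.attention_mask = enc["attention_mask"]

    def __len__(self):
        return self.input_ids.size(0)

    def __getitem__(self, i):
        return {
            "input_ids": self.input_ids[i],
            "attention_mask": self.attention_mask[i],
            "labels": self.input_ids[i].clone(),
        }


class RecordingTrainer(Trainer):
    """Records whether the evaluation loop forwarded num_items_in_batch."""

    seen_num_items = None

    def compute_loss(self, model, inputs, return_outputs=False, num_items_in_batch=None):
        self.seen_num_items = num_items_in_batch
        return super().compute_loss(
            model, inputs, return_outputs=return_outputs, num_items_in_batch=num_items_in_batch
        )


def test_eval_passes_num_items_in_batch(tmp_path):
    tokenizer = AutoTokenizer.from_pretrained("hf-internal-testing/tiny-random-gpt2")
    tokenizer.pad_token = tokenizer.eos_token
    model = AutoModelForCausalLM.from_pretrained("hf-internal-testing/tiny-random-gpt2")
    trainer = RecordingTrainer(
        model=model,
        args=TrainingArguments(
            output_dir=str(tmp_path), per_device_eval_batch_size=4, report_to=[]
        ),
        eval_dataset=TinyTextDataset(tokenizer),
    )
    trainer.evaluate()
    # With this PR, prediction_step should hand the token count to compute_loss.
    assert trainer.seen_num_items is not None
```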
LGTM! Thanks for this nice PR!
Hmmm, these tests are still failing: tests/trainer/test_trainer.py::TrainerIntegrationTest::test_evaluate_with_jit. Can you quickly check why? I guess the simplest solution would be to check if we have …
The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.
Will take a look in a few hours once I'm off work.
The issue was with …
What does this PR do?
Ensures that num_items_in_batch is passed to the compute_loss function in prediction_step, so that the loss is calculated the same way at both train and eval time. Fixes #41108
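For illustration only, a minimal sketch of the idea as a Trainer subclass rather than the actual diff in this PR; it assumes compute_loss already accepts a num_items_in_batch keyword (as in recent transformers releases) and that -100 marks label positions excluded from the loss:

```python
import torch
from transformers import Trainer


class EvalNumItemsTrainer(Trainer):
    """Sketch: forward the eval batch's contributing-token count to compute_loss."""

    def prediction_step(self, model, inputs, prediction_loss_only, ignore_keys=None):
        labels = inputs.get("labels")
        if not prediction_loss_only or labels is None:
            # Keep the stock behaviour for generation / no-label paths.
            return super().prediction_step(
                model, inputs, prediction_loss_only, ignore_keys=ignore_keys
            )

        # Count the label positions that actually contribute to the loss
        # (-100 is the ignore index), mirroring the training-side computation.
        num_items_in_batch = (labels != -100).sum()

        inputs = self._prepare_inputs(inputs)
        with torch.no_grad():
            # Passing the count here makes the per-token normalization of the
            # eval loss match the one used during training.
            loss = self.compute_loss(model, inputs, num_items_in_batch=num_items_in_batch)
        return (loss.detach(), None, None)
```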
Before submitting
- Did you read the contributor guideline, Pull Request section?
- Was this discussed/approved via a GitHub issue or the forum? Please add a link to it if that's the case.
- Did you make sure to update the documentation with your changes? Here are the documentation guidelines, and here are tips on formatting docstrings.
Who can review?
Anyone in the community is free to review the PR once the tests have passed. Feel free to tag
members/contributors who may be interested in your PR.
@SunMarc