
[tests] Run trainings lazily. #1015

Open

pfebrer wants to merge 8 commits into main from tests_lazytraining

Conversation

@pfebrer (Contributor) commented Jan 21, 2026

Motivation

Whenever I wanted to run an isolated test with tox -e tests, the generate-outputs.sh script was executed first, so I had to wait for all the training runs to finish just to run a test that doesn't need them.

Workarounds until now

  • Run pytest directly, which is fine, but I think it is better if we can run the tests with the true testing environment.
  • Comment out the line that runs the trainings in tox.ini, which is super annoying to do each time and to remember to undo when pushing changes.

Implementation in this PR

The model paths are turned into fixtures, so they are evaluated only when a test needs them. At that point, we run the training for that specific model (see the sketch below).

In this way, trainings are run only if/when needed.
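
As a minimal sketch of the idea, assuming one fixture per model (the fixture name, resources path, and default model.pt output are hypothetical placeholders, not the exact names used in this PR):

```python
import subprocess
from pathlib import Path

import pytest

# Hypothetical location of the options files used by the training runs.
RESOURCES = Path(__file__).parent / "resources"


@pytest.fixture(scope="session")
def soap_bpnn_model_path(tmp_path_factory):
    """Train one test model, but only when a test actually requests it."""
    train_dir = tmp_path_factory.mktemp("soap_bpnn_training")
    subprocess.run(
        ["mtt", "train", str(RESOURCES / "options.yaml")],
        cwd=train_dir,
        check=True,
    )
    # Assuming the trained model is written as model.pt in the working
    # directory, return its path for the requesting test.
    return train_dir / "model.pt"
```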

Complications

We support running tests in parallel, so we have to make sure that if two workers request the same fixture they don't both run the training at the same time. I used a simple lockfile so that the other workers wait for the worker that is doing the training.
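
For illustration, this is the standard pytest-xdist pattern for such a train-once, session-scoped fixture, here using the filelock package; the run_training helper is hypothetical, and the PR may implement the lockfile differently:

```python
from pathlib import Path

import pytest
from filelock import FileLock  # third-party lockfile helper


def run_training(output_dir: Path) -> None:
    """Hypothetical helper that runs `mtt train` and writes model.pt."""
    ...


@pytest.fixture(scope="session")
def model_path(tmp_path_factory, worker_id):
    # `worker_id` is provided by pytest-xdist ("master" when not distributed).
    # The parent of each worker's base temp dir is shared by all workers.
    shared_dir = tmp_path_factory.getbasetemp().parent
    model = shared_dir / "model.pt"

    if worker_id == "master":
        # No parallelism: just train if the model is not there yet.
        if not model.exists():
            run_training(shared_dir)
        return model

    # With pytest-xdist: the first worker to grab the lock runs the
    # training; the others block on the lockfile and then reuse the result.
    with FileLock(str(model) + ".lock"):
        if not model.exists():
            run_training(shared_dir)
    return model
```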

Nice side effects

Since trainings are run per model, in principle different trainings could run in parallel. In practice, however, this often won't happen: different workers will ask for the same training, and all but one of them will just sit idle. Smarter splitting of the tests across workers would ensure that trainings actually run in parallel, but I don't know if that is possible (I haven't looked into it).


📚 Documentation preview 📚: https://metatrain--1015.org.readthedocs.build/en/1015/

@pfebrer (Contributor, Author) commented Jan 21, 2026

There are still some tests that are not using the fixtures, which is why they are failing; I will fix that later.

@pfebrer pfebrer force-pushed the tests_lazytraining branch from 1ae874b to d0e20fb on January 21, 2026 at 18:11
@Luthaf (Member) left a comment

This looks good to me! I like using upper case names for global fixtures.

@Luthaf (Member) commented Jan 22, 2026

cscs-ci run

@pfebrer (Contributor, Author) commented Jan 22, 2026

Ok, I tried to run the tests locally with a GPU and they pass, so I don't know exactly what is going on here; I will try to investigate. The training script fails when running mtt train. I will try to make it print the output of mtt train, which is something I wanted to do anyway, because otherwise I have made the failures more opaque 😅
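
As a sketch of one way to do that (not necessarily what this PR ends up doing), the training helper could capture the subprocess output and re-emit it so pytest shows it on failure:

```python
import subprocess


def run_mtt_train(options_file, cwd):
    # Capture stdout/stderr so pytest can display them when the training
    # fails, instead of swallowing the output of `mtt train`.
    result = subprocess.run(
        ["mtt", "train", str(options_file)],
        cwd=cwd,
        capture_output=True,
        text=True,
    )
    print(result.stdout)
    print(result.stderr)
    result.check_returncode()  # raises CalledProcessError if training failed
```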

@Luthaf (Member) commented Feb 5, 2026

cscs-ci run

@PicoCentauri (Contributor) commented:

> This looks good to me! I like using upper case names for global fixtures.

We could write this down somewhere, both for the LLM and for us to remember.

@Luthaf (Member) commented Feb 6, 2026

CI failure seems relevant!

@pfebrer (Contributor, Author) commented Feb 6, 2026

Yes yes, I have to understand what is going on 😅
