Creates a unified interface for task specification for python_api by josegcpa · Pull Request #484 · wasserth/TotalSegmentator

josegcpa · 2025-06-23T10:26:29Z

Hey! Big fan of the project here :-)

This PR significantly refactors how tasks are specified and creates a simple interface for adding new tasks without adding too much clutter to totalsegmentator.python_api. This is part of a larger effort to support multi-task specification through the CLI, as it is quite useful across multiple applications. Let me know whether this is OK and whether it is something you want.

I tried running all of the tests but some files appear to be missing (at least for tests/test_end_to_end.py) - let me know if I missed something.

EDIT: I also migrated package management to uv as, in my experience, it makes dependency and version management smoother.

wasserth · 2025-06-25T07:43:16Z

Thanks for this PR. I will have a look at it.

wasserth · 2025-06-27T09:36:20Z

I had a more detailed look at the PR. Thank you! Overall, it adds quite a few lines of code and makes the code a little less straightforward by introducing the Task class. I do not see the advantage of this yet. So far, it has also been easy to add a new class in the python_api, in my opinion. Maybe you can elaborate a bit more on what you are missing at the moment. Do you want to be able to run multiple tasks without having to call the python_api multiple times?
This would save a little bit of time, since the CT image only has to be loaded once and the fast model that finds the crop region also only has to run once. But the cropping, resampling and running of the actual model stay the same, and these steps take the majority of the time (depending on the model). Therefore, I am not sure this change is worth the effort. You would also have to be able to specify multiple output files in this case. This would require a lot of refactoring and make the code more verbose in several places.
So far I have been fine with running the python_api several times for several models.

josegcpa · 2025-06-27T10:40:03Z

> I had a more detailed look at the PR. Thank you! Overall, it adds quite a few lines of code and makes the code a little less straightforward by introducing the Task class. I do not see the advantage of this yet. So far, it has also been easy to add a new class in the python_api, in my opinion. Maybe you can elaborate a bit more on what you are missing at the moment. Do you want to be able to run multiple tasks without having to call the python_api multiple times? This would save a little bit of time, since the CT image only has to be loaded once and the fast model that finds the crop region also only has to run once. But the cropping, resampling and running of the actual model stay the same, and these steps take the majority of the time (depending on the model). Therefore, I am not sure this change is worth the effort. You would also have to be able to specify multiple output files in this case. This would require a lot of refactoring and make the code more verbose in several places. So far I have been fine with running the python_api several times for several models.

Hey! It actually reduces the code and repetition. It only adds lines of code because it introduces PEP 8 compliant formatting; I should've been clearer about this.

As for the why:
Right now the main point is to compartmentalise everything so that there are no unnecessary repetitions (e.g. whether a task supports fast/fastest, or whether it is commercial, can be checked quickly through lists specified at the beginning of tasks.py). The same applies to task-to-task-ID conversions. Task is simply a straightforward data model which makes it immediately clear what needs to be specified for interoperability with the Python API; this tends to significantly reduce errors caused by missing specification.
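For illustration only, a data model along these lines could look roughly like the sketch below. It is not the code from this PR: the field names, the task names, the task IDs and the `resolve_task` helper are all hypothetical.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Task:
    """Hypothetical task description; every field name here is an assumption."""
    name: str
    task_id: int
    supports_fast: bool = False
    supports_fastest: bool = False
    commercial: bool = False


# Single registry: name -> Task. The entries are made up for this example.
TASKS = {
    task.name: task
    for task in (
        Task("example_organs", 9001, supports_fast=True, supports_fastest=True),
        Task("example_licensed", 9002, commercial=True),
    )
}


def resolve_task(name: str, fast: bool = False, fastest: bool = False) -> Task:
    """Look up a task and validate the requested speed mode in one place."""
    task = TASKS.get(name)
    if task is None:
        raise ValueError(f"Unknown task: {name!r}")
    if fast and not task.supports_fast:
        raise ValueError(f"Task {name!r} does not support fast mode")
    if fastest and not task.supports_fastest:
        raise ValueError(f"Task {name!r} does not support fastest mode")
    return task
```

With something like this, the name-to-ID conversion and the fast/fastest/commercial checks all read from one record per task instead of from separate branches.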

Eventually, the point was to i) further specify the weights_path and WEIGHTS_URL directly as part of the Task so that adding a new model is fully self-contained and ii) create a simple YAML-based interface where tasks can be specified as an auxiliary file rather than as part of the code.
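As a rough sketch of that idea (again, not the actual implementation), the snippet below builds self-contained task specifications from a plain mapping of the kind a YAML parser would return. The `TaskSpec` class, its fields, the example URL and the directory name are all hypothetical; only the standard library is used, so the YAML parsing step itself is left out.

```python
from dataclasses import dataclass, fields
from pathlib import Path


@dataclass(frozen=True)
class TaskSpec:
    """Hypothetical self-contained task spec, including where its weights live."""
    name: str
    task_id: int
    weights_url: str
    weights_dir: str

    @property
    def weights_path(self) -> Path:
        # Assumed layout: one sub-directory per task inside weights_dir.
        return Path(self.weights_dir) / self.name


def load_task_specs(raw: dict) -> dict:
    """Turn {task_name: {field: value}} into {task_name: TaskSpec}."""
    allowed = {f.name for f in fields(TaskSpec)} - {"name"}
    specs = {}
    for name, entry in raw.items():
        unknown = set(entry) - allowed
        if unknown:
            raise ValueError(f"Task {name!r} has unknown keys: {sorted(unknown)}")
        specs[name] = TaskSpec(name=name, **entry)
    return specs


# Stand-in for the parsed contents of an auxiliary task file (values are made up).
raw_config = {
    "example_task": {
        "task_id": 9001,
        "weights_url": "https://example.com/weights/example_task.zip",
        "weights_dir": "weights",
    }
}
specs = load_task_specs(raw_config)
```

Rejecting unknown keys early is a design choice in this sketch: a typo in the auxiliary file then fails at load time rather than at inference time.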

The main point of specifying multiple tasks/roi_subsets is to reuse crops, as some tasks have largely overlapping crops and there is no need to perform the same computation twice. This reduces compute time, which is helpful when being billed for compute.
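A minimal sketch of the crop-reuse idea, under the assumption that each task can be associated with a named crop region: the crop is computed once per region and shared by every task that needs it. The function, the task names and the region names are hypothetical, and the two callables stand in for the real crop and inference steps.

```python
from typing import Any, Callable, Dict, List, Tuple


def run_tasks_with_shared_crops(
    tasks: List[Tuple[str, str]],
    compute_crop: Callable[[str], Any],
    run_model: Callable[[str, Any], Any],
) -> Dict[str, Any]:
    """Run each (task_name, crop_region) pair, computing each crop only once."""
    crop_cache: Dict[str, Any] = {}
    results: Dict[str, Any] = {}
    for task_name, crop_region in tasks:
        if crop_region not in crop_cache:
            crop_cache[crop_region] = compute_crop(crop_region)
        results[task_name] = run_model(task_name, crop_cache[crop_region])
    return results


# Stub callables that only record what was requested.
crop_calls: List[str] = []


def fake_compute_crop(region: str) -> str:
    crop_calls.append(region)
    return f"crop({region})"


def fake_run_model(task_name: str, crop: str) -> str:
    return f"{task_name} on {crop}"


results = run_tasks_with_shared_crops(
    [("task_a", "region_x"), ("task_b", "region_x"), ("task_c", "region_y")],
    fake_compute_crop,
    fake_run_model,
)
print(crop_calls)  # ['region_x', 'region_y']
```

Three tasks are run here but only two crops are computed. The per-task model run is not shared, which matches the point above that the saving is limited to the cropping step.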

But I fully understand if this is out of the scope of the project! It is not so much an addition to the functionalities so feel free to disregard.

wasserth · 2025-06-27T13:34:41Z

Thank you very much for these additional explanations! I will have to think about it.

josegcpa added 9 commits June 23, 2025 11:00

migration to uv as it facilitates installation and etc

d22d598

significant factorisation, creation of generic dataclass for Task to …

602c7dc

…facilitate addition of new tasks

added some dependencies for testing

17602dd

downgraded python version

7d3fc7a

added some dependencies

8ca345a

added blosc2 dependency for nnunet

47e22f5

corrected some minimal bugs

cc56419

crop now supports none as an argument (for tc)

601eb43

bumped python version as it is required for pydicom.pixels (required …

b070d44

…by dicom2nifti)

josegcpa added 3 commits June 27, 2025 14:16

weight download now handled by task

172e5b7

further factorisation and formatting of tasks

b5b887c

added minimal type hinting, moved download orchestration to task

eedcd96

josegcpa wants to merge 12 commits into wasserth:master from josegcpa:master