Feat: Dataset Unification Pipeline #96

Draft

Varun-sai-500 wants to merge 35 commits into yoxu515:main from Varun-sai-500:separate_pretraining

Conversation

Contributor

Varun-sai-500 commented Apr 7, 2026

This PR introduces a structural overhaul of the training pipeline, separating concerns that were previously tightly coupled and blocking extensibility.

**WORK IN PROGRESS**

The original pipeline:

- Hard-coupled static pretraining into trainer.py
- Effectively supported only a single pretraining configuration
- Had fragile dataset assumptions (non-standard formats → silent failures)
- Made it painful to extend, reproduce, or port

This PR fixes that at the root level, not with patches.

🔧 Core Changes

  1. Trainer Refactor (Critical Path)
     - Decoupled static pretraining from trainer.py
     - Removed the StaticTrain dependency from the core training loop
     - Cleaned up dataset-preparation logic → now modular and extensible

👉 Result: trainer.py is now strictly responsible for video training, not overloaded with pretraining concerns.
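To make the separation concrete, here is a minimal sketch of the post-refactor dataset preparation, in spirit only: `VideoDataset` and `prepare_video_datasets` are illustrative stand-ins, not the repo's actual classes.

```python
from torch.utils.data import ConcatDataset, Dataset

class VideoDataset(Dataset):
    """Illustrative stand-in for video dataset classes
    such as DAVIS2017_Train / YOUTUBEVOS_Train."""
    def __init__(self, name):
        self.name = name
        self.samples = []  # populated by the real loaders

    def __len__(self):
        return len(self.samples)

    def __getitem__(self, idx):
        return self.samples[idx]

def prepare_video_datasets(names):
    """After the refactor, trainer.py assembles only video datasets;
    the StaticTrain / image-pretraining branch no longer lives here."""
    return ConcatDataset([VideoDataset(n) for n in names])

train_data = prepare_video_datasets(['davis2017', 'youtubevos'])
```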

  2. Dedicated Pretraining Pipeline
     - Introduced: networks/managers/pre_trainer.py
     - Static image pretraining is now:
       - Standalone
       - Reproducible
       - Hardware-agnostic (CPU/GPU)

👉 Matches design patterns used in large-scale OSS (separation of stages, not flags inside one script).
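To illustrate the separation-of-stages pattern, a standalone pretraining entry point might look roughly like this; the CLI flags and wiring below are assumptions for the sketch, not the actual pre_trainer.py interface.

```python
import argparse
import torch

def main():
    parser = argparse.ArgumentParser(description='Standalone static-image pretraining stage')
    parser.add_argument('--data', default='datasets/static', help='unified static dataset root')
    parser.add_argument('--epochs', type=int, default=1)
    parser.add_argument('--seed', type=int, default=0, help='fixed seed for reproducibility')
    args = parser.parse_args()

    torch.manual_seed(args.seed)  # reproducible runs
    # Hardware-agnostic: use the GPU when present, otherwise the CPU.
    device = torch.device('cuda' if torch.cuda.is_available() else 'cpu')
    print(f'Pretraining on {device}, data root: {args.data}, epochs: {args.epochs}')
    # ... build the model and static-image dataset here, then run the loop ...

if __name__ == '__main__':
    main()
```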

  3. Dataset Unification (Big Fix)
     - Added: tools/unify_datasets.py
     - Converts heterogeneous datasets (COCO, MSRA10K, ECSSD, VOC, etc.) → a single canonical format (datasets/static/)
     - What it solves:
       - Broken assumptions across datasets
       - Inconsistent annotations/layouts
       - Previous inability to use multiple datasets reliably

👉 Now users can run:

`python tools/unify_datasets.py --sources --output datasets/static`
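For intuition, the conversion boils down to copying each source's image/mask pairs into one canonical per-dataset layout plus an index file. The sketch below is illustrative only: the JPEGImages/Annotations layout, the naming scheme, and the `unify` helper are assumptions; the real mapping rules live in tools/unify_datasets.py.

```python
import os
import shutil

def unify(pairs, dst_root, name):
    """Copy one source dataset's (image, mask) pairs into the canonical layout."""
    dst = os.path.join(dst_root, name)
    os.makedirs(os.path.join(dst, 'JPEGImages'), exist_ok=True)
    os.makedirs(os.path.join(dst, 'Annotations'), exist_ok=True)
    names = []
    for i, (img, mask) in enumerate(pairs):
        stem = f'{name}_{i:06d}'
        shutil.copy(img, os.path.join(dst, 'JPEGImages', stem + '.jpg'))
        shutil.copy(mask, os.path.join(dst, 'Annotations', stem + '.png'))
        names.append(stem)
    # Index file listing every sample in this unified dataset.
    with open(os.path.join(dst, 'train.txt'), 'w') as f:
        f.write('\n'.join(names) + '\n')
```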
  4. Pretrained Model Support (Finally Usable)
     - Added 5 pretraining checkpoints in the README
     - Standardized the loading path
     - Removed the dependency on "that one working model"

👉 Result: users can actually run pretraining without debugging the repo for hours.
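The standardized loading path reduces, in spirit, to something like the sketch below; the 'state_dict' wrapping convention and the `load_pretrained` name are assumptions, not the PR's exact code.

```python
import torch

def load_pretrained(model, ckpt_path, device='cpu'):
    """Load a pretraining checkpoint, tolerating raw and wrapped state dicts."""
    ckpt = torch.load(ckpt_path, map_location=device)
    state = ckpt.get('state_dict', ckpt)  # unwrap if the checkpoint is wrapped
    missing, unexpected = model.load_state_dict(state, strict=False)
    if missing or unexpected:
        print(f'Loaded with {len(missing)} missing / {len(unexpected)} unexpected keys')
    return model
```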
  5. Training Pipeline Improvements
     - Updated tools/train.py and train_eval.sh
     - Integrated:
       - An optional pretraining stage
       - Default checkpoint handling
       - A graceful fallback if weights are missing
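The graceful fallback can be summarized by a guard like this (again, the function name and call site are illustrative, not the PR's exact code):

```python
import os
import warnings
import torch

def maybe_load_pretrain(model, path):
    """Load pretraining weights when present; otherwise warn and train from scratch."""
    if path and os.path.isfile(path):
        state = torch.load(path, map_location='cpu')
        model.load_state_dict(state.get('state_dict', state), strict=False)
    else:
        warnings.warn(f'Pretrained weights not found at {path!r}; '
                      'starting from randomly initialized weights.')
    return model
```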

Varun-sai-500 changed the title from "Separate pretraining" to "Decoupled Pretraining + Dataset Unification Pipeline" on Apr 7, 2026
@Varun-sai-500
Contributor Author

@z-x-yang Please ignore the trailing whitespace inside train_datasets; removing it isn't as easy as I thought.

@Varun-sai-500
Contributor Author

@z-x-yang It's a big refactor, but I think it's worth it.

Varun-sai-500 changed the title from "Decoupled Pretraining + Dataset Unification Pipeline" to "Feat. Decoupled Pretraining + Dataset Unification Pipeline" on Apr 12, 2026
Varun-sai-500 changed the title from "Feat. Decoupled Pretraining + Dataset Unification Pipeline" to "Feat: Decoupled Pretraining + Dataset Unification Pipeline" on Apr 12, 2026
Varun-sai-500 marked this pull request as draft on April 15, 2026, 15:14
@Varun-sai-500
Contributor Author

@z-x-yang Can you review this when you get a chance? Don't run it yet since it's incomplete; just check whether the approach looks good.

Varun-sai-500 changed the title from "Feat: Decoupled Pretraining + Dataset Unification Pipeline" to "Feat: Dataset Unification Pipeline" on Apr 25, 2026
@z-x-yang
Collaborator

Thanks for working on this — the dataset unification direction is useful, but I don’t think this PR is merge-ready yet.

A few hard blockers:

  1. dataloaders/train_datasets.py currently removes DAVIS2017_Train, YOUTUBEVOS_Train, and TEST, while networks/managers/trainer.py still imports and uses them. This breaks the main training path at import / dataset preparation time.

  2. StaticTrain now requires dataset_name, but the existing trainer call still uses the old signature:
    StaticTrain(cfg.DIR_STATIC, cfg.DATA_RANDOMCROP, ...).
    So the static pretraining path is not wired up yet either.

  3. _merge_sample was removed, but StaticTrain.__getitem__ still calls it in the dynamic merge branch. That path will fail at runtime.

  4. tools/unify_datasets.py does not create the dst/name/ directory before writing train.txt; a minimal image/mask smoke test fails with FileNotFoundError (a one-line fix is sketched after this list).

  5. The README command points to unify_dataset.py, but the added file is tools/unify_datasets.py, and the static pretraining section currently has duplicated / malformed markdown.
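For (4), here is a minimal sketch of the fix, assuming the tool writes one train.txt per source dataset (the function and variable names are illustrative):

```python
import os

def write_index(dst_root, name, sample_names):
    """Create dst_root/name/ before writing train.txt."""
    dst_dir = os.path.join(dst_root, name)
    os.makedirs(dst_dir, exist_ok=True)  # without this, open() raises FileNotFoundError
    with open(os.path.join(dst_dir, 'train.txt'), 'w') as f:
        f.write('\n'.join(sample_names) + '\n')
```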

I’d suggest narrowing this PR first: keep the existing video-training dataset classes intact, preserve the current trainer API, make the dataset-unification tool standalone and smoke-tested, then update the README after the command actually works. After that I can review the design again.

@Varun-sai-500
Contributor Author

Oh well @z-x-yang, this PR wasn't ready yet; I mistakenly included it in the email. It's still a draft PR.
