CanViT-pretrain

Passive-to-active dense latent distillation of CanViT (arXiv:2603.22570) from DINOv3 (arXiv:2508.10104).

Originally designed to run on the Nibi SLURM cluster using its hosted ImageNet-21k winter21_whole replica.

Setup

cp .envrc.example .envrc && direnv allow
# Edit .envrc to adapt to your environment.

Please ensure that HF_TOKEN, COMET_API_KEY, and COMET_WORKSPACE are set.
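A quick way to confirm the three variables are visible in the current shell before submitting any jobs is sketched below. The helper name check_required_env is hypothetical and not part of this repository; it only reports which names are unset or empty.

```shell
# Hypothetical helper (not part of this repo): report unset or empty variables.
check_required_env() {
  missing=0
  for name in "$@"; do
    # Look up the value of the variable whose name is stored in $name.
    eval "value=\${$name:-}"
    if [ -z "$value" ]; then
      echo "missing: $name" >&2
      missing=1
    fi
  done
  # Exit status 0 if everything is set, 1 otherwise.
  return "$missing"
}

check_required_env HF_TOKEN COMET_API_KEY COMET_WORKSPACE \
  || echo "Set the variables listed above in .envrc, then run: direnv allow"
```

If anything is reported missing, add it to .envrc and re-run direnv allow so the change is picked up.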

Run

Build the shuffled image index, then export DINOv3 teacher features. This only needs to be done once:

uv run python scripts/build_shuffled_index.py \
  --image-root $IN21K_IMAGE_DIR --index-dir $INDEX_DIR --dataset in21k
sbatch --array=0-99%20 slurm/export_features.sh
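With --array=0-99%20, SLURM creates 100 array tasks (indices 0 through 99) and lets at most 20 of them run at the same time. Each task receives its index in SLURM_ARRAY_TASK_ID. How slurm/export_features.sh uses that index is not shown in this README; the sketch below only illustrates one common pattern, assuming each task handles a contiguous slice of the shuffled index. The function shard_range and the item count of 1000 are hypothetical.

```shell
# Hypothetical helper: print the half-open range [start, end) of items
# assigned to one array task, splitting num_items as evenly as possible.
shard_range() {
  task_id=$1
  num_tasks=$2
  num_items=$3
  start=$(( task_id * num_items / num_tasks ))
  end=$(( (task_id + 1) * num_items / num_tasks ))
  echo "$start $end"
}

# SLURM sets SLURM_ARRAY_TASK_ID inside each array task; fall back to 0 elsewhere.
shard_range "${SLURM_ARRAY_TASK_ID:-0}" 100 1000
```

Integer division keeps the slices contiguous and non-overlapping even when the item count is not a multiple of the task count.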

Pretraining:

sbatch slurm/train.sbatch [--flag value ...]

Ablations:

bash slurm/ablations/baseline.sh
bash slurm/ablations/no-bptt.sh
# ...

Citation

@article{berreby2026canvit,
  title={CanViT: Toward Active-Vision Foundation Models},
  author={Berreby, Yoha{\"i}-Eliel and Du, Sabrina and Durand, Audrey and Krishna, B. Suresh},
  year={2026},
  eprint={2603.22570},
  archivePrefix={arXiv},
  primaryClass={cs.CV},
  url={https://arxiv.org/abs/2603.22570}
}

License

MIT. See LICENSE for details.
