CanViT-specialize

Training loops for CanViT downstream probes (ADE20K segmentation) and IN1k finetuning.

Install

uv add "canvit-specialize @ git+https://github.com/m2b3/CanViT-specialize.git"

For TPU finetuning, see gcp_in1k_clf_ft/README.md.

Using a pre-trained probe

from canvit_pytorch import SegmentationProbe
probe = SegmentationProbe.from_pretrained("canvit/probe-ade20k-40k-s512-c64-in21k")
logits = probe(features)  # [B, H, W, D] → [B, num_classes, H, W]

For the fused CanViT + probe pair, see canvit_pytorch.CanViTForSemanticSegmentation.

Training

COMET_API_KEY, COMET_WORKSPACE, and ADE20K_ROOT must be set before training.

cp .envrc.example .envrc && direnv allow
# Edit .envrc to point at your dataset / Comet workspace.

ADE20K segmentation probe (frozen CanViT)

uv run python -m canvit_specialize.training.ade20k train \
  --scene-size 512 --canvas-grid 64

DINOv3 baseline probe

uv run python -m canvit_specialize.training.ade20k train-dinov3-probe

IN1k classification finetuning on GCP TPU v6e

See canvit_specialize/training/gcp_in1k_clf_ft/README.md.

Citation

@article{berreby2026canvit,
  title={CanViT: Toward Active-Vision Foundation Models},
  author={Berreby, Yoha{\"i}-Eliel and Du, Sabrina and Durand, Audrey and Krishna, B. Suresh},
  year={2026},
  eprint={2603.22570},
  archivePrefix={arXiv},
  primaryClass={cs.CV},
  url={https://arxiv.org/abs/2603.22570}
}

License

MIT. See LICENSE for details.

Name		Name	Last commit message	Last commit date
Latest commit History 159 Commits
canvit_specialize		canvit_specialize
scripts		scripts
slurm		slurm
tests		tests
.envrc.example		.envrc.example
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
pyproject.toml		pyproject.toml
uv.lock		uv.lock

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

CanViT-specialize

Install

Using a pre-trained probe

Training

ADE20K segmentation probe (frozen CanViT)

DINOv3 baseline probe

IN1k classification finetuning on GCP TPU v6e

Citation

License

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

CanViT-specialize

Install

Using a pre-trained probe

Training

ADE20K segmentation probe (frozen CanViT)

DINOv3 baseline probe

IN1k classification finetuning on GCP TPU v6e

Citation

License

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages