VJEPA2

Self-supervised visual representation learning from video. Part of the Zen LM ecosystem.

Overview

VJEPA2 implements Video Joint-Embedding Predictive Architecture for learning visual representations from unlabeled video data without relying on hand-crafted augmentations.

Features

Self-supervised learning from video
No hand-crafted augmentations required
Pre-trained visual encoder for downstream tasks
Efficient training with masking strategies

License

See LICENSE file.

Part of the Zen LM ecosystem by Hanzo AI

Name		Name	Last commit message	Last commit date
Latest commit History 23 Commits
.github/workflows		.github/workflows
app		app
assets		assets
configs		configs
evals		evals
notebooks		notebooks
src		src
tests		tests
.flake8		.flake8
.gitignore		.gitignore
APACHE-LICENSE		APACHE-LICENSE
CHANGELOG.md		CHANGELOG.md
CODE_OF_CONDUCT.md		CODE_OF_CONDUCT.md
CONTRIBUTING.md		CONTRIBUTING.md
LICENSE		LICENSE
README.md		README.md
hubconf.py		hubconf.py
pyproject.toml		pyproject.toml
requirements-test.txt		requirements-test.txt
requirements.txt		requirements.txt
setup.py		setup.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

VJEPA2

Overview

Features

Related

License

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

VJEPA2

Overview

Features

Related

License

About

Resources

License

Code of conduct

Contributing

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages