A simplified, educational implementation of the VL-JEPA (Vision-Language Joint Embedding Predictive Architecture) paper, built on Apple MLX.
Adapts the PaliGemma VLM into a JEPA architecture (for now).
| Name | Name | Last commit date | ||
|---|---|---|---|---|
A simplified, educational implementation of the VL-JEPA (Vision-Language Joint Embedding Predictive Architecture) paper, built on Apple MLX.
Adapts the PaliGemma VLM into a JEPA architecture (for now).