A place to practice and experiment with vision language and vision language action models.
data/- Datasets and preprocessing scriptspaligemma-from-scratch.ipynb/- Jupyter notebook for recreating PaliGemma from scratchprototypes.ipynb- Notebook for experimenting with prototypes and ideasREADME.md- Project overview and documentation