Image captioning composed of three modules: 1) a decoder-only language model (OPT) for generating text, 2) a vision-language model (CLIP) for aligned representations of images and text, 3) an embedding mapper that maps a CLIP embedding to k OPT word embeddings.
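A minimal sketch of the mapper idea, assuming PyTorch, the ViT-B/32 CLIP embedding size (512), and the OPT-125m hidden size (768); the class name and MLP shape are illustrative, not the repo's exact implementation:

```python
import torch
import torch.nn as nn

class ClipToOptMapper(nn.Module):
    """Maps one CLIP image embedding to k OPT word embeddings (the prefix)."""

    def __init__(self, clip_dim: int = 512, opt_dim: int = 768, k: int = 10):
        super().__init__()
        self.k = k
        self.opt_dim = opt_dim
        hidden = (clip_dim + k * opt_dim) // 2  # assumed hidden size
        self.mlp = nn.Sequential(
            nn.Linear(clip_dim, hidden),
            nn.Tanh(),
            nn.Linear(hidden, k * opt_dim),
        )

    def forward(self, clip_embedding: torch.Tensor) -> torch.Tensor:
        # (batch, clip_dim) -> (batch, k, opt_dim)
        return self.mlp(clip_embedding).view(-1, self.k, self.opt_dim)
```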
Some examples from the COCO dataset, after training for only 2 epochs while learning a prefix of length 10 (k=10):
Install the dependencies:
pip install git+https://github.com/openai/CLIP.git
pip install -r requirements.txt
Check the following files:
extractFeatures.py to extract feature vectors from the COCO dataset using CLIP or OpenCLIP (see the extraction sketch below).
trainDecoder.py to train the mapper module and fine-tune OPT, or an OPT LoRA model (see the LoRA sketch below).
evaluateCaptioning.py to qualitatively evaluate the results (see the generation sketch below).
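As a rough sketch of what the feature extraction does for a single image, assuming the OpenAI CLIP package installed above and a ViT-B/32 backbone (the actual script iterates over the whole COCO dataset and saves the vectors):

```python
import clip
import torch
from PIL import Image

device = "cuda" if torch.cuda.is_available() else "cpu"
model, preprocess = clip.load("ViT-B/32", device=device)

# "example.jpg" stands in for a COCO image path
image = preprocess(Image.open("example.jpg")).unsqueeze(0).to(device)
with torch.no_grad():
    features = model.encode_image(image)  # shape (1, 512) for ViT-B/32
```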

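For the LoRA option, a minimal sketch using the Hugging Face peft library; the rank, alpha, dropout, target modules, and OPT size are illustrative assumptions, not the repo's settings:

```python
from peft import LoraConfig, get_peft_model
from transformers import AutoModelForCausalLM

model = AutoModelForCausalLM.from_pretrained("facebook/opt-125m")
lora_config = LoraConfig(
    r=8,                                  # assumed rank
    lora_alpha=16,
    target_modules=["q_proj", "v_proj"],  # OPT attention projections
    lora_dropout=0.05,
    task_type="CAUSAL_LM",
)
model = get_peft_model(model, lora_config)
model.print_trainable_parameters()  # only the adapter weights are trainable
```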

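For qualitative evaluation, one way to generate a caption is to feed the mapper's output to OPT as inputs_embeds (the random prefix below is a placeholder for the mapper output; generating from inputs_embeds requires a reasonably recent transformers version):

```python
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("facebook/opt-125m")
model = AutoModelForCausalLM.from_pretrained("facebook/opt-125m")

# Placeholder for the trained mapper's output: (1, k, opt_dim)
prefix_embeds = torch.randn(1, 10, model.config.hidden_size)
output_ids = model.generate(inputs_embeds=prefix_embeds, max_new_tokens=30)
print(tokenizer.decode(output_ids[0], skip_special_tokens=True))
```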