Research project on ad-copy and image generation.
By enhancing the contextual understanding of LLMs, AdFusion aims to create a model that better grasps the intent and tone of ad copy, producing images that are not only relevant but also creatively aligned with the message.
The t5_trainer.ipynb notebook is used to fine-tune the T5 model.
You can use Google Colab or install Jupyter Notebook locally to fine-tune the model.
Just remember that you have to install Jupyter Notebook, and it is strongly suggested that you have a GPU (preferably NVIDIA, so you can use CUDA).
If you would rather not set up Jupyter Notebook, you can use Colab, which provides a T4 GPU (you just have to switch the runtime to it).
You can either download Anaconda Navigator and launch Jupyter Notebook from there, or install Jupyter Notebook directly.
Create a new notebook and select a Python environment that has torch and numpy installed.
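As a quick sanity check before training, a minimal sketch like the following (assuming a standard PyTorch installation) confirms that torch and numpy are available and that a CUDA GPU is visible:
# Quick environment check: library versions and GPU availability
import numpy as np
import torch

print("numpy:", np.__version__)
print("torch:", torch.__version__)
print("CUDA available:", torch.cuda.is_available())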
Follow the instructions and code in t5_trainer.ipynb to fine-tune your model.
Sign in with your Google account and create a new notebook.
Since a Colab notebook needs to mount Google Drive for your files to be saved for later use, you can use this command:
# Mount Google Drive so the notebook's data and outputs are saved for later use
from google.colab import drive
drive.mount('/content/drive')

To download the required libraries in Colab, they have to be installed like this:
# Example install commands
!pip install sentencepiece
!pip install transformers
!pip install rich[jupyter]

If you want to fine-tune T5-large or T5-base with your own data, change base_path and output_file to point to your correct directory.
For example, if you are using Colab and want to access your folders:
# Reads data from the LaVi-Bridge parent folder, whose subfolders 1 to n
# (however many folders you have) each contain a captions.txt and an img.jpg file
base_path = "/content/drive/My Drive/LaVi-Bridge"
output_file = "/content/drive/My Drive/LaVi-Bridge/captions.txt"

This model trains on images and text, so remember to change the names "captions" and "img" to the names of your own .txt and .jpg files.
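For illustration, a loop along these lines could gather each subfolder's captions into the single combined output file; this is a hypothetical sketch of the idea, not the exact code from t5_trainer.ipynb:
# Hypothetical sketch: collect every subfolder's captions.txt into one combined file
import os

base_path = "/content/drive/My Drive/LaVi-Bridge"
output_file = "/content/drive/My Drive/LaVi-Bridge/captions.txt"

with open(output_file, "w", encoding="utf-8") as out:
    for name in sorted(os.listdir(base_path)):
        folder = os.path.join(base_path, name)
        caption_path = os.path.join(folder, "captions.txt")
        # Only the numbered subfolders 1..n contain a captions.txt
        if os.path.isdir(folder) and os.path.exists(caption_path):
            with open(caption_path, "r", encoding="utf-8") as f:
                out.write(f.read().strip() + "\n")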
Finally, if your data is in a CSV file, you can use:
# Insert your link here
import pandas as pd

path = "https://github.com"
df = pd.read_csv(path)

The folder structure should look like this:
My Drive
├── Other Folders
└── LaVi-Bridge
    ├── 1
    │   ├── captions.txt
    │   └── img.jpg
    ├── 2
    │   ├── captions.txt
    │   └── img.jpg
    ├── n
    │   ├── captions.txt
    │   └── img.jpg
    └── captions.txt (with fine-tuned data)
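To give a feel for what the fine-tuning step looks like, here is a minimal sketch of a single T5 training step with the transformers library; the model size, learning rate, and the example ad-copy texts are assumptions for illustration, and the real training loop lives in t5_trainer.ipynb:
# Minimal illustrative T5 fine-tuning step (hypothetical example data and hyperparameters)
import torch
from transformers import T5Tokenizer, T5ForConditionalGeneration

tokenizer = T5Tokenizer.from_pretrained("t5-base")
model = T5ForConditionalGeneration.from_pretrained("t5-base")
optimizer = torch.optim.AdamW(model.parameters(), lr=1e-4)

# One source/target pair; in practice these come from the captions data above
source_text = "generate ad copy: cozy winter jacket for city commuters"
target_text = "Stay warm, look sharp: the jacket your commute deserves."

inputs = tokenizer(source_text, return_tensors="pt")
labels = tokenizer(target_text, return_tensors="pt").input_ids

# The forward pass returns the cross-entropy loss when labels are provided
outputs = model(input_ids=inputs.input_ids, attention_mask=inputs.attention_mask, labels=labels)
outputs.loss.backward()
optimizer.step()
optimizer.zero_grad()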
Huge thanks to these GitHub repositories for the inspiration and clarification needed for this code: