CIFAR-10 Image Diffusion Model

Scripts for training a small image diffusion model using the CIFAR-10 dataset, and a gradio UI for testing.

Try it out here! You may need to restart the space if it is asleep: https://huggingface.co/spaces/cameron-d/CIFAR-10_Diffusion_Model_Space

Based on Hugging Face's Diffusion Course: https://huggingface.co/learn/diffusion-course/en/unit2/3

Training dataset: CIFAR-10 https://www.cs.toronto.edu/~kriz/cifar.html

Utilizes a UNet architecture with four down and up blocks. Images are 32x32 pixels.

The model was trained for 200 epochs. The generated images are of mixed quality, but are generally recognizable as CIFAR-10 images. The "car" and "truck" classes perform the best, most likely due to the more rigid and predictable structure of these objects.

Future plans:

Continue experimenting with larger, higher resolution datasets to try and achieve better results.
Use CLIP to train a text-to-image model.

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
.gitattributes		.gitattributes
CIFAR10_unet_200_epochs_inference.pth		CIFAR10_unet_200_epochs_inference.pth
CIFAR_10_diffusion_model_training.ipynb		CIFAR_10_diffusion_model_training.ipynb
README.md		README.md
UI_screenshot.png		UI_screenshot.png
gradio_app.py		gradio_app.py
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

CIFAR-10 Image Diffusion Model

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

CIFAR-10 Image Diffusion Model

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages