GenAI Homework 1

This repository contains the code for the tasks of the first homework of the course Generative artificial intelligence for graphics and multimedia (01VRWOV, 01VRWYG) at the Polytechnic University of Turin

Tasks

The tasks cover the implementation of several GAN (Generative Adversarial Network) based architectures trained on the Oxford Flowers102 dataset:

Ex.1 - GAN (Unconditional Generation)

Objective: train and test an unconditional GAN from scratch
Architecture: DCGAN-based architecture
Implementation Details:
- ConvTranspose2D layers replaced with Conv2D + Upsample to avoid checkerboard patterns
- images resized to 64x64 and pixels normalized to [-1, 1]
Results Preview:

Ex.2 - CGAN (Conditional Generation)

Objective: extend Ex.1 to achieve conditional generation through labels
Architecture: DCGAN-based architecture
Implementation Details:
- ConvTranspose2D layers replaced with Conv2D + Upsample to avoid checkerboard patterns
- images resized to 64x64 and pixels normalized to [-1, 1]
- labels concatenated in the first convolutional layer
Results Preview:

Ex.3 - WGAN-GP (Unconditional Generation)

Objective: re-implement Ex.1 using the Wasserstein objective with Gradient Penalty
Architecture: DCGAN-based architecture
Implementation Details:
- ConvTranspose2D layers replaced with Conv2D + Upsample to avoid checkerboard patterns
- BatchNorm2d layers replaced with InstanceNorm2d layers in the critic
- images resized to 64x64 and pixels normalized to [-1, 1]
Results Preview:

Ex.4 - WGAN-GP (Conditional Generation)

Objective: extend Ex.3 to achieve conditional generation through labels
Architecture: DCGAN-based architecture
Implementation Details:
- ConvTranspose2D layers replaced with Conv2D + Upsample to avoid checkerboard patterns
- BatchNorm2d layers replaced with GroupNorm2d layers in the critic
- images resized to 64x64 and pixels normalized to [-1, 1]
- labels passed separately wrt images, and "combined" via a dot product of their final representations
Results Preview:

Ex.5 (Optional) - Pix2Pix

Objective: implement the Pix2Pix architecture for paired image translation from dark and noisy to bright images
Architecture: Pix2Pix (PatchGAN discriminator + UNet generator)
Implementation Details:
- dark and noisy images created by multiplying by a dark factor (from 10% to 40%) and by adding Gaussian noise
- images resized to 256x256 and pixels normalized to [-1, 1]
- to avoid overfitting, training has been done on the val and test splits (7k images), leaving about 1k images of the training split for inference
- given the high computational effort to pass 256x256 images, a batch size of 1 is used
Results Preview:

Requirements

Dependencies: Python 3.10+, PyTorch, Torchvision, OpenCV (cv2), NumPy, Matplotlib. Install them with:

pip install -r requirements.txt

Important Note on Git LFS (Model Weights)

The trained weights for the Ex.5 Pix2Pix Generator are tracked using Git Large File Storage (LFS) due to the large memory footprint of the 256x256 UNet. To successfully clone this repository and download the actual .pth file (rather than a 130-byte text pointer), ensure you have Git LFS installed on your system before cloning:

# install Git LFS
git lfs install
# clone the repository
git clone https://github.com/Malgesw/GenAI-Homework1

If you already cloned the repository without Git LFS, install it and pull the weights directly:

# install Git LFS
git lfs install
# pull the weights
git lfs pull

How to Run

To run training and/or inference with one of the architectures, simply run the corresponding jupyter notebook.

Name		Name	Last commit message	Last commit date
Latest commit History 2 Commits
output		output
.gitattributes		.gitattributes
.gitignore		.gitignore
01-dcgan.ipynb		01-dcgan.ipynb
02-cgan.ipynb		02-cgan.ipynb
03-wgan.ipynb		03-wgan.ipynb
04-cwgan.ipynb		04-cwgan.ipynb
README.md		README.md
extra-pix2pix.ipynb		extra-pix2pix.ipynb
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

GenAI Homework 1

Tasks

Ex.1 - GAN (Unconditional Generation)

Ex.2 - CGAN (Conditional Generation)

Ex.3 - WGAN-GP (Unconditional Generation)

Ex.4 - WGAN-GP (Conditional Generation)

Ex.5 (Optional) - Pix2Pix

Requirements

Important Note on Git LFS (Model Weights)

How to Run

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

GenAI Homework 1

Tasks

Ex.1 - GAN (Unconditional Generation)

Ex.2 - CGAN (Conditional Generation)

Ex.3 - WGAN-GP (Unconditional Generation)

Ex.4 - WGAN-GP (Conditional Generation)

Ex.5 (Optional) - Pix2Pix

Requirements

Important Note on Git LFS (Model Weights)

How to Run

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages