WeatherVision: Multi-class Weather Classification

This repository contains a project for multi-class weather classification using two different approaches: a classic deep learning model and a large language model (LLM) with vision capabilities.

Features

Two Model Implementations: Compare a classic computer vision model (ResNet-18) with a modern multi-modal vision model (Gemma 3).
Data Preparation: Scripts to download and prepare the dataset for training.
Model Training & Evaluation: Train the ResNet-18 model and evaluate both models to generate classification reports, confusion matrices, and ROC curves.
Prediction: Use either model to predict the weather category of new images.

Dataset

The project uses the "Multi-class Weather Dataset for Image Classification" from Kaggle. The download_data.py script can be used to download the dataset using the Kaggle API.

The dataset should be placed in the dataset directory.

Installation

Clone the repository:

git clone <repository-url>
cd weathervision

Install dependencies: Make sure you have Python 3 installed. You can install the required packages using pip:
```
pip install -r requirements.txt
pip install scikit-learn matplotlib seaborn tqdm
```
Ollama Setup (for Gemma 3 model): To use the Gemma 3 model, you need to have Ollama installed. Follow the instructions on the Ollama website.

Usage

This project provides two distinct models for weather classification. Below are the instructions for using each.

Model 1: ResNet-18 Transfer Learning

This approach uses a pre-trained ResNet-18 model and fine-tunes it on the weather dataset.

Download the data:
```
python download_data.py
```
Prepare the data:
```
python prepare_data.py
```
Train the model:
```
python train.py
```
Evaluate the model:
```
python evaluate.py
```

Make predictions:

python predict.py --image <path/to/image>

Model 2: Gemma 3 Vision Model

This approach uses the gemma3:4b-it-qat multi-modal model running locally with Ollama to perform zero-shot classification.

Classify a single image: Make sure the Ollama service is running. The first time you run the script, it will automatically download the Gemma 3 model.
```
python classify_ollama.py --image <path/to/your/image.jpg>
```
Evaluate the model: This script will evaluate the Gemma 3 model on a sample of the dataset (or the full dataset if you configure it) and generate a classification report, confusion matrix, and ROC curves.
```
python evaluate_vision_model.py
```

Results

Below are the performance results for both models.

Model 1: ResNet-18 Results

Classification Report:

Class	Precision	Recall	F1-Score	Support
Cloudy	0.95	0.98	0.97	60
Rain	1.00	0.98	0.99	43
Shine	0.98	0.92	0.95	51
Sunrise	0.97	1.00	0.99	71
---	---	---	---	---
Accuracy			0.97	225
Macro Avg	0.98	0.97	0.97	225
Weighted Avg	0.97	0.97	0.97	225

Confusion Matrix	ROC Curves

Model 2: Gemma 3 Results

Classification Report:

Class	Precision	Recall	F1-Score	Support
Cloudy	1.00	0.70	0.82	57
Rain	0.83	1.00	0.91	34
Shine	0.86	1.00	0.93	62
---	---	---	---	---
Accuracy			0.89	153
Macro Avg	0.90	0.90	0.89	153
Weighted Avg	0.91	0.89	0.88	153

Confusion Matrix	ROC Curves

Dependencies

The project relies on the following Python libraries:

torch
torchvision
scikit-learn
kaggle
kagglehub
pandas
matplotlib
numpy
tqdm
requests
seaborn

You can install all dependencies by running pip install -r requirements.txt and pip install scikit-learn matplotlib seaborn tqdm.

Name		Name	Last commit message	Last commit date
Latest commit History 13 Commits
.gitignore		.gitignore
README.md		README.md
classify_ollama.py		classify_ollama.py
confusion_matrix.png		confusion_matrix.png
download_data.py		download_data.py
evaluate.py		evaluate.py
evaluate_vision_model.py		evaluate_vision_model.py
gui.py		gui.py
model.pth		model.pth
ollama_confusion_matrix.png		ollama_confusion_matrix.png
ollama_roc_curves.png		ollama_roc_curves.png
predict.py		predict.py
prepare_data.py		prepare_data.py
requirements.txt		requirements.txt
roc_curves.png		roc_curves.png
train.py		train.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

WeatherVision: Multi-class Weather Classification

Table of Contents

Features

Dataset

Installation