This project explores the use of Deep Q-Learning for image classification task on the MNIST dataset.
The following table summarizes the various metrics for the classification task:
Rewards and Epsilon During Training:
Evaluation Metrics on the trained model:
| Metric | Value |
|---|---|
| Accuracy | 95.72% |
| F1-Score | 0.96 |
| Precision | 0.96 |
| Recall | 0.96 |
Heatmap for predicted vs true labels:
To replicate the results of this project, follow these steps:
Run the following command to create a virtual environment:
python3 -m venv .venvPoetry is the dependency management tool used for this project. It simplifies package management, environment setup, and dependency tracking. Install Poetry by following the instructions on their official website.
Once Poetry is installed, use it to set up the project’s dependencies:
(.venv) poetry installExecute the main script to start the training:
(.venv) python main.py
