ASL_Recognition

This project demonstrates the use of deep learning models to recognize American Sign Language (ASL) letters. It was created as a cumulative project for the Deep Learning with PyTorch course at Fanshawe College, showcasing the application of advanced neural network architectures in solving real-world problems.

Features

- Model Comparison: Explore ResNet18, ResNet50, and a custom convolutional neural network (CNN) model.
- Real-Time Predictions: Get predictions for ASL letters with confidence scores displayed as a bar chart.
- Custom Image Upload: Test the models with your own ASL letter images.
- Dataset Visualization: View samples from the Sign Language MNIST dataset.
- Interactive Dashboard: A user-friendly Gradio interface for seamless interaction.

Getting Started

Prerequisites

Ensure you have the following installed:

- Python 3.8 or later
- Required libraries: torch, torchvision, gradio, pandas, numpy, plotly, Pillow

Install the dependencies using pip (run this from the repository root, after cloning):

pip install -r requirements.txt

Clone the Repository

git clone https://github.com/yourusername/sign-language-recognition.git
cd sign-language-recognition

Running the Application

Download the Dataset

- Download the Sign Language MNIST dataset.
- Extract the dataset and place the CSV files (sign_mnist_train.csv, sign_mnist_test.csv) in the Extracted_SignLanguageMNIST folder.

Train the Models

If you haven't already trained the models, use the training script provided in the repository to train and save the models (ResNet18, ResNet50, and the custom CNN).

Launch the Dashboard

Run the app_signlanguageMNIST.ipynb notebook to launch the dashboard.

Access the Dashboard

Open the Gradio app in your browser (usually at http://127.0.0.1:7860/).

File Structure

sign-language-recognition/
│
├── Extracted_SignLanguageMNIST/
│   ├── sign_mnist_train.csv
│   ├── sign_mnist_test.csv
│
├── saved_models/
│   ├── trained_resnet18.pth
│   ├── trained_resnet50.pth
│   ├── trained_custom.pth
│
├── app_signlanguageMNIST.ipynb  # Notebook that launches the dashboard
├── models.ipynb                 # Model notebook
├── README.md        # Project documentation
├── requirements.txt # List of dependencies
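
Files with a .pth extension, like those under saved_models/, are usually written with torch.save. The README does not say whether they hold a state_dict or a whole pickled model, so the following is only a minimal sketch of the state_dict pattern. TinyNet, save_weights, and load_weights are hypothetical names, and TinyNet is a stand-in for the real architectures.

```python
import os

import torch
from torch import nn


class TinyNet(nn.Module):
    """Hypothetical stand-in; the project itself uses ResNets and a custom CNN."""

    def __init__(self, num_classes: int = 26):
        super().__init__()
        self.fc = nn.Linear(28 * 28, num_classes)

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        return self.fc(x.flatten(start_dim=1))


def save_weights(model: nn.Module, path: str) -> None:
    """Write only the learned parameters (the state_dict) to disk."""
    os.makedirs(os.path.dirname(path) or ".", exist_ok=True)
    torch.save(model.state_dict(), path)


def load_weights(model: nn.Module, path: str) -> nn.Module:
    """Load parameters into an already-constructed model and switch to eval mode."""
    state = torch.load(path, map_location="cpu")
    model.load_state_dict(state)
    model.eval()
    return model
```

With this pattern the model class must be constructed with the same architecture before its weights are loaded, for example load_weights(TinyNet(), "saved_models/trained_custom.pth").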

How It Works

- Model Inference: The selected model processes the input image to predict the ASL letter.
- Confidence Visualization: Confidence scores for all classes are displayed as a bar chart.
- Real-Time Updates: The dashboard updates predictions as you interact with it, providing an intuitive user experience.
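
The inference and confidence steps above can be sketched roughly as follows. This is an illustration under stated assumptions, not the notebook's actual code: preprocess and predict are hypothetical helper names, inputs are assumed to be scaled to [0, 1] at 28x28, and the model is assumed to emit one logit per raw dataset label (0-25, where the J and Z positions are never used).

```python
import numpy as np
import torch
from PIL import Image
from torch import nn

# Assumption: output index i corresponds to the i-th letter of the alphabet.
LETTERS = [chr(ord("A") + i) for i in range(26)]


def preprocess(image: Image.Image) -> torch.Tensor:
    """Convert any image to a 1x1x28x28 float tensor with values in [0, 1]."""
    gray = image.convert("L").resize((28, 28))
    pixels = np.asarray(gray, dtype=np.float32) / 255.0
    return torch.from_numpy(pixels).unsqueeze(0).unsqueeze(0)


def predict(model: nn.Module, image: Image.Image) -> dict:
    """Return a {letter: confidence} mapping whose values sum to 1."""
    model.eval()
    with torch.no_grad():
        logits = model(preprocess(image))        # shape (1, num_classes)
        probs = torch.softmax(logits, dim=1)[0]  # shape (num_classes,)
    return {LETTERS[i]: float(p) for i, p in enumerate(probs)}
```

A mapping of this shape is convenient for plotting per-class confidences as a bar chart, and the letter with the highest value is the predicted sign.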

Dataset

The models are trained on the Sign Language MNIST dataset. The dataset contains 28x28 grayscale images covering 24 ASL letters; J and Z are excluded because signing them requires motion.
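
As a rough sketch of how the CSV files can be turned into image arrays: the code below assumes the usual Sign Language MNIST layout (a header row, then a label column followed by 784 pixel columns per image) and that labels index the alphabet directly. load_sign_mnist and label_to_letter are hypothetical helper names; pandas.read_csv would work equally well for the parsing step.

```python
import numpy as np


def load_sign_mnist(source):
    """Load a Sign Language MNIST CSV into (images, labels).

    Assumes a header row, then one row per image: the label followed by
    784 pixel values (0-255) in row-major order.
    """
    data = np.loadtxt(source, delimiter=",", skiprows=1, ndmin=2)
    labels = data[:, 0].astype(np.int64)
    # Reshape the flat pixel rows to 28x28 and scale to [0, 1].
    images = data[:, 1:].reshape(-1, 28, 28).astype(np.float32) / 255.0
    return images, labels


def label_to_letter(label: int) -> str:
    """Map a dataset label to its letter (0 -> A, 1 -> B, ...)."""
    return chr(ord("A") + int(label))
```

For example, load_sign_mnist("Extracted_SignLanguageMNIST/sign_mnist_train.csv") would return the training images and their labels.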

Models

ResNet18

A deep residual network with 18 layers.
Designed to handle vanishing gradients effectively using residual connections.

ResNet50

A deeper version of ResNet with 50 layers.
Suitable for large-scale image recognition tasks.

Custom CNN

A lightweight convolutional neural network tailored for the ASL recognition task.
Includes convolutional layers, pooling layers, and fully connected layers.
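
To make the convolution, pooling, and fully connected structure concrete, here is a minimal sketch of a small CNN for 1x28x28 inputs. It is not the repository's model: SmallASLNet, its layer sizes, and the 26-way output are all illustrative assumptions.

```python
import torch
from torch import nn


class SmallASLNet(nn.Module):
    """Hypothetical lightweight CNN for 1x28x28 inputs."""

    def __init__(self, num_classes: int = 26):
        super().__init__()
        self.features = nn.Sequential(
            nn.Conv2d(1, 32, kernel_size=3, padding=1),   # 1x28x28 -> 32x28x28
            nn.ReLU(),
            nn.MaxPool2d(2),                              # -> 32x14x14
            nn.Conv2d(32, 64, kernel_size=3, padding=1),  # -> 64x14x14
            nn.ReLU(),
            nn.MaxPool2d(2),                              # -> 64x7x7
        )
        self.classifier = nn.Sequential(
            nn.Flatten(),
            nn.Linear(64 * 7 * 7, 128),
            nn.ReLU(),
            nn.Linear(128, num_classes),                  # raw logits
        )

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        return self.classifier(self.features(x))
```

The network returns raw logits, so it pairs with nn.CrossEntropyLoss during training and with a softmax at inference time.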

Purpose

This project was created as a cumulative project for the Deep Learning with PyTorch course at Fanshawe College. It demonstrates the application of deep learning techniques in accessibility-focused technology, providing a foundation for further research and development in sign language recognition.

Future Enhancements

- Add support for real-time camera input to recognize ASL letters.
- Implement heatmap visualizations (e.g., Grad-CAM) to interpret model predictions.
- Train models on larger datasets for improved accuracy.

Contributing

Contributions are welcome! Please follow these steps:

1. Fork the repository.
2. Create a new branch for your feature or bug fix.
3. Commit your changes.
4. Submit a pull request.

License

This project is licensed under the MIT License. See the LICENSE file for details.

Contact

For more information or to contribute to this project, please reach out:

Author: Paige Berrigan
GitHub: @paigeberrigan
Email: paige@interweavemediagroup.ca
