Skip to content

Implements a machine learning-based SMS spam detection system. It classifies incoming text messages as Spam or Ham (Not Spam) using Natural Language Processing (NLP) techniques and supervised learning algorithms.

License

Notifications You must be signed in to change notification settings

Mulla6518/SMSSpamTextClassifier

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

4 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

SMS Spam Text Classifier

Overview

This project implements a machine learning-based SMS spam detection system. It classifies incoming text messages as Spam or Ham (Not Spam) using Natural Language Processing (NLP) techniques and supervised learning algorithms.

Features

  • Preprocessing of SMS text (tokenization, stopword removal, stemming)
  • Feature extraction using TF-IDF or Bag of Words
  • Model training with algorithms like Naive Bayes, Logistic Regression, or SVM
  • Evaluation metrics: Accuracy, Precision, Recall, F1-score
  • Easy-to-use script for predictions on new messages

Project Structure

SMSSpamTextClassifier/
│
├── data/                # Dataset (e.g., SMS Spam Collection)
├── notebooks/           # Jupyter notebooks for EDA and model training
├── src/                 # Source code for preprocessing and model pipeline
├── models/              # Saved trained models
├── requirements.txt     # Python dependencies
└── README.md            # Project documentation

Installation

Clone the repository:

git clone https://github.com/Mulla6518/SMSSpamTextClassifier.git

Install dependencies:

python 3

Usage Train the model:

Predict a message:

python script.py --data spam.csv

License This project is licensed under the MIT License.

About

Implements a machine learning-based SMS spam detection system. It classifies incoming text messages as Spam or Ham (Not Spam) using Natural Language Processing (NLP) techniques and supervised learning algorithms.

Resources

License

Stars

Watchers

Forks

Packages

No packages published

Contributors 2

  •  
  •  

Languages