This repository contains machine learning projects for classifying breast cancer tumors as malignant or benign using diagnostic features.
It demonstrates a full data science workflow β from data preprocessing and exploratory analysis to model training, evaluation, and comparison.
- Goal: Build accurate classification models for early breast cancer detection.
- Dataset: Breast Cancer Wisconsin (Diagnostic) dataset.
- Approach:
- Data cleaning & preprocessing
- Exploratory Data Analysis (EDA) with visualizations
- Training multiple machine learning models
- Comparing performance metrics
- Visualizing model results
-
Breast_Cancer_Classification.ipynb
β Clean notebook focusing on the end-to-end classification pipeline. -
Breast_Cancer_Classification_2.ipynb
β Extended notebook with additional results and analysis, including visual outputs for immediate review. -
README.md
β Project documentation.
- Python 3.x
- NumPy, Pandas β data handling
- Matplotlib, Seaborn β visualization
- Scikit-learn β machine learning models & evaluation
- Jupyter Notebook
- Logistic Regression
- Support Vector Machine (SVM)
- Decision Tree
- Random Forest
- K-Nearest Neighbors (KNN)
Performance evaluated using:
- Accuracy
- Precision
- Recall
- F1-Score
- Confusion Matrix