This repository contains the cleaning and preprocessing of a CSV dataset named titanic_data.csv.
The tools used are: Python, Pandas, NumPy, Matplotlib/Seaborn.
The file has the following steps:
1. Import the dataset and explore basic info (nulls, data types).
2. Handle missing values using mean, median, and mode imputation.
3. Convert categorical features into numerical ones using encoding.
4. Normalize the numerical features.
5. Visualize outliers using boxplots, then remove them.
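
The steps above can be sketched roughly as follows. This is a minimal illustration, not the repository's actual code: the column names (Age, Fare, Sex, Embarked) are assumed from the commonly distributed Titanic dataset, a small made-up frame stands in for titanic_data.csv, and the specific choices (which column gets which imputation, min-max scaling for normalization, the 1.5 * IQR rule for outlier removal) are assumptions made for the example.

```python
import numpy as np
import pandas as pd

# Made-up stand-in for titanic_data.csv; in practice this would be
# raw = pd.read_csv("titanic_data.csv")
raw = pd.DataFrame({
    "Age": [22.0, 38.0, np.nan, 35.0, 54.0, 2.0],
    "Fare": [7.25, 71.28, 7.92, 53.10, np.nan, 512.33],
    "Sex": ["male", "female", "female", "female", "male", "male"],
    "Embarked": ["S", "C", "S", None, "S", "Q"],
})

# Step 1: explore basic info (null counts and data types).
missing = raw.isnull().sum()
dtypes = raw.dtypes


def clean(df: pd.DataFrame) -> pd.DataFrame:
    """Apply steps 2-5 to a Titanic-like frame (column names are assumed)."""
    df = df.copy()

    # Step 2: impute missing values. Median for Age, mean for Fare,
    # mode for the categorical Embarked column (assumed assignment).
    df["Age"] = df["Age"].fillna(df["Age"].median())
    df["Fare"] = df["Fare"].fillna(df["Fare"].mean())
    df["Embarked"] = df["Embarked"].fillna(df["Embarked"].mode()[0])

    # Step 3: encode categorical features as numbers.
    df["Sex"] = df["Sex"].map({"male": 0, "female": 1})
    df = pd.get_dummies(df, columns=["Embarked"], dtype=int)

    # Step 4: min-max normalize the numerical features.
    for col in ["Age", "Fare"]:
        lo, hi = df[col].min(), df[col].max()
        df[col] = (df[col] - lo) / (hi - lo)

    # Step 5: a boxplot such as df.boxplot(column="Fare") (needs Matplotlib)
    # shows the outliers; here rows outside 1.5 * IQR on Fare are dropped.
    q1, q3 = df["Fare"].quantile([0.25, 0.75])
    iqr = q3 - q1
    keep = df["Fare"].between(q1 - 1.5 * iqr, q3 + 1.5 * iqr)
    return df[keep].reset_index(drop=True)


cleaned = clean(raw)
print(missing)
print(cleaned)
```

Because outliers are removed after scaling in this order, the remaining Fare values no longer span the full 0 to 1 range; removing outliers before normalizing is a reasonable alternative if that matters.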