This notebook contains Task 1 from my Data Science internship, where I performed detailed Exploratory Data Analysis (EDA) and Data Preprocessing on a restaurant dataset.
The goal was to analyze and prepare the dataset by understanding its structure, handling missing values, correcting data types, and exploring the target variable.
- Loaded CSV data using `pandas`
- Checked the number of rows and columns (`df.shape`)
- Previewed the first few records with `df.head()`
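
The loading and inspection steps can be sketched roughly as follows. This is a minimal illustration, not the notebook's actual code: the in-memory CSV text, the `Restaurant Name` column, and the three sample rows are made up so the snippet runs on its own, since the real file name and contents are not given here.

```python
import io

import pandas as pd

# Hypothetical stand-in for the restaurant CSV (rows and the
# "Restaurant Name" column are invented for illustration).
csv_text = """Restaurant Name,Cuisines,Has Table booking
Cafe A,Italian,Yes
Cafe B,,No
Cafe C,Chinese,No
"""

# With a real file on disk, pass its path to pd.read_csv instead.
df = pd.read_csv(io.StringIO(csv_text))

# (rows, columns) tuple describing the size of the dataset.
print(df.shape)

# First records (up to five by default) for a quick visual check.
print(df.head())
```
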
- Identified 9 missing values in the `Cuisines` column
- Handled them using `fillna("Unknown")`
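
A small sketch of the missing-value step, assuming a toy `Cuisines` column with two gaps instead of the nine found in the real data. The sample values are hypothetical; only the column name and the `fillna("Unknown")` call come from the notebook summary.

```python
import pandas as pd

# Toy data: two missing entries stand in for the nine in the real dataset.
df = pd.DataFrame({"Cuisines": ["Italian", None, "Chinese", None]})

# Count missing values before the fix.
missing_before = df["Cuisines"].isna().sum()

# Replace missing cuisines with an explicit placeholder label.
df["Cuisines"] = df["Cuisines"].fillna("Unknown")

# Confirm that no missing values remain.
missing_after = df["Cuisines"].isna().sum()

print(missing_before, missing_after)
```

Using a placeholder keeps every row in the dataset, at the cost of introducing an artificial `"Unknown"` category that later analysis has to account for.
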
- Converted 4 columns (`Has Table booking`, `Has Online delivery`, `Is delivering now`, `Switch to order menu`) from `"Yes"`/`"No"` (object) to `True`/`False` (boolean)
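
One possible way to do this conversion is a dictionary lookup with `Series.map`; the notebook may have used a different method. The sample rows below are invented, and the sketch assumes every value is exactly `"Yes"` or `"No"`.

```python
import pandas as pd

# The four Yes/No columns named in the summary.
bool_columns = [
    "Has Table booking",
    "Has Online delivery",
    "Is delivering now",
    "Switch to order menu",
]

# Hypothetical sample rows for illustration only.
df = pd.DataFrame({
    "Has Table booking": ["Yes", "No", "No"],
    "Has Online delivery": ["No", "Yes", "No"],
    "Is delivering now": ["No", "No", "No"],
    "Switch to order menu": ["No", "No", "No"],
})

yes_no_map = {"Yes": True, "No": False}

for col in bool_columns:
    # Any value outside the mapping would become NaN, so unexpected
    # labels show up as missing values instead of being silently coerced.
    df[col] = df[col].map(yes_no_map)

print(df.dtypes)
```
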
- Counted and visualized rating distribution
- Identified a high number of unrated entries (`0.0`)
- Noted class imbalance, with most ratings falling between 3.0 and 4.0
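
The rating checks could look something like the sketch below. The column name `Aggregate rating` and the eight sample values are assumptions made for illustration; the summary does not name the target column or give its actual counts.

```python
import pandas as pd

# Hypothetical ratings; 0.0 marks an unrated restaurant.
ratings = pd.Series(
    [0.0, 0.0, 0.0, 3.1, 3.4, 3.4, 3.8, 4.2],
    name="Aggregate rating",  # assumed column name
)

# Frequency of each rating value, ordered by rating.
rating_counts = ratings.value_counts().sort_index()

# Share of entries that carry no rating at all.
unrated_share = (ratings == 0.0).mean()

# Among rated entries, the share falling between 3.0 and 4.0 inclusive.
rated = ratings[ratings > 0.0]
share_3_to_4 = rated.between(3.0, 4.0).mean()

print(rating_counts)
print(unrated_share, share_3_to_4)
```

With matplotlib installed, `rating_counts.plot(kind="bar")` draws the same counts as a bar chart, which makes the spike at `0.0` easy to see.
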
- Python
- Pandas
- Jupyter Notebook