Skip to content

πŸ“Š Exploratory Data Analysis (EDA) project on Diwali sales data using the Python along with key data analysis libraries including Pandas, Seaborn, and Matplotlib. The project delivers customer behavior insights, segment-wise trends, and business-driven recommendations through data cleaning and visualization.

Notifications You must be signed in to change notification settings

Nandini2233/Diwali_Sales_Analysis_Python

Folders and files

NameName
Last commit message
Last commit date

Latest commit

Β 

History

11 Commits
Β 
Β 
Β 
Β 
Β 
Β 

Repository files navigation

Diwali_Sales_Analysis_Python

πŸ“Š This project presents an end-to-end Exploratory Data Analysis (EDA) on Diwali Sales data with the goal of uncovering actionable business insights for targeted marketing and strategic decision-making. Using Python and core data analysis libraries, the dataset is analyzed to identify key customer segments, spending behaviors, and state-wise revenue distribution during the Diwali festival season. This analysis supports real-world use cases like campaign optimization and customer profiling.


πŸš€ Business Objective

Simulating the role of a junior data analyst at a retail company using Diwali sales data to:

  • 🎯 Discover high-revenue customer segments
  • 🧠 Recommend marketing strategies based on customer behavior
  • πŸ“ˆ Support business decisions using data insights

πŸ› οΈ Tools and Technologies

Tool Purpose
Python (Jupyter Notebook) Analysis environment
Pandas, NumPy Data manipulation & preprocessing
Matplotlib, Seaborn Visualization
Power BI (optional) Business dashboarding

πŸ“‚ Dataset Overview

  • Source: Diwali Sales (fictional dataset)
  • Rows: 11,251
  • Columns: 15
  • Features: Gender, Age Group, Marital Status, Occupation, State, Product Category, Amount, etc.

🧹 Data Cleaning and Preprocessing

  • βœ”οΈ Removed null and missing values
  • βœ”οΈ Dropped non-relevant columns like Status, Unnamed
  • βœ”οΈ Standardized and cleaned categorical variables
  • βœ”οΈ Converted data types as required

πŸ“Š Exploratory Data Analysis

1️⃣ Gender-based Spending

  • πŸ‘©β€πŸ¦° Females made more purchases (by count)
  • πŸ‘¨ Males spent more in total amount

2️⃣ Age Group Contribution

  • πŸ‘₯ Age group 26–35 dominates in both count and spending
  • 🎯 Ideal demographic for campaign targeting

3️⃣ State-wise Performance

  • πŸ₯‡ Top States: Uttar Pradesh, Maharashtra, Karnataka
  • πŸ“ High demand zones for future campaigns

4️⃣ Marital Status Influence

  • πŸ’ Married individuals contributed more to total revenue
  • πŸ›οΈ Family-centric spending patterns

5️⃣ Occupation vs Product Categories

  • πŸ’Ό Working professionals spent the most
  • πŸ“¦ Popular categories: Electronics, Clothing

πŸ“ˆ Key Insights

  • 🎯 Target Segment: Married Males, Aged 26–35, Working Professionals
  • 🌍 Focus States: Uttar Pradesh, Maharashtra, Karnataka
  • 🧾 Female customers make frequent purchases but spend less per transaction

πŸ”¬ Conclusion & Recommendations

  • βœ… Launch personalized Diwali marketing campaigns for target segments
  • βœ… Focus inventory and advertising efforts in top-performing states
  • βœ… Promote high-demand products based on occupational preferences

About

πŸ“Š Exploratory Data Analysis (EDA) project on Diwali sales data using the Python along with key data analysis libraries including Pandas, Seaborn, and Matplotlib. The project delivers customer behavior insights, segment-wise trends, and business-driven recommendations through data cleaning and visualization.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published