eCommerce Transactions Dataset Analysis

This project analyzes an eCommerce transactions dataset to derive business insights, build a lookalike model, and perform customer segmentation. The goal is to demonstrate data science capabilities in exploratory data analysis (EDA), recommendation systems, and clustering techniques.

Project Tasks

1. Exploratory Data Analysis (EDA)

Perform EDA on the dataset to uncover trends, patterns, and actionable insights.
Deliverables:
- Python code (Jupyter Notebook)
- PDF report summarizing 5 key business insights.
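
A first pass at the EDA could aggregate revenue by category and by month. The sketch below is illustrative only: the helper names `revenue_by` and `monthly_revenue` and the sample rows are hypothetical, and the real notebook would work on the merged dataset files instead.

```python
import pandas as pd


def revenue_by(transactions: pd.DataFrame, key: str) -> pd.Series:
    """Total revenue (sum of TotalValue) per value of `key`, largest first."""
    return transactions.groupby(key)["TotalValue"].sum().sort_values(ascending=False)


def monthly_revenue(transactions: pd.DataFrame) -> pd.Series:
    """Total revenue per calendar month of TransactionDate."""
    dates = pd.to_datetime(transactions["TransactionDate"])
    return transactions.groupby(dates.dt.to_period("M"))["TotalValue"].sum()


# Hypothetical sample rows; the real data would come from the CSV files.
sample = pd.DataFrame(
    {
        "TransactionDate": ["2024-01-05", "2024-01-20", "2024-02-11"],
        "Category": ["Books", "Electronics", "Books"],
        "TotalValue": [30.0, 200.0, 45.0],
    }
)

by_category = revenue_by(sample, "Category")
by_month = monthly_revenue(sample)
print(by_category)
print(by_month)
```

The same `revenue_by` helper can be reused with other columns, such as Region, once customer attributes are joined in.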

2. Lookalike Model

Build a recommendation system that, for a given customer, suggests the 3 most similar customers based on profile and transaction history.
Deliverables:
- CSV file containing lookalike recommendations for the first 20 customers.
- Python code (Jupyter Notebook) for model development.
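
One common way to find similar customers is cosine similarity over per-customer feature vectors. This is a minimal sketch under that assumption, not necessarily the method used in the notebook; the function `top_lookalikes`, the feature columns, and the sample values are all hypothetical.

```python
import numpy as np
import pandas as pd


def top_lookalikes(features: pd.DataFrame, n: int = 3) -> dict:
    """Map each customer to its n most similar customers by cosine similarity.

    `features` is indexed by CustomerID with one numeric column per feature.
    """
    # Standardize columns so no single feature dominates the similarity.
    std = features.std(ddof=0).replace(0, 1.0)
    scaled = ((features - features.mean()) / std).to_numpy(dtype=float)

    # Cosine similarity is the dot product of L2-normalized rows.
    norms = np.linalg.norm(scaled, axis=1, keepdims=True)
    norms[norms == 0] = 1.0
    unit = scaled / norms
    sim = unit @ unit.T
    np.fill_diagonal(sim, -np.inf)  # a customer is not its own lookalike

    ids = features.index.to_numpy()
    result = {}
    for i, cid in enumerate(ids):
        best = np.argsort(-sim[i])[:n]  # indices of the n highest scores
        result[cid] = [(ids[j], float(sim[i, j])) for j in best]
    return result


# Hypothetical per-customer features (assumed, not taken from the dataset).
sample_features = pd.DataFrame(
    {
        "total_spend": [100.0, 100.0, 900.0, 850.0, 400.0],
        "n_transactions": [2, 2, 10, 9, 5],
    },
    index=["C0001", "C0002", "C0003", "C0004", "C0005"],
)

lookalikes = top_lookalikes(sample_features)
for customer, matches in lookalikes.items():
    print(customer, matches)
```

Each entry pairs a lookalike's CustomerID with its similarity score, which is the information the CSV deliverable needs for the first 20 customers.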

3. Customer Segmentation / Clustering

Perform clustering to segment customers based on profiles and transactions.
Deliverables:
- Report with clustering results, including the number of clusters and evaluation metrics (e.g., the Davies-Bouldin (DB) Index).
- Python code (Jupyter Notebook) for clustering.
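
As an illustration of how the number of clusters and the DB Index relate, the sketch below fits KMeans for several values of k and keeps the one with the lowest DB Index (lower is better). KMeans is an assumption here, since the README does not name the clustering algorithm; the function `best_kmeans` and the synthetic data are hypothetical.

```python
import numpy as np
from sklearn.cluster import KMeans
from sklearn.metrics import davies_bouldin_score
from sklearn.preprocessing import StandardScaler


def best_kmeans(features: np.ndarray, k_values=range(2, 6), seed: int = 42):
    """Fit KMeans for each k and return (k, labels, db_index) with the lowest DB Index."""
    scaled = StandardScaler().fit_transform(features)
    best = None
    for k in k_values:
        labels = KMeans(n_clusters=k, n_init=10, random_state=seed).fit_predict(scaled)
        score = davies_bouldin_score(scaled, labels)
        if best is None or score < best[2]:
            best = (k, labels, score)
    return best


# Synthetic stand-in for per-customer features: three well-separated blobs.
rng = np.random.default_rng(0)
centers = np.array([[0.0, 0.0], [10.0, 10.0], [0.0, 10.0]])
synthetic = np.vstack([c + rng.normal(scale=0.5, size=(30, 2)) for c in centers])

k, labels, db_index = best_kmeans(synthetic)
print(f"k={k}, DB Index={db_index:.3f}")
```

On real customer data the clusters are rarely this clean, so it is worth reporting the DB Index for every k tried rather than only the best one.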

Dataset

The project uses three files:

Customers.csv:
- CustomerID: Unique customer identifier.
- CustomerName: Name of the customer.
- Region: Customer's region.
- SignupDate: Date of signup.

Products.csv:
- ProductID: Unique product identifier.
- ProductName: Name of the product.
- Category: Product category.
- Price: Product price (USD).

Transactions.csv:
- TransactionID: Unique transaction identifier.
- CustomerID: Customer who made the transaction.
- ProductID: Product involved in the transaction.
- TransactionDate: Date of transaction.
- Quantity: Quantity purchased.
- TotalValue: Total value of the transaction (USD).
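
The three files share the CustomerID and ProductID keys, so they can be joined into a single table for analysis. The sketch below assumes the columns listed above; the function name `merge_tables` and the sample rows are hypothetical.

```python
import pandas as pd


def merge_tables(customers, products, transactions):
    """Join transactions to customer and product attributes on their ID columns."""
    # validate="many_to_one" raises if an ID is duplicated in the lookup table.
    merged = transactions.merge(customers, on="CustomerID", how="left", validate="many_to_one")
    merged = merged.merge(products, on="ProductID", how="left", validate="many_to_one")
    merged["TransactionDate"] = pd.to_datetime(merged["TransactionDate"])
    merged["SignupDate"] = pd.to_datetime(merged["SignupDate"])
    return merged


# Hypothetical rows that follow the column layout described above.
customers = pd.DataFrame(
    {
        "CustomerID": ["C0001", "C0002"],
        "CustomerName": ["Alice", "Bob"],
        "Region": ["Asia", "Europe"],
        "SignupDate": ["2023-01-10", "2023-06-01"],
    }
)
products = pd.DataFrame(
    {
        "ProductID": ["P001", "P002"],
        "ProductName": ["Novel", "Headphones"],
        "Category": ["Books", "Electronics"],
        "Price": [15.0, 80.0],
    }
)
transactions = pd.DataFrame(
    {
        "TransactionID": ["T00001", "T00002", "T00003"],
        "CustomerID": ["C0001", "C0002", "C0001"],
        "ProductID": ["P001", "P002", "P002"],
        "TransactionDate": ["2024-01-05", "2024-01-20", "2024-02-11"],
        "Quantity": [2, 1, 1],
        "TotalValue": [30.0, 80.0, 80.0],
    }
)

merged = merge_tables(customers, products, transactions)
print(merged[["TransactionID", "Region", "Category", "TotalValue"]])
```

With the real files, the three inputs would come from `pd.read_csv` on the CSVs in the data/ directory.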

Repository Structure

├── data/
│   ├── Customers.csv
│   ├── Products.csv
│   └── Transactions.csv
├── notebooks/
│   ├── FirstName_LastName_EDA.ipynb
│   ├── FirstName_LastName_Lookalike.ipynb
│   └── FirstName_LastName_Clustering.ipynb
├── reports/
│   ├── FirstName_LastName_EDA.pdf
│   └── FirstName_LastName_Clustering.pdf
├── results/
│   └── Lookalike.csv
└── README.md

Getting Started

Prerequisites

Python 3.7+
Required libraries:
- pandas
- numpy
- matplotlib
- seaborn
- scikit-learn

Installation

Clone the repository:

git clone https://github.com/<username>/ecommerce-analysis.git
cd ecommerce-analysis

Install required libraries:
```
pip install -r requirements.txt
```
Place the three dataset files (Customers.csv, Products.csv, Transactions.csv) in the data/ directory.

Usage

Run EDA:

Navigate to the notebooks/ directory.
Open and execute the FirstName_LastName_EDA.ipynb notebook to perform EDA and generate insights.

Build Lookalike Model:

Open and execute the FirstName_LastName_Lookalike.ipynb notebook.
Check the output file Lookalike.csv in the results/ directory.

Perform Clustering:

Open and execute the FirstName_LastName_Clustering.ipynb notebook.
Review the clustering report in reports/.

Results

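- EDA: Uncovered trends in customer behavior, product sales, and revenue; the 5 key insights are summarized in the PDF report.
- Lookalike Model: Recommended 3 similar customers for each of the first 20 customers (see Lookalike.csv).
- Clustering: Segmented customers into distinct groups; the number of clusters and evaluation metrics such as the DB Index are given in the clustering report.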

Author

Kanishkar V
- Email: kanishvijay2005@gmail.com
- LinkedIn: www.linkedin.com/in/kanishkar-v-3471782a2/

License

This project is licensed under the MIT License. See the LICENSE file for details.
