Order-to-delivery demand analysis project using SQL-based feature engineering and baseline modeling.
This repository contains an end-to-end workflow:
- Generate synthetic/structured data into a SQLite database
- Build features via SQL aggregations
- Validate a baseline model and evaluation metrics
notebooks/01_eda_order_to_delivery.ipynb— EDA & problem framing02_feature_engineering.ipynb— feature design & validation03_model_validation.ipynb— baseline model & evaluation
src/generate_customers.py— generate customer-related tablesgenerate_products_orders.py— generate products/orders tablesstep1_sql_features.py— SQL feature engineering pipelinecheck_db.py— database sanity checks
data/raw/— SQLite DB (excluded from git)processed/— feature artifacts (excluded from git)
outputs/— figures/results (optional)
Note: Data files (
*.db,*.parquet) are excluded from version control via.gitignore.
pip install -r requirements.txtpython src/generate_customers.py
python src/generate_products_orders.pypython src/step1_sql_features.pyjupyter notebookFigures: outputs/figures/ (if enabled in notebooks)
- Python (pandas, numpy)
- SQLite
- Jupyter Notebook
- scikit-learn (baseline modeling)
For portfolio / demonstration use.