Skip to content

docs: Update README to surface ML/data-science concepts#1

Draft
Copilot wants to merge 2 commits intomasterfrom
copilot/update-readme-machine-learning
Draft

docs: Update README to surface ML/data-science concepts#1
Copilot wants to merge 2 commits intomasterfrom
copilot/update-readme-machine-learning

Conversation

Copy link
Copy Markdown

Copilot AI commented Feb 28, 2026

The README only described setup steps with no context on the underlying data-science patterns the project demonstrates.

Changes

  • New "Machine Learning / Data Science Concepts" section — table mapping each concept to its concrete location in the codebase:

    • ETL Pipeline (ReadMapWrite class separation)
    • Data Cleaning / Pre-processing (stripping $/,, casting to float)
    • Categorical Feature Mapping / Label Encoding (mapper.map_data() merging raw Amazon categories via categories.csv lookup)
    • Feature Engineering (deriving month from Order Date for time-series aggregation)
    • Data Aggregation (groupby/sum, pivot_table)
    • Pandas DataFrame API as the ML-ecosystem backbone
  • Prerequisites — inlined the categories.csv schema example (was previously buried in data/lookup/readme.md)

  • Output section — lists and describes the three generated CSV files (orders_by_monthly_spend, orders_by_category, orders_by_parent_category)


🔒 GitHub Advanced Security automatically protects Copilot coding agent pull requests. You can protect all pull requests by enabling Advanced Security for your repositories. Learn more about Advanced Security.

Co-authored-by: tedsteinmann <2926908+tedsteinmann@users.noreply.github.com>
Copilot AI changed the title [WIP] Update README to highlight machine learning concepts docs: Update README to surface ML/data-science concepts Feb 28, 2026
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants