Build core dbt data pipeline with staging and marts models #2
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
Overview
Establish the complete core data pipeline for Flit's e-commerce analytics platform, transforming TheLook dataset and synthetic experiment assignments into analytics-ready models designed to support comprehensive experimentation analytics and ML workloads.
Changes Made
Project Foundation
dbt_project.ymlwith staging/intermediate/marts layer configurationflit_staging,flit_marts).gitkeepfiles to maintain directory structureData Models
Staging Layer: Clean, standardized data models
stg_customers- Customer profiles with demographic and behavioral attributesstg_orders- Order transactions with derived business metricsstg_products- Product catalog with category hierarchiesstg_experiment_assignments- Experiment variant assignments from synthetic dataCore Marts: Business-ready analytics tables
dim_customers- Customer 360° with lifetime metrics and segmentationfct_orders- Order fact table with enriched transaction datacustomer_ltv_base- Lifetime value foundations and cohort analysisproduct_performance- Product analytics for business intelligenceExperimentation Foundation
Data Quality & Testing
Architecture Design
Models are strategically designed to support downstream analytics:
Testing & Validation
Business Impact
Enables data-driven decision making through:
🔗 Related Work
This PR establishes the data foundation for advanced analytics capabilities:
Deployment Notes
flit_stagingandflit_martsdatasets