Skip to content

tripathisatwik/WeatherDataPipeline

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

13 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Real-Time Weather Event Analytics Pipeline (No-Cloud, Free Stack)

Overview

This project demonstrates a production-style data engineering pipeline built entirely with free, local tools. The pipeline ingests real-time weather data from a public API, lands raw JSON into a local data lake, loads curated data into a PostgreSQL warehouse, transforms it into analytics-ready tables, and orchestrates workflows using Apache Airflow.

Architecture

Public API (Open-Meteo)
→ Python Ingestion
→ Raw Data Lake (local files)
→ PostgreSQL (data warehouse via Docker)
→ Apache Airflow (orchestration + scheduling)

Tech Stack

  • Python
  • SQL
  • Docker
  • PostgreSQL (local warehouse)
  • Apache Airflow (local orchestration)

Project Structure

de-event-pipeline/
├── ingestion/      
├── airflow/        
├── data_lake/      
├── requirements.txt
├── .gitignore
└── README.md

Setup

python -m venv venv
venv\Scripts\activate
pip install -r requirements.txt
python ingestion/fetch_weather.py

Analytics Screenshots

App Screenshot App Screenshot

About

This project showcases a production-style data pipeline using free, local tools. It ingests real-time weather API data into a local data lake, loads it into PostgreSQL, transforms it via dbt for analytics, and orchestrates the entire workflow with Apache Airflow.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors

Languages