π― Aspiring Cloud Data Engineer | π οΈ SQL & ETL Enthusiast | π Transforming messy data into reliable pipelines
Iβm a motivated and detail-driven Cloud Data Engineer-in-training, with professional experience in incident and ETL, now transitioning into the world of data engineering.
My focus is on hands-on, project-based learning, where each GitHub repository reflects real-world scenarios, complex SQL transformations, and data pipeline automation using tools like Docker, PostgreSQL, and Python.
- Languages & Tools: SQL (PostgreSQL, T-SQL), Python (Pandas)/(learning), Git, Docker, Batch
- ETL Pipelines: SQL-based extraction, regex-powered transformation, conditional repair logic
- Data Cleaning: Contextual inference, null handling, regex validation
- Automation: Dockerized pipelines, Task Scheduler, GitHub Actions (CI/CD)
- Cloud Tools: Apache Airflow with Docker orchestration
- Databases: PostgreSQL, MSSQL, DBeaver
A complete data engineering project simulating real-world ETL operations. It includes raw ingestion, complex SQL logic repair using CTEs, Dockerized orchestration, and scheduled automation.
Key highlights:
- Built a multi-step SQL pipeline for cleaning
item,payment_method, andlocationusing regex-safe contextual logic - Used CTEs and layered subqueries to preserve clean values while repairing invalid entries
- Implemented Docker-based containerization and automated with Windows Task Scheduler
- Structured and documented for professional GitHub presentation
π View Project
Iβm growing my expertise step-by-step through guided mini-projects learning. Here's my roadmap:
- β PostgreSQL-Based SQL Transformation Projects
- β Dockerized ETL Pipelines
- β Apache Airflow Orchestration
- β Multi-Source Data Integration and Warehousing
- π Python for Data Engineering
- π Cloud Platforms (AWS / Azure)
- π dbt for Analytics Engineering
- πΌ LinkedIn
- π§ Iβm open to collaboration, feedback, and continuous learning!
βBuild systems that make data reliable, not just available.β

