Skip to content
View Aakash-pal's full-sized avatar
🏠
Working from home
🏠
Working from home
  • India
  • 17:30 (UTC +05:30)

Block or report Aakash-pal

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Aakash-pal/README.md

πŸ‘‹ Hi, I'm Aakash Pal

🎯 Aspiring Cloud Data Engineer | πŸ› οΈ SQL & ETL Enthusiast | πŸ“Š Transforming messy data into reliable pipelines


πŸš€ About Me

I’m a motivated and detail-driven Cloud Data Engineer-in-training, with professional experience in incident and ETL, now transitioning into the world of data engineering.

My focus is on hands-on, project-based learning, where each GitHub repository reflects real-world scenarios, complex SQL transformations, and data pipeline automation using tools like Docker, PostgreSQL, and Python.


πŸ”§ Skills & Technologies

  • Languages & Tools: SQL (PostgreSQL, T-SQL), Python (Pandas)/(learning), Git, Docker, Batch
  • ETL Pipelines: SQL-based extraction, regex-powered transformation, conditional repair logic
  • Data Cleaning: Contextual inference, null handling, regex validation
  • Automation: Dockerized pipelines, Task Scheduler, GitHub Actions (CI/CD)
  • Cloud Tools: Apache Airflow with Docker orchestration
  • Databases: PostgreSQL, MSSQL, DBeaver

πŸ“‚ Featured Projects

🧼 ETL CafΓ© Sales Pipeline β€” From Raw to Reliable

A complete data engineering project simulating real-world ETL operations. It includes raw ingestion, complex SQL logic repair using CTEs, Dockerized orchestration, and scheduled automation.

Key highlights:

  • Built a multi-step SQL pipeline for cleaning item, payment_method, and location using regex-safe contextual logic
  • Used CTEs and layered subqueries to preserve clean values while repairing invalid entries
  • Implemented Docker-based containerization and automated with Windows Task Scheduler
  • Structured and documented for professional GitHub presentation

πŸ”— View Project


πŸ“˜ Current Learning Journey

I’m growing my expertise step-by-step through guided mini-projects learning. Here's my roadmap:

  • βœ… PostgreSQL-Based SQL Transformation Projects
  • βœ… Dockerized ETL Pipelines
  • βœ… Apache Airflow Orchestration
  • βœ… Multi-Source Data Integration and Warehousing
  • πŸ”„ Python for Data Engineering
  • πŸ”„ Cloud Platforms (AWS / Azure)
  • πŸ”„ dbt for Analytics Engineering

πŸ“Š GitHub Stats

GitHub stats

GitHub Streak


πŸ’¬ Let’s Connect

  • πŸ’Ό LinkedIn
  • 🧠 I’m open to collaboration, feedback, and continuous learning!

β€œBuild systems that make data reliable, not just available.”

Pinned Loading

  1. rottingresearch/rottingresearch rottingresearch/rottingresearch Public

    A project devoted to helping academics and researchers provide robust citations and mitigate link rot.

    HTML 20 20

  2. etl-cafe-sales etl-cafe-sales Public

    End-to-end Python ETL pipeline: cleans and loads cafe sales data into PostgreSQL, containerized with Docker, automated via Task Scheduler.

    Python