Skip to content
View prudvikomerelli's full-sized avatar

Block or report prudvikomerelli

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
prudvikomerelli/README.md

πŸš€ Prudvi S R Komerelli

Staff / Principal Data Engineer | Azure | Databricks | Snowflake | Data Platforms | AI Systems

I’m a data engineer with 13+ years of experience building scalable data platforms, lakehouse architectures, and analytics systems across Azure, Snowflake, Databricks, and AWS.

I specialize in designing end-to-end data platforms that transform raw operational data into reliable, analytics-ready datasets used for business decision-making, reporting, and machine learning.


πŸ”§ Core Expertise

  • Cloud Data Platforms: Azure (ADF, Databricks, Synapse), Snowflake, AWS
  • Data Engineering: PySpark, SQL, Python, ETL/ELT, orchestration
  • Lakehouse Architecture: Delta Lake, medallion design, large-scale pipelines
  • Analytics & BI: Power BI, Tableau, OBIEE
  • Data Governance: data quality, compliance, reliability

πŸ“Œ Featured Projects

πŸ”Ή ResumeAI β€” AI-Powered SaaS Platform

End-to-end AI SaaS that generates ATS-optimized resumes and cover letters.
Built with Next.js, Supabase, Prisma, Stripe, and LLM pipelines.

Focus: Product engineering, AI workflows, system design


πŸ”Ή OpenWeather ETL Pipeline (Airflow)

Production-style ETL pipeline using Airflow, Python, PostgreSQL, and Docker.

Features:

  • Dynamic task mapping (TaskFlow API)
  • Layered data modeling
  • Idempotent loads
  • Dockerized local environment

πŸ”Ή NYC Taxi Streaming Data Platform

Real-time pipeline using Kafka, Spark, Airflow, PostgreSQL, and Superset.

Focus:

  • Streaming ingestion
  • Distributed processing
  • End-to-end analytics pipeline

🧠 What I Focus On

I enjoy building systems that:

  • Scale reliably
  • Produce trusted data
  • Support real business decisions
  • Evolve with growing data needs

πŸ“« Connect With Me

Pinned Loading

  1. ResumeAI ResumeAI Public

    AI-powered SaaS platform that generates ATS-optimized resumes and cover letters using LLMs, with scoring, keyword analysis, and Stripe-based billing.

    TypeScript

  2. openweather-airflow-postgres openweather-airflow-postgres Public

    End-to-end ETL pipeline using Apache Airflow to ingest OpenWeather API data, transform it with Python, and load analytics-ready tables into PostgreSQL with Dockerized local setup.

    Python

  3. nyc-taxi-data-pipeline nyc-taxi-data-pipeline Public

    Real-time data pipeline using Kafka, Spark, and Airflow to process NYC taxi data and deliver analytics via PostgreSQL and Superset.

    Python