Skip to content
View tushar-data-engineer's full-sized avatar
🎯
Focusing
🎯
Focusing

Block or report tushar-data-engineer

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Hi, I'm Tushar Shinde πŸ‘‹

πŸš€ Data Engineer | Python | SQL | PySpark | AWS | Microsoft Fabric

I am a Data Engineer with 3+ years of experience designing and building scalable data pipelines, working with cloud platforms, and implementing modern data architectures such as Lakehouse and Medallion Architecture.

I enjoy building end-to-end data engineering projects, solving data problems, and optimizing data pipelines for analytics and reporting.


πŸ›  Tech Stack

Python SQL PySpark AWS Microsoft Fabric ETL

Technologies I work with

  • Programming: Python, SQL
  • Big Data: PySpark, Spark
  • Cloud Platforms: AWS, Microsoft Fabric
  • AWS Services: S3, Glue, Lambda
  • Data Integration: Azure Data Factory (ADF)
  • Data Architecture: Lakehouse, Medallion Architecture
  • Data Processing: ETL Pipelines, Batch Processing
  • Data Warehousing: SQL-based analytics systems

πŸ“‚ Data Engineering Projects

πŸš€ Fabric SocialPulse Lakehouse Platform

End-to-end data pipeline built using Microsoft Fabric implementing Medallion Architecture (Bronze β†’ Silver β†’ Gold) to ingest API data, validate data quality, and produce analytics-ready datasets.

🏦 Banking Fraud Detection Data Pipeline

Designed a data pipeline to process transaction data and identify suspicious activity patterns for fraud detection.

πŸ“Š Fabric Data Engineering Project

Built a data pipeline using Fabric pipelines, Lakehouse storage, and PySpark notebooks for scalable data processing.

☁️ AWS Medallion Data Pipeline

Implemented a Medallion Architecture pipeline on AWS using S3, Glue, and Spark for large-scale data processing.

πŸš— Vehicle Data Pipeline on AWS

Created a cloud-based pipeline for processing connected vehicle data and generating insights from telemetry datasets.

πŸ”₯ PySpark Data Processing Project

Built scalable PySpark workflows for data transformation and aggregation.

🏒 SQL Data Warehouse Project

Designed dimensional models and optimized SQL queries for analytics workloads.


πŸ“ˆ What I'm Currently Working On

  • Building real-world data engineering projects
  • Learning advanced Microsoft Fabric data pipelines
  • Improving PySpark performance optimization
  • Implementing data quality frameworks

πŸ“« Connect With Me

πŸ’Ό LinkedIn https://www.linkedin.com/in/tushar-shinde-1207

πŸ“§ Email tushar.shinde1207@gmail.com

πŸ“ Location Pune, India

Pinned Loading

  1. banking-fraud-detection-data-pipeline banking-fraud-detection-data-pipeline Public

    data-engineering aws pyspark fraud-detection data-pipeline etl-pipeline aws-glue data-lake medallion-architecture banking-analytics

    Python

  2. fabric-data-engineering-project fabric-data-engineering-project Public

    Data engineering pipeline built using Microsoft Fabric that extracts data from a public API and stores it in a Lakehouse using a Medallion Architecture (Bronze, Silver, Gold). The pipeline definiti…

    Jupyter Notebook

  3. fabric-socialpulse-lakehouse-platform fabric-socialpulse-lakehouse-platform Public

    End-to-end data engineering pipeline built on Microsoft Fabric implementing Medallion Architecture (Bronze–Silver–Gold) to ingest API data, perform data quality validation, transform data using PyS…

    Jupyter Notebook