π Data Engineer | Python | SQL | PySpark | AWS | Microsoft Fabric
I am a Data Engineer with 3+ years of experience designing and building scalable data pipelines, working with cloud platforms, and implementing modern data architectures such as Lakehouse and Medallion Architecture.
I enjoy building end-to-end data engineering projects, solving data problems, and optimizing data pipelines for analytics and reporting.
- Programming: Python, SQL
- Big Data: PySpark, Spark
- Cloud Platforms: AWS, Microsoft Fabric
- AWS Services: S3, Glue, Lambda
- Data Integration: Azure Data Factory (ADF)
- Data Architecture: Lakehouse, Medallion Architecture
- Data Processing: ETL Pipelines, Batch Processing
- Data Warehousing: SQL-based analytics systems
End-to-end data pipeline built using Microsoft Fabric implementing Medallion Architecture (Bronze β Silver β Gold) to ingest API data, validate data quality, and produce analytics-ready datasets.
Designed a data pipeline to process transaction data and identify suspicious activity patterns for fraud detection.
Built a data pipeline using Fabric pipelines, Lakehouse storage, and PySpark notebooks for scalable data processing.
Implemented a Medallion Architecture pipeline on AWS using S3, Glue, and Spark for large-scale data processing.
Created a cloud-based pipeline for processing connected vehicle data and generating insights from telemetry datasets.
Built scalable PySpark workflows for data transformation and aggregation.
Designed dimensional models and optimized SQL queries for analytics workloads.
- Building real-world data engineering projects
- Learning advanced Microsoft Fabric data pipelines
- Improving PySpark performance optimization
- Implementing data quality frameworks
πΌ LinkedIn https://www.linkedin.com/in/tushar-shinde-1207
π§ Email tushar.shinde1207@gmail.com
π Location Pune, India