Skip to content
View SK2837's full-sized avatar

Block or report SK2837

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
SK2837/README.md

πŸ‘‹ Hi, I'm Sai Adarsh Kasula

πŸ“Š Data Scientist | πŸ€– Applied AI & Machine Learning Engineer


🧠 About Me

I am a Data Scientist with 5+ years of experience building production-grade machine learning and analytics systems across healthcare, IT services, and consulting environments.

I specialize in end-to-end ML ownership β€” from problem framing and hypothesis-driven analysis to model development, evaluation, deployment, and stakeholder adoption. My work focuses on converting complex, high-dimensional data (EHRs, behavioral signals, operational logs) into trustworthy, decision-ready insights with measurable business and clinical impact.


πŸ”­ What I’m Working On

🩺 Predictive and causal modeling on large-scale longitudinal healthcare data
πŸ“ˆ Designing statistically rigorous ML pipelines aligned with real-world workflows
πŸ” Building explainable & monitored ML systems for daily decision-making
πŸ“Š Translating models into executive dashboards and narratives


🌱 Current Areas of Focus

πŸ“ Advanced statistical modeling & quasi-experimental methods
⏱️ Time-series modeling & forecasting for operational systems
πŸ§ͺ Model calibration, diagnostics & evaluation in high-stakes settings
☁️ Scalable Python + SQL pipelines on cloud infrastructure


πŸ₯ Professional Experience

🩺 Robert Wood Johnson University Hospital – Cardiology

Data Scientist | Dec 2024 – Nov 2025

  • Built and deployed statistical, causal, and ML models over 50K+ longitudinal EHR records, achieving ~20% PR-AUC improvement
  • Led hypothesis-driven analyses and quasi-experiments using cohort analysis and EDA
  • Owned pipelines end-to-end (SQL β†’ Python β†’ production) and presented insights to senior leadership

🏒 Allsec Technologies (IT Services / BPM)

Machine Learning Engineer / Data Scientist | Sep 2021 – Jul 2023

  • Developed risk prediction and prioritization models for high-volume operational workflows
  • Applied forecasting, regression, and feature ablation to evaluate system changes
  • Productionized ML solutions reducing reactive handling and improving proactive outcomes by ~30%

🧩 Mastek (Technology Services & Consulting)

Junior Data Scientist / Data Science Intern | 2019 – 2021

  • Applied ML and statistical techniques to customer and system-level data
  • Built scalable SQL pipelines and analysis-ready datasets
  • Delivered insights via dashboards and data stories influencing planning decisions

πŸ› οΈ Technical Skill Stack

πŸ‘¨β€πŸ’» Core Technologies

πŸ“Š Machine Learning & Statistics

  • Classification, Regression, Tree-Based Models
  • Time Series Modeling & Forecasting
  • Model Calibration, Diagnostics, Statistical Analysis

☁️ Cloud & Distributed Systems

  • AWS (EC2, S3), Spark, Distributed Data Processing

πŸ“ˆ Visualization & Reporting

  • Tableau, Executive Dashboards, Stakeholder Reporting

βš™οΈ Engineering & Tooling

  • REST APIs, CI/CD, Docker, Git, Agile/Scrum

πŸ“‚ What You’ll Find in My GitHub

This GitHub contains real-world, end-to-end ML projects focused on:

βœ… Clear problem framing
βœ… Reproducible data pipelines
βœ… Interpretable model evaluation
βœ… Business-aligned metrics & outcomes

πŸ“ data/        β†’ Sample or synthetic datasets
πŸ“ notebooks/  β†’ EDA & modeling
πŸ“ src/        β†’ Reusable pipeline code
πŸ“ metrics/    β†’ Evaluation & diagnostics
πŸ“„ README.md   β†’ Problem, approach, results

Pinned Loading

  1. Youtube-Trust-Safety-Profiler- Youtube-Trust-Safety-Profiler- Public

    Designed a multi-layer risk scoring framework (content + behavioral + contextual features), evaluated with Trust & Safety-specific metrics (FPR, FNR, Expected Harm Score), optimizing detection thre…

    Python 1

  2. AI-JobAgent AI-JobAgent Public

    Python

  3. ATM-Fraud-Detection-PredCatch ATM-Fraud-Detection-PredCatch Public

    Python scripts and Jupyter notebooks for building a predictive model to detect fraudulent ATM transactions, addressing data imbalance and leveraging transaction metadata and proprietary indices for…

    Jupyter Notebook

  4. Credit-Risk-Optimization Credit-Risk-Optimization Public

    Reinforcement‑learning prototype for credit‑risk threshold optimisation

    Python

  5. Twitter-Search-Application Twitter-Search-Application Public

    Jupyter Notebook