Skip to content
View MuditNautiyal-21's full-sized avatar

Highlights

  • Pro

Block or report MuditNautiyal-21

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
MuditNautiyal-21/README.md

Mudit Nautiyal

I build production data systems that process real transactions at scale and solve measurable business problems.


Who I Am

6+ years building enterprise data systems at HCL Technologies, where I owned ETL pipelines processing 100K+ daily transactions across SAP, Informatica MDM, and AWS. Refactored legacy SQL that cut cycle time by 45%. Built an ML failure prediction model that caught 85% of infrastructure failures before they happened (HCL Innovation Box 2023).

Before that, I was at Uttarakhand Aaj, a digital news platform operating across 13 states, where I built the analytical case that got leadership to approve a full infrastructure migration from monolithic to modular architecture.

MS in Data Science, University at Buffalo (Dec 2025). Currently seeking Data Engineer, Data Analyst, Data Scientist, and ML Engineer roles.


What I Build

I don't build tutorial projects. Every repo below solves a real problem for a real user or client.

The Problem: A manufacturing client (Nissha Medical Technologies) produces ~30M casino tickets/day with a 4% defect rate in Q-Block alignment. Manual inspection cannot scale.

What I Built: Real-time computer vision pipeline using YOLOv8 + CUDA + TensorRT with 4 deterministic validation gates. 100% accuracy on 2,177 production images. 45% latency reduction (117ms to 62ms).

My Role: Team Lead. Designed the architecture, built the inference engine, defined production acceptance criteria with client engineers.

The Problem: Over 80% of US medical bills contain errors. Patients overpay by thousands because they cannot interpret CPT/ICD codes or generate dispute letters.

What I Built: AI-powered billing advocate that ingests medical bills via OCR, detects billing errors using Google Gemini, and auto-generates legally structured dispute letters. Reduced manual reconciliation by ~70%.

Recognition: Winner, AI For Good Hackathon, University at Buffalo.

The Problem: Retail traders juggle fragmented tools for charts, news, and backtesting. No single platform connects real-time data, technical analysis, and strategy validation.

What I Built: Full-stack trading intelligence platform with ML-driven adaptive strategies (30-min learning cycles), sentiment analysis across 9 global exchanges, and a backtesting engine generating 1,000+ simulated trades. Containerized with Docker, deployed on Render.

The Problem: In production, fraud labels are rare or delayed. Financial institutions need detection systems that work without labeled training data.

What I Built: Two unsupervised anomaly detection models (Isolation Forest + Autoencoder) on 284K+ real transactions. Autoencoder catches 84.6% of fraud; Isolation Forest minimizes false positives. Documented the precision-recall tradeoff as a business decision, not just a technical one.


Impact By The Numbers

100K+

Daily Transactions
Processed at HCL

45%

Processing Time
Reduction

85%

Failure Prediction
Accuracy

30M+

Units/Day
Throughput

100%

Defect Detection
Accuracy

~70%

Manual Work
Reduced

Tools I Use In Production


Credentials


Pinned Loading

  1. MARA MARA Public

    M.A.R.A. is a Multi-Agent Research Analyst. A self-correcting multi-agent research system built with LangGraph, Gemma 4, and Streamlit.

    Python

  2. EagleEyes-QBlock-Vision-AI EagleEyes-QBlock-Vision-AI Public

    Real-time defect detection system for manufacturing QA - YOLOv8 + 4-gate rule-based validation pipeline achieving 100% accuracy Good Images and 99.82% on No Good Images, achieving a total of 99.91%…

    Python

  3. MediFriend MediFriend Public

    AI-Assistant based medical billing application, utilizing OCR for document ingestion and Large Language model (LLM) to automate entity extraction and billing reconciliation.

    TypeScript

  4. SUDNAXI-Trading-Intelligence SUDNAXI-Trading-Intelligence Public

    Python

  5. Data-Pulse Data-Pulse Public

    Catch silent data pipeline failures before your stakeholders do. Lightweight data quality engine with YAML-based checks, statistical anomaly detection, Slack alerting, and a health dashboard.

    Python

  6. Credit-Card-Fraud-Detection Credit-Card-Fraud-Detection Public

    Anomaly detection in credit card transactions using Isolation Forest and Autoencoder

    Jupyter Notebook 3