Skip to content
View ShubhamSarvankar's full-sized avatar

Highlights

  • Pro

Block or report ShubhamSarvankar

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
ShubhamSarvankar/README.md

Hi, I'm Shubham Sarvankar,

A full-stack software engineer with depth in backend systems, distributed infrastructure, and applied ML.

I build production-grade systems end to end, from REST APIs and real-time messaging services to fine-tuned language models and retrieval pipelines. My work sits at the intersection of software engineering and applied ML: I care as much about how a system is deployed and scaled as I do about the model inside it. I enjoy working on projects that bring different domains together to produce an impact on the world.

Currently finishing my MS in Information Science at the University of Pittsburgh, where I also mentor Masters of Data Science capstone teams through applied data science projects. Previously a fullstack intern at TeamLease EdTech and a mobile application intern at GameOn Technologies.

Open to software engineering and ML engineering roles starting May 2026 (F-1 OPT).


Technical skills

Languages: Python, R, Java, C, C++, C#/.NET, JavaScript, TypeScript, SQL, HTML/CSS, PHP

ML & AI: PyTorch, TensorFlow, Scikit-learn, XGBoost, LightGBM, ONNX, Gensim, spaCy, SHAP, Plotly, NLTK, Hugging Face, Matplotlib

MLOps & data: MLflow/W&B, Airflow, Spark, Hadoop, Pandas, NumPy, SciPy, statsmodels, NetworkX

Backend & APIs: Spring Boot, Hibernate, ASP.NET MVC, Node.js/Express, Flask, FastAPI, React/Next.js, Angular, Vue.js, REST, WebSockets, OpenAPI

Infrastructure & DevOps: Docker/Kubernetes, AWS, GCP, Terraform, CI/CD, Linux, Git, CMake, GDB

Databases & tools: PostgreSQL, SQL Server, pytest, MongoDB, SQLite, Cassandra, Neo4j, Elasticsearch, Redis, BullMQ, Visual Studio, Splunk, Figma, Flutter, RaspberryPi


Projects

These are some selected projects that I've worked on recently.

  • Litmus: S3-compatible object store built for correctness. AWS CLI and boto3 work unmodified. A two-tier chaos suite proves durability invariants hold across SIGKILL and injected OSErrors. 115/325 ceph/s3-tests pass; every failure documented.
  • ITCH 5.0 Parser: C++ limit order book reconstruction for NASDAQ TotalView-ITCH 5.0. Lock-free snapshot handoff via atomic<shared_ptr>, fixed-point price arithmetic throughout, 57M msg/s parse throughput. Validated against a real 7.7 GB production capture.
  • Botnet C2 Detection: graph topology classifier over CTU-13 (13 captures, 7 botnet families). Leave-one-family-out cross-validation with pre-registered thresholds. Donbot achieves 0.872 PR-AUC on an unseen family; the cc_dst_only collapse to 0.001 is itself a documented finding.
  • Storm: real-time messaging platform in TypeScript. AES-256-GCM encryption, single-use JWT refresh tokens with replay detection, 112 tests against real MongoDB and Redis.

Let’s Connect

Pinned Loading

  1. storm storm Public

    Production-grade real-time messaging platform. TypeScript, WebSockets, MongoDB, Redis, BullMQ, AES-256-GCM encryption, JWT auth, Docker. 112 tests, 90%+ coverage.

    TypeScript

  2. itch-parser itch-parser Public

    A C++ limit order book reconstruction system for NASDAQ TotalView-ITCH 5.0 binary feeds.

    C++

  3. botnet-graph-analysis botnet-graph-analysis Public

    Graph-topology-based botnet C2 node detection on CTU-13. LightGBM + LOFO cross-family evaluation. One Strong result (Donbot 0.872)

    Python

  4. qianxusheng/PA-LawSearch qianxusheng/PA-LawSearch Public

    An Information Retrieval course project that builds a search engine for Pennsylvania law cases from https://case.law./

    Python 1

  5. litmus litmus Public

    S3-compatible object store built for correctness. AWS CLI and boto3 work unmodified. A two-tier chaos suite proves durability invariants hold across SIGKILL and injected OSErrors. 115/325 ceph/s3-t…

    Python

  6. medguard medguard Public

    Cross-platform medical records system with end-to-end encryption, mandatory PHI de-identification, and HIPAA-aware audit logging. Android (Kotlin/Compose) + React web + Firebase Cloud Functions.

    TypeScript