Sam-Sundar/README.md

👋 Hi, I'm Sam Sundar Suresh

DevOps / MLOps Engineer | Cloud Infrastructure | Kubernetes | AI Platforms

🚀 I build production-grade cloud platforms that scale
⚙️ I automate everything that shouldn’t be manual
🔥 I run systems after go-live, not just draw diagrams of them

🧠 What I Do

  • Design and operate high-availability Kubernetes platforms
  • Run GPU-backed ML & LLM inference workloads
  • Build CI/CD pipelines that ship fast without breaking prod
  • Optimize cloud cost, performance, and reliability
  • Help engineers unblock themselves and ship with confidence

I care about uptime, p99s, and root cause analysis — not buzzwords.

🛠️ Tech I Work With

Cloud        : AWS • GCP • Hybrid / On-Prem
Containers   : Kubernetes • Docker • Helm 
MLOps        : vLLM • NVIDIA Triton • CUDA • MIG • MLflow
IaC          : Terraform • Ansible
CI/CD        : Argo CD • Jenkins • GitHub Actions 
Observability: Prometheus • Grafana • Loki • VictoriaMetrics
Security     : Vault • Kyverno • Trivy • Falco
Languages    : Python • Bash • Go (working knowledge)

⚡ Stuff I’ve Built / Run in Production

  • ☸️🐋 Migrated 60+ monoliths → microservices, scaling to 10k RPS with p99 latency < 500 ms
  • 🤖 Deployed LLM inference platforms using multi-GPU scheduling and NVIDIA MIG
  • 🔄 Built CI/CD systems delivering zero-downtime releases
  • 📊 Ran data pipelines handling billions of clickstream events per day
  • 💸 Cut cloud costs by 25%+ without sacrificing performance
  • 🚨 Owned production incidents, led RCAs, and fixed problems permanently

🧑‍💻 How I Work

  • Production-first mindset
  • Strong bias toward automation
  • Calm during incidents
  • Direct, honest communication
  • Big believer in “teach, don’t gatekeep”

If developers are stuck, I jump in.

📫 Let’s Connect


💡 “Make it reliable first. Make it fast second. Make it pretty later.”

Popular repositories

  1. DevOps-cheat-sheet-pdf (Public)

    Forked from saiumesh535/cheat-sheet-pdf

    📜 A Cheat-Sheet Collection from the WWW

    Stars: 20 · Forks: 9

  2. files-changed-github-action (Public)

    Go

  3. kubeswitch (Public)

    Forked from saiumesh535/kubeswitch

    Go

  4. k8s-cluster (Public)

    Forked from mipsel64/my-cluster

    My Kubernetes cluster

    HCL

  5. kubernetes-handbook (Public)

    Forked from feiskyer/kubernetes-handbook

    Kubernetes Handbook https://kubernetes.feisky.xyz

    Makefile

  6. Certified-Kubernetes-Administrator-Notes (Public)

    Forked from ismet55555/Certified-Kubernetes-Administrator-Notes

    https://www.cncf.io/certification/cka/