A learning project: the goal is to get familiar with the tooling around deploying ML models (Docker, CI/CD, Ansible, AWS), with a focus on serving and MLOps.
The actual model is a MobileNetV2 fine-tuned to classify bean leaf images (healthy / angular leaf spot / bean rust). It's a simple enough task that I can focus on the infra side.
train locally (GPU)
→ MLflow tracks the run + saves `model.pth`
→ `promote --run-id <id>` pushes it to S3
→ `git tag v0.x && git push --tags`
→ GitHub Actions builds the Docker image → GHCR
→ Ansible SSHes into EC2, pulls the image, restarts the container
→ container loads `model.pth` from S3 on startup
Dev environment is Nix + uv. Two shells: `nix develop` for CUDA (training), `nix develop .#minimal` for CPU-only work.
Train:

    uv run --extra training train

Set `MLFLOW_TRACKING_URI` if you want a remote MLflow server; otherwise it writes to `mlruns/` locally.
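The remote-vs-local fallback can be made explicit in the training entrypoint; a small sketch of the assumed behavior (MLflow itself defaults to local `./mlruns` file storage when no tracking URI is configured):

```python
import os


def tracking_uri() -> str:
    """Pick the MLflow tracking destination.

    Use MLFLOW_TRACKING_URI when the env var is set (remote server),
    otherwise fall back to local file storage under ./mlruns.
    """
    return os.environ.get("MLFLOW_TRACKING_URI", "file:./mlruns")
```

The training script would pass this to `mlflow.set_tracking_uri()` before starting a run.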
Promote a run to S3:

    MODEL_BUCKET=<bucket> uv run promote --run-id <mlflow-run-id>

Serve locally:
    MODEL_BUCKET=<bucket> uv run --extra inference uvicorn ml_model_serving.main:app --port 8080
    # or
    docker build -t ml-model-serving . && docker run -p 8080:8080 -e MODEL_BUCKET=<bucket> ml-model-serving

`MODEL_BUCKET` is optional; without it the server just starts with random weights and warns you.
Provision a fresh EC2 (one-time):

    ansible-playbook -i inventory.aws_ec2.yml playbook-provision.yml

Deploy manually:

    ansible-playbook -i inventory.aws_ec2.yml playbook-deploy.yml -e "model_bucket=<bucket>"

CI does this automatically on version tags.
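A deploy can be smoke-tested against the health endpoint afterwards. A hedged sketch; the exact `status` string the service returns is an assumption, and `<ec2-host>` is a placeholder:

```python
import json
from urllib.request import urlopen


def is_healthy(body: bytes) -> bool:
    """Interpret a GET /health response body.

    Assumes the service reports {"status": "ok", "device": ...};
    adjust the expected string if the real payload differs.
    """
    try:
        return json.loads(body).get("status") == "ok"
    except ValueError:
        return False


def check(url: str = "http://<ec2-host>:8080/health") -> bool:
    """Hit the health endpoint once; True when the service is up."""
    with urlopen(url, timeout=5) as resp:
        return is_healthy(resp.read())
```

Running `check()` right after `playbook-deploy.yml` finishes catches a container that restarted but failed to load.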
`GET /health` → `{ status, device }`
`POST /predict` → image upload → `{ predicted_class, label, probabilities }`
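The `/predict` payload can be built from the model's three raw logits with a plain softmax. A sketch; the label order here is an assumption (the real class-index mapping comes from the training dataset):

```python
import math

# Order is illustrative; the real mapping comes from the dataset.
LABELS = ["healthy", "angular_leaf_spot", "bean_rust"]


def to_response(logits: list[float]) -> dict:
    """Turn three raw logits into the /predict response shape."""
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]  # shift by max for stability
    total = sum(exps)
    probs = [e / total for e in exps]
    idx = max(range(len(probs)), key=probs.__getitem__)
    return {
        "predicted_class": idx,
        "label": LABELS[idx],
        "probabilities": dict(zip(LABELS, probs)),
    }
```

Subtracting the max logit before exponentiating avoids overflow without changing the resulting probabilities.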
Required CI secrets: `SSH_PRIVATE_KEY`, `AWS_ACCESS_KEY_ID`, `AWS_SECRET_ACCESS_KEY`, `MODEL_BUCKET`
The EC2 instance needs the tag `Name: ml-model-serving` in `eu-north-1` for the dynamic inventory to find it.
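That tag-based lookup suggests an `inventory.aws_ec2.yml` roughly like the following; `plugin`, `regions`, and `filters` are the `amazon.aws.aws_ec2` plugin's standard keys, but the exact filters in this repo may differ:

```yaml
# inventory.aws_ec2.yml (sketch)
plugin: amazon.aws.aws_ec2
regions:
  - eu-north-1
filters:
  tag:Name: ml-model-serving
  instance-state-name: running
```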

