A self-hosted LLM inference platform for Kubernetes (k8s). It serves an open-source language model via an OpenAI-compatible REST API.
[Client] → REST API → [Service] → [Ollama Pod] → [PVC: model storage]
Ollama runs as a k8s Deployment. The model is pulled once onto a persistent volume and loaded into RAM on startup, where it stays resident for the lifetime of the pod. All inference happens in RAM; disk is not involved after the initial load. An init container runs before the main container on every pod startup: it checks whether the model is already present on the PVC and pulls it only if it is not.
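The init container's check-and-pull behaviour can be sketched as a small shell function. This is a hypothetical illustration, not the chart's actual script: the marker-file path and the commented-out pull command are assumptions.

```shell
#!/bin/sh
# Hypothetical sketch of the init container's logic.
# The marker path and model name are assumptions, not the chart's actual layout.
ensure_model() {
  model_dir="${1:-/root/.ollama}"
  marker="$model_dir/.model-pulled"
  if [ -f "$marker" ]; then
    # Model already on the PVC: do nothing, start fast.
    echo "model already on PVC, skipping pull"
  else
    # Empty PVC (first deployment): download the model once.
    echo "empty PVC, pulling model"
    # ollama pull <your-model>   # the real pull would run here
    mkdir -p "$model_dir"
    touch "$marker"
  fi
}
```

On the first start the pull branch runs; every subsequent restart finds the marker and skips straight to serving.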
The first deployment on an empty PVC will take time to download the model before the pod becomes ready. Download time depends on model size and network speed. This is expected behaviour.
helm repo add token-loom https://morooshka.github.io/token-loom
helm repo update
helm upgrade --install token-loom token-loom/token-loom \
--namespace token-loom \
--create-namespace
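After installing, you can watch the rollout and the initial model download. These are cluster commands, not runnable in isolation; the label selector and init-container name below are assumptions, adjust them to match the chart's actual manifests:

```shell
# Watch the pod come up (the first start includes the model download)
kubectl get pods -n token-loom -w

# Follow the init container's pull progress
# (the "model-init" container name is an assumption)
kubectl logs -n token-loom -l app.kubernetes.io/name=token-loom -c model-init -f

# Block until the pod reports Ready; use a generous timeout
# to cover the first-time download
kubectl wait pod -n token-loom -l app.kubernetes.io/name=token-loom \
  --for=condition=Ready --timeout=30m
```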
After deployment, verify end-to-end:
# 1. Confirm the server is up and the model is registered
curl http://<host>:<port>/api/tags
# 2. Confirm inference works
curl http://<host>:<port>/api/generate \
-d '{
"model": "<your-model>",
"prompt": "Say hello",
"stream": false
}'
A successful response contains "done": true.
The first request may take additional time while the model loads into RAM.
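Because the API is OpenAI-compatible, clients built for the OpenAI API can also point at the service. A sketch against Ollama's OpenAI-compatible chat endpoint; host, port, and model name are placeholders to fill in:

```shell
# Chat completion via the OpenAI-compatible endpoint
curl http://<host>:<port>/v1/chat/completions \
  -H "Content-Type: application/json" \
  -d '{
    "model": "<your-model>",
    "messages": [{"role": "user", "content": "Say hello"}]
  }'
```

Existing OpenAI SDKs can be used the same way by setting their base URL to http://<host>:<port>/v1.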