You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
The frontend image is a specialized container that includes the Dynamo components (NATS, etcd, dynamo, NIXL, etc) along with the Endpoint Picker (EPP) for Kubernetes Gateway API Inference Extension integration. This image is primarily used for inference gateway deployments.
200
+
201
+
**Step 1: Build the Custom Dynamo EPP Image**
202
+
203
+
Follow the instructions in [`deploy/inference-gateway/README.md`](../deploy/inference-gateway/README.md) under "Build the custom EPP image" section. This process:
204
+
- Clones the Gateway API Inference Extension repository
205
+
- Applies Dynamo-specific patches for custom routing
206
+
- Builds the Dynamo router as a static library
207
+
- Creates a custom EPP image with integrated Dynamo routing capabilities
208
+
209
+
**Step 2: Build the Dynamo Base Image**
210
+
211
+
The base image contains the core Dynamo runtime components, NATS server, etcd, and Python dependencies:
212
+
```bash
213
+
# Build the base dev image (framework=none for frontend-only deployment)
214
+
./build.sh --framework none --target dev
215
+
```
216
+
217
+
**Step 3: Build the Frontend Image**
218
+
219
+
Now build the frontend image that combines the Dynamo base with the EPP:
220
+
221
+
```bash
222
+
# 2. Build the frontend image using the pre-built EPP
-**Python Environment**: Virtual environment with all required dependencies
239
+
-**NATS Server**: Message broker for Dynamo's distributed communication
240
+
-**etcd**: Distributed key-value store for configuration and coordination
241
+
242
+
#### Deployment
243
+
244
+
The frontend image is designed for Kubernetes deployment with the Gateway API Inference Extension. See [`deploy/inference-gateway/README.md`](../deploy/inference-gateway/README.md) for complete deployment instructions using Helm charts.
245
+
195
246
### run.sh - Container Runtime Manager
196
247
197
248
The `run.sh` script launches Docker containers with the appropriate configuration for development and inference workloads.
0 commit comments