Problem / Motivation
CUDA tests (cargo nextest run --features cuda) can only run on machines with NVIDIA GPUs. Currently these must be run manually on RunPod. An ephemeral GPU CI runner would catch GPU regressions automatically on every PR.
Solution
GitHub Actions workflow that:
- Spins up an ephemeral RunPod instance (or similar GPU cloud) on PR
- Runs
cargo nextest run --features cuda
- Optionally runs the benchmark protocol script
- Tears down the instance after completion
Options
- RunPod serverless endpoints
- GitHub-hosted GPU runners (if available)
- Self-hosted runner on always-on GPU instance (cheapest long-term)
Key files
- NEW:
.github/workflows/gpu-ci.yml
- Existing:
.github/workflows/ci.yml — CPU-only CI
Acceptance criteria
Problem / Motivation
CUDA tests (
cargo nextest run --features cuda) can only run on machines with NVIDIA GPUs. Currently these must be run manually on RunPod. An ephemeral GPU CI runner would catch GPU regressions automatically on every PR.Solution
GitHub Actions workflow that:
cargo nextest run --features cudaOptions
Key files
.github/workflows/gpu-ci.yml.github/workflows/ci.yml— CPU-only CIAcceptance criteria