added info for v100 runtime for model deployment #307

Merged
Milstein merged 3 commits into main from v100_model_vllm-runtime-info on Feb 10, 2026

Conversation

@Milstein
Contributor

@Milstein Milstein commented Feb 7, 2026

How to Use the NVIDIA V100 GPU Accelerator to Reduce Costs?

You can reduce costs by deploying your model on the NVIDIA V100 GPU.
To do this, select (V100 Support) vLLM NVIDIA GPU ServingRuntime for KServe as the Serving Runtime; it is customized to support the NVIDIA V100 GPU architecture. Then choose NVIDIA V100 GPU as the Accelerator and set the Number of accelerators to 1.
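
For readers who deploy from a manifest instead of the dashboard form, the selections above map roughly onto a KServe InferenceService. The following is a minimal sketch, not the configuration this PR documents: the model name, the runtime resource name `vllm-v100-runtime`, the `vLLM` model format, and the storage URI are all hypothetical placeholders, since the PR only gives the runtime's display name.

```python
import json


def build_inference_service(name, runtime_name, storage_uri, gpu_count=1):
    """Return a KServe InferenceService manifest as a plain dict.

    All argument values are supplied by the caller; nothing here is
    taken from the actual cluster configuration.
    """
    gpu = {"nvidia.com/gpu": str(gpu_count)}  # one accelerator by default
    return {
        "apiVersion": "serving.kserve.io/v1beta1",
        "kind": "InferenceService",
        "metadata": {"name": name},
        "spec": {
            "predictor": {
                "model": {
                    # Assumed model format name; check the runtime's
                    # supportedModelFormats on your cluster.
                    "modelFormat": {"name": "vLLM"},
                    "runtime": runtime_name,
                    "storageUri": storage_uri,
                    "resources": {"limits": dict(gpu), "requests": dict(gpu)},
                }
            }
        },
    }


# Hypothetical values for illustration only.
manifest = build_inference_service(
    name="my-model",
    runtime_name="vllm-v100-runtime",
    storage_uri="s3://my-bucket/models/my-model",
)
print(json.dumps(manifest, indent=2))
```

The sketch pins only the GPU count. Which GPU model the pod lands on depends on how the cluster exposes the accelerator (for example, through node selectors or tolerations), which is not covered here.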

@Milstein Milstein requested a review from joachimweyl February 7, 2026 00:31
Contributor

@joachimweyl joachimweyl left a comment

These changes are fine, but while reviewing them I noticed an issue with the page itself: many images are missing. The first is https://github.com/nerc-project/nerc-docs/blob/docs/openshift-ai/other-projects/images/minio-create-bucket-path.png, but there are many others.

@Milstein Milstein merged commit b5984b7 into main Feb 10, 2026
4 checks passed
@Milstein Milstein deleted the v100_model_vllm-runtime-info branch February 10, 2026 22:32
2 participants