You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: docs/getting_started/installation/gpu.cuda.inc.md
+10-4Lines changed: 10 additions & 4 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -84,7 +84,7 @@ uv pip install vllm \
84
84
# --8<-- [end:pre-built-wheels]
85
85
# --8<-- [start:build-wheel-from-source]
86
86
87
-
#### Set up using Python-only build (without compilation)
87
+
#### Set up using Python-only build (without compilation) {#python-only-build}
88
88
89
89
If you only need to change Python code, you can build and install vLLM without compilation. Using `uv pip`'s [`--editable` flag](https://docs.astral.sh/uv/pip/packages/#editable-packages), changes you make to the code will be reflected when you run vLLM:
90
90
@@ -108,18 +108,24 @@ This command will do the following:
108
108
In case you see an error about wheel not found when running the above command, it might be because the commit you based on in the main branch was just merged and the wheel is being built. In this case, you can wait for around an hour to try again, or manually assign the previous commit in the installation using the `VLLM_PRECOMPILED_WHEEL_LOCATION` environment variable.
109
109
110
110
```bash
111
-
exportVLLM_COMMIT=72d9c316d3f6ede485146fe5aabd4e61dbc59069 # use full commit hash from the main branch
exportVLLM_PRECOMPILED_WHEEL_COMIMT=$(git rev-parse HEAD~1)# or earlier commit on main
112
+
exportVLLM_USE_PRECOMPILED=1
113
113
uv pip install --editable .
114
114
```
115
115
116
+
There are more environment variables to control the behavior of Python-only build:
117
+
118
+
*`VLLM_PRECOMPILED_WHEEL_LOCATION`: specify the exact wheel URL or local file path of a pre-compiled wheel to use. All other logic to find the wheel will be skipped.
119
+
*`VLLM_PRECOMPILED_WHEEL_COMMIT`: override the commit hash to download the pre-compiled wheel. It can be `nightly` to use the last **already built** commit on the main branch.
120
+
*`VLLM_PRECOMPILED_WHEEL_VARIANT`: specify the variant subdirectory to use on the nightly index, e.g., `cu129`, `cpu`. If not specified, the CUDA variant with `VLLM_MAIN_CUDA_VERSION` will be tried, then fallback to the default variant on the remote index.
121
+
116
122
You can find more information about vLLM's wheels in [Install the latest code](#install-the-latest-code).
117
123
118
124
!!! note
119
125
There is a possibility that your source code may have a different commit ID compared to the latest vLLM wheel, which could potentially lead to unknown errors.
120
126
It is recommended to use the same commit ID for the source code as the vLLM wheel you have installed. Please refer to [Install the latest code](#install-the-latest-code) for instructions on how to install a specified wheel.
121
127
122
-
#### Full build (with compilation)
128
+
#### Full build (with compilation) {#full-build}
123
129
124
130
If you want to modify C++ or CUDA code, you'll need to build vLLM from source. This can take several minutes:
0 commit comments