Skip to content

Commit edb76be

Browse files
authored
Add known_issue about container oom (#104)
* add known_issue * update
1 parent 22817fa commit edb76be

File tree

1 file changed

+12
-0
lines changed

1 file changed

+12
-0
lines changed

vllm/KNOWN_ISSUES.md

Lines changed: 12 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -11,3 +11,15 @@ When using a single GPU card over a x16 PCIe connection without a PCIe switch, t
1111
Workaround: Change the PCIe slot configuration in BIOS from Auto/x16 to x8/x8.
1212
With this change, over 40 GB/s bi-directional P2P bandwidth can be achieved.
1313
Root cause analysis is still in progress.
14+
15+
# 03. Container OOM killed by using `--enable-auto-tool-choice` and starting container not by /bin/bash and not run `source /opt/intel/oneapi/setvars.sh`
16+
17+
When using `--enable-auto-tool-choice` and deploy container by docker-compose without `source /opt/intel/oneapi/setvars.sh`, the LD_LIBRARY_PATH will be different and cause the container OOM. It can be reproduced by this two command:
18+
19+
```bash
20+
docker run --rm --entrypoint "/bin/bash" --name=test intel/llm-scaler-vllm:latest -c env | grep LD_LIBRARY_PATH
21+
22+
docker run --rm --entrypoint "/bin/bash" --name=test intel/llm-scaler-vllm:latest -c "source /opt/intel/oneapi/setvars.sh --force && env | grep LD_LIBRARY_PATH"
23+
```
24+
25+
So we need to run `source /opt/intel/oneapi/setvars.sh --force` to ensure some configurations are consistent.

0 commit comments

Comments
 (0)