Update README.md

gc-fu · web-flow · commit 6e40c3d204a3 · 2025-08-14T18:57:46.000+08:00
diff --git a/vllm/README.md b/vllm/README.md
@@ -20,7 +20,7 @@ llm-scaler-vllm is an extended and optimized version of vLLM, specifically adapt
    2.4 [Multi-Modal Model Support](#24-multi-modal-model-support)  
    2.5 [Omni Model Support](#25-omni-model-support)  
    2.6 [Data Parallelism (DP)](#26-data-parallelism-dp)  
-   2.7 [Maximum Context Length Support](#27-maximum-context-length-support)  
+   2.7 [Finding maximum Context Length](#27-finding-maximum-context-length)  
 3. [Supported Models](#3-supported-models)  
 4. [Troubleshooting](#4-troubleshooting)
 5. [Performance tuning](#5-performance-tuning)
@@ -2281,7 +2281,7 @@ To enable data parallelism, add:
 
 ---
 
-### 2.7 Maximum Context Length Support
+### 2.7 Finding maximum Context Length
 When using the `V1` engine, the system automatically logs the maximum supported context length during startup based on the available GPU memory and KV cache configuration.
 
 #### Example: Successful Startup