diff --git a/README.md b/README.md index dc4e24a..ff98d13 100644 --- a/README.md +++ b/README.md @@ -326,6 +326,7 @@ A curated list of Large Language Model systems related academic papers, articles - [Cornserve](https://arxiv.org/abs/2512.14098): Efficiently Serving Any-to-Any Multimodal Models - [HydraInfer](https://arxiv.org/abs/2505.12658): Hybrid Disaggregated Scheduling for Multimodal Large Language Model Serving - [Enabling Disaggregated Multi-Stage MLLM Inference via GPU-Internal Scheduling and Resource Sharing](https://arxiv.org/abs/2512.17574) +- [VoxServe](https://arxiv.org/abs/2602.00269): Streaming-Centric Serving System for Speech Language Models ## LLM for Systems