From e5340cf59484ebcfa3203c34cbe4c2e87d405ca8 Mon Sep 17 00:00:00 2001 From: Claude Date: Sun, 8 Feb 2026 01:02:20 +0000 Subject: [PATCH] Add VoxServe paper to Multi-Modal Serving Systems https://claude.ai/code/session_016pDJaYmPtmTriCMEWqUJAA --- README.md | 1 + 1 file changed, 1 insertion(+) diff --git a/README.md b/README.md index dc4e24a..ff98d13 100644 --- a/README.md +++ b/README.md @@ -326,6 +326,7 @@ A curated list of Large Language Model systems related academic papers, articles - [Cornserve](https://arxiv.org/abs/2512.14098): Efficiently Serving Any-to-Any Multimodal Models - [HydraInfer](https://arxiv.org/abs/2505.12658): Hybrid Disaggregated Scheduling for Multimodal Large Language Model Serving - [Enabling Disaggregated Multi-Stage MLLM Inference via GPU-Internal Scheduling and Resource Sharing](https://arxiv.org/abs/2512.17574) +- [VoxServe](https://arxiv.org/abs/2602.00269): Streaming-Centric Serving System for Speech Language Models ## LLM for Systems