Closed
Description
Please add the following papers:
HydraInfer: Hybrid Disaggregated Scheduling for Multimodal Large Language Model Serving
https://arxiv.org/abs/2505.12658
Enabling Disaggregated Multi-Stage MLLM Inference via GPU-Internal Scheduling and Resource Sharing
https://arxiv.org/abs/2512.17574
Decoupling the "What" and "Where" With Polar Coordinate Positional Embeddings
https://arxiv.org/abs/2509.10534