feat(file-processors): add async request scheduler to prevent docling… by alinaryan · Pull Request #5627 · ogx-ai/ogx

alinaryan · 2026-04-24T20:30:50Z

What does this PR do?

Adds an AsyncRequestScheduler utility and integrates it into the remote::docling-serve file processor to prevent server overload under concurrent load.

Problem: Benchmarking showed that Docling Serve degrades severely under concurrency — latency triples for small files (6s → 18s), and large image-heavy PDFs crash the server entirely
at concurrency ≥ 5. Without request queuing, a burst of file uploads can take down the processing pipeline.

Solution: A reusable AsyncRequestScheduler (asyncio semaphore with FIFO queuing) that gates how many requests hit the backend simultaneously. The docling-serve provider wraps its HTTP
calls through the scheduler, so excess requests wait in line rather than overwhelming the server.

max_concurrency (default 2): how many requests can be in-flight at once
max_queue_size (default 0/unlimited): cap on waiting requests before rejecting

The scheduler is a general-purpose utility in providers/utils/ — any provider with a resource-constrained backend can adopt it.

Follows up on: #3970 and #5412

Test Plan

In progress

…-serve overload Docling Serve degrades severely under concurrent load — latency triples for small files, and large image-heavy PDFs crash the server entirely at concurrency >= 5. This adds an AsyncRequestScheduler utility that limits concurrent requests via an asyncio semaphore with FIFO queuing, and integrates it into the docling-serve file processor provider. Operators can tune max_concurrency (default 2) and max_queue_size (default unlimited) via provider config or environment variables. Signed-off-by: Alina Ryan <aliryan@redhat.com>

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat(file-processors): add async request scheduler to prevent docling…#5627

feat(file-processors): add async request scheduler to prevent docling…#5627
alinaryan wants to merge 1 commit intoogx-ai:mainfrom
alinaryan:async-scheduler

alinaryan commented Apr 24, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

alinaryan commented Apr 24, 2026

What does this PR do?

Test Plan

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant