fix: use per-call sessions for embeddings to prevent timeout on large collections by debugerman · Pull Request #420 · tobi/qmd

debugerman · 2026-03-17T03:47:59Z

The old design wrapped the entire embedding loop in a single LLM session with a 30-minute maxDuration cap. For collections exceeding ~39k chunks, the session timer would fire mid-loop and all remaining embeddings would fail with SessionReleasedError.

Each embed/embedBatch call now gets its own short-lived session with the default 10-minute timeout, which is more than sufficient for a single batch of ≤32 texts.

Closes #410

… collections The old design wrapped the entire embedding loop in a single LLM session with a 30-minute maxDuration cap. For collections exceeding ~39k chunks, the session timer would fire mid-loop and all remaining embeddings would fail with SessionReleasedError. Each embed/embedBatch call now gets its own short-lived session with the default 10-minute timeout, which is more than sufficient for a single batch of ≤32 texts. Closes tobi#410

obbax mentioned this pull request Mar 24, 2026

Bug: qmd embed crashes with "Context is disposed" on CPU-only system after ~5 minutes #450

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix: use per-call sessions for embeddings to prevent timeout on large collections#420

fix: use per-call sessions for embeddings to prevent timeout on large collections#420
debugerman wants to merge 1 commit intotobi:mainfrom
debugerman:main

debugerman commented Mar 17, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

debugerman commented Mar 17, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant