Summary
Optimize vector ingestion path to reduce time from chat creation to READY.
Problem
Vector processing currently does sequential scraping + embedding with relatively small batching and can be slow for medium/large docs sets.
Expected Solution
Improve throughput using controlled concurrency, smarter batching, and optional caching for repeated pages/chunks.
Scope
backend/chatWorker.js
backend/utils/ragUtilities.js
- Qdrant write strategy and queue behavior
- Progress reporting logic
Acceptance Criteria
Summary
Optimize vector ingestion path to reduce time from chat creation to READY.
Problem
Vector processing currently does sequential scraping + embedding with relatively small batching and can be slow for medium/large docs sets.
Expected Solution
Improve throughput using controlled concurrency, smarter batching, and optional caching for repeated pages/chunks.
Scope
backend/chatWorker.jsbackend/utils/ragUtilities.jsAcceptance Criteria