Skip to content

Speed Up Vector Chat Creation Pipeline #11

@avishek0769

Description

@avishek0769

Summary

Optimize vector ingestion path to reduce time from chat creation to READY.

Problem

Vector processing currently does sequential scraping + embedding with relatively small batching and can be slow for medium/large docs sets.

Expected Solution

Improve throughput using controlled concurrency, smarter batching, and optional caching for repeated pages/chunks.

Scope

  • backend/chatWorker.js
  • backend/utils/ragUtilities.js
  • Qdrant write strategy and queue behavior
  • Progress reporting logic

Acceptance Criteria

  • Measured end-to-end ingestion time improves on representative docs sets.
  • Throughput improvements do not degrade indexing correctness.
  • Worker remains stable under concurrent jobs.
  • Progress feedback remains accurate and monotonic.

Metadata

Metadata

Assignees

No one assigned

    Labels

    backendBackend issueshardThis is issue is hard to solveperformanceImprove performance of the existing feature

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions