perf: Share SentenceTransformer between RAGIndexer and ContextRetriever

## Scope
`refactron/rag/indexer.py` (`RAGIndexer`) and `refactron/rag/retriever.py` (`ContextRetriever`)

## Problem
Both classes construct `SentenceTransformer(embedding_model)` in `__init__`. Loading the model twice in one process duplicates **memory** and **startup latency** (common in workflows: index then query, or repeated CLI invocations if embedder is ever kept warm).

## Suggested direction
- Introduce a small factory or module-level LRU keyed by `(model_name, device)`, or allow injecting a shared `SentenceTransformer` instance into both classes.
- Keep backward-compatible defaults.

## Acceptance
- Single-model workflows only load weights once; public API still works without callers passing a custom instance.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

perf: Share SentenceTransformer between RAGIndexer and ContextRetriever #151

Scope

Problem

Suggested direction

Acceptance

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

perf: Share SentenceTransformer between RAGIndexer and ContextRetriever #151

Description

Scope

Problem

Suggested direction

Acceptance

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions