RAG Integration by RLKRo · Pull Request #426 · deeppavlov/chatsky

RLKRo · 2025-03-31T14:47:58Z

Description

Add RAG integration.

Checklist

I have performed a self-review of the changes

To Consider

Add tests (if functionality is changed)
Update API reference / tutorials / guides
Update CONTRIBUTING.md (if devel workflow is changed)
Update .ignore files, scripts (such as lint), distribution manifest (if files are added/deleted)
Search for references to changed entities in the codebase

• Added VectorStoreService base class with config model
• Implemented ChromaVectorStore integration with HuggingFace embeddings
• Created RetrieverService for document search in pipeline
• Updated docker-compose for Chroma
• Excluded from pipeline flow (WIP)

TODO: Connect to pipeline services (in progress)
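For readers without the diff at hand, the base-class idea in the list above could look roughly like the sketch below. Only the name `VectorStoreService` comes from the description; `VectorStoreConfig`, its fields, the method names, and the toy `InMemoryVectorStore` are assumptions made for illustration, and a plain dataclass stands in for whatever config model the PR actually uses.

```python
from abc import ABC, abstractmethod
from dataclasses import dataclass
from typing import List


@dataclass
class VectorStoreConfig:
    """Hypothetical config model; the field names are assumptions."""

    collection_name: str = "documents"
    top_k: int = 3


class VectorStoreService(ABC):
    """Sketch of a base class that concrete stores would subclass."""

    def __init__(self, config: VectorStoreConfig):
        self.config = config

    @abstractmethod
    def add_documents(self, documents: List[str]) -> None:
        """Store documents so they can be searched later."""

    @abstractmethod
    def search(self, query: str) -> List[str]:
        """Return up to config.top_k documents relevant to the query."""


class InMemoryVectorStore(VectorStoreService):
    """Toy store: ranks documents by word overlap instead of embeddings."""

    def __init__(self, config: VectorStoreConfig):
        super().__init__(config)
        self._documents: List[str] = []

    def add_documents(self, documents: List[str]) -> None:
        self._documents.extend(documents)

    def search(self, query: str) -> List[str]:
        query_words = set(query.lower().split())
        # Sort by the number of words shared with the query, best match first.
        ranked = sorted(
            self._documents,
            key=lambda doc: len(query_words & set(doc.lower().split())),
            reverse=True,
        )
        return ranked[: self.config.top_k]
```

A Chroma-backed store would replace the word-overlap ranking with an embedding search while keeping the same two methods.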

chatsky/core/vector_store.py

RLKRo · 2025-03-31T15:00:36Z

chatsky/responses/rag.py

+        self.vector_store = QdrantClient(url=vector_store_url)
+        self.retriever = SentenceTransformer(retriever_model)
+
+    def execute(self, user_query: str, ctx: Dict[str, Any], **kwargs) -> Dict[str, Any]:


See response_tutorial for information on how to subclass BaseResponse.

RLKRo · 2025-03-31T15:00:50Z

chatsky/responses/rag.py

+            context = "\n".join([hit.payload["text"] for hit in results])
+            ctx["cached_rag_context"] = context
+
+        prompt = f"Контекст: {context}\nВопрос: {user_query}"


I would like the prompt to be customizable (i.e. the ability to change the prompt template via __init__).
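One possible shape for this, as a minimal sketch: the template is passed to `__init__` and filled in with `str.format`. `PromptBuilder`, `DEFAULT_PROMPT_TEMPLATE`, and the placeholder names `rag_context` and `user_query` are hypothetical; the default template is an English rendering of the f-string in the diff.

```python
from typing import List

# Hypothetical default; an English rendering of the f-string in the diff.
DEFAULT_PROMPT_TEMPLATE = "Context: {rag_context}\nQuestion: {user_query}"


class PromptBuilder:
    """Hypothetical helper that holds a prompt template chosen at construction."""

    def __init__(self, prompt_template: str = DEFAULT_PROMPT_TEMPLATE):
        self.prompt_template = prompt_template

    def build(self, documents: List[str], user_query: str) -> str:
        # Join retrieved documents and substitute both placeholders.
        rag_context = "\n".join(documents)
        return self.prompt_template.format(
            rag_context=rag_context, user_query=user_query
        )
```

Usage: `PromptBuilder("Q: {user_query} | Docs: {rag_context}").build(docs, query)`. Only the template is parsed by `str.format`, so braces inside the user query or the documents are left untouched.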

RLKRo · 2025-03-31T15:01:47Z

chatsky/responses/rag.py

+    def __init__(self, vector_store_url: str, retriever_model: str, **kwargs):
+        super().__init__(**kwargs)
+        self.vector_store = QdrantClient(url=vector_store_url)
+        self.retriever = SentenceTransformer(retriever_model)


Cannot use local models:

• additional heavy dependencies;
• CPU load.
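One way to avoid loading a model inside the class, sketched under assumptions: the retriever receives an embedding callable, so the embedding can be computed by an external service. `Embedder`, `Retriever`, and `fake_remote_embedder` are hypothetical names, and the fake embedder makes no network call.

```python
from typing import Callable, List, Sequence

# Hypothetical type alias: any callable mapping texts to embedding vectors.
Embedder = Callable[[Sequence[str]], List[List[float]]]


class Retriever:
    """Hypothetical retriever that is given its embedder instead of loading a model."""

    def __init__(self, embed: Embedder):
        self.embed = embed

    def embed_query(self, user_query: str) -> List[float]:
        # Embedding is delegated to the injected callable.
        return self.embed([user_query])[0]


def fake_remote_embedder(texts: Sequence[str]) -> List[List[float]]:
    # Stand-in for a call to an external embedding API (no real network access).
    return [[float(len(text)), float(text.count(" "))] for text in texts]
```

With this shape the package itself does not need to depend on sentence_transformers; users who want a local model can still wrap one in a callable on their side.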

chatsky/core/vector_store.py

RLKRo · 2025-04-01T16:42:48Z

chatsky/responses/rag.py

+from typing import Dict, Any
+from sentence_transformers import SentenceTransformer
+from qdrant_client import QdrantClient
+from chatsky.responses.base_response import BaseResponse


There should be an empty line separating import groups (PEP 8):

Imports are always put at the top of the file, just after any module comments and docstrings, and before module globals and constants.

Imports should be grouped in the following order:

1. Standard library imports.
2. Related third party imports.
3. Local application/library specific imports.

You should put a blank line between each group of imports.

from typing import Dict, Any

from sentence_transformers import SentenceTransformer
from qdrant_client import QdrantClient

from chatsky.responses.base_response import BaseResponse

RLKRo · 2025-04-01T16:46:52Z

chatsky/responses/rag.py

+
+    def execute(self, user_query: str, ctx: Dict[str, Any], **kwargs) -> Dict[str, Any]:
+        if "cached_rag_context" in ctx:
+            context = ctx["cached_rag_context"]


context and ctx are similar names, which can be confusing. It would be better to rename context to something else (e.g. rag_context).
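With the rename applied, the caching logic from the diff could read as in the sketch below. The function `get_rag_context` and its `search` parameter are hypothetical; only the `cached_rag_context` key and the `ctx` / `user_query` names come from the diff.

```python
from typing import Any, Callable, Dict, List


def get_rag_context(
    ctx: Dict[str, Any],
    user_query: str,
    search: Callable[[str], List[str]],
) -> str:
    """Return the cached RAG context from ctx, retrieving it on a cache miss."""
    if "cached_rag_context" in ctx:
        return ctx["cached_rag_context"]
    # Cache miss: run the search and store the joined result in ctx.
    rag_context = "\n".join(search(user_query))
    ctx["cached_rag_context"] = rag_context
    return rag_context
```

As in the diff, the cache is checked only for presence, so a later call with a different query returns the context cached for the first one.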

Dependencies have been updated

chatsky/llm/rag.py

pyproject.toml

tests/llm/test_rag.py

RLKRo · 2025-05-19T19:07:06Z

tests/llm/test_rag.py

+    assert docs == expected_docs
+
+
+async def test_retriever_with_threshold(pipeline_with_retrievers):


If these two tests were previously implemented as one parametrized test in order to keep them grouped (since both are tests for retriever), you can achieve this with test classes instead:
https://docs.pytest.org/en/stable/getting-started.html#group-multiple-tests-in-a-class

So it would look like this:

class TestRetriever:
    async def test_get_documents(self, pipeline_with_retrievers):
        ...

    async def test_threshold(self, pipeline_with_retrievers):
        ...
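A self-contained variant of the same grouping, for illustration only: `FakeRetriever` and its scores are hypothetical stand-ins for the `pipeline_with_retrievers` fixture, and the tests call `asyncio.run` directly so the sketch runs without an async pytest plugin.

```python
import asyncio
from typing import List, Tuple


class FakeRetriever:
    """Hypothetical stand-in for the retriever under test."""

    def __init__(self, scored_docs: List[Tuple[str, float]]):
        self.scored_docs = scored_docs

    async def get_documents(self, query: str, threshold: float = 0.0) -> List[str]:
        # Keep documents whose similarity score reaches the threshold.
        return [doc for doc, score in self.scored_docs if score >= threshold]


class TestRetriever:
    # Shared by both tests; pytest collects every test_* method in the class.
    retriever = FakeRetriever([("doc_a", 0.9), ("doc_b", 0.4)])

    def test_get_documents(self):
        docs = asyncio.run(self.retriever.get_documents("query"))
        assert docs == ["doc_a", "doc_b"]

    def test_threshold(self):
        docs = asyncio.run(self.retriever.get_documents("query", threshold=0.5))
        assert docs == ["doc_a"]
```

Both tests then appear under `TestRetriever::` in the pytest report and can be selected together with `-k TestRetriever`.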

SPI315 and others added 2 commits March 30, 2025 23:16

rag base init commit

341f161

RLKRo added the enhancement (New feature or request) label Mar 31, 2025

RLKRo commented Mar 31, 2025

RLKRo commented Apr 1, 2025

SPI315 added 5 commits April 20, 2025 22:12

Pipeline accepts dict with retrievers/vector db

143dc70

Add get_document function

fa41774

Add unit tests for chatsky/llm/rag.py

07e0b3a

Dependencies have been updated

Formatted

2398f1e

Add docstrings

1c81b35

RLKRo commented May 15, 2025

SPI315 and others added 2 commits May 15, 2025 22:01

Edits based on comments

d3198fa

Add basic get_documents_tutorial

c1bb88f

RLKRo commented May 19, 2025
