[api][runtime] Introduce long-term memory in python #332
Conversation
rename memory item: force-pushed from e85caea to 7c14985
from flink_agents.api.prompts.prompt import Prompt


class ReduceStrategy(Enum):
Not sure about the name Reduce. I think Compact might be better.
    SUMMARIZE = "summarize"


class ReduceSetup(BaseModel):
I'd suggest naming this CompactionStrategy and making it an abstract class for which we can provide different implementations, so we can strictly limit which arguments can be specified for each strategy. We can call the current ReduceStrategy CompactionStrategyType.
I think CompactionStrategy.trim(n) might be more straightforward for users, compared to ReduceSetup.trim_setup(n).
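A minimal sketch of that shape, following the suggestion above; the `TrimStrategy` name and the `store` parameter are illustrative, while the `delete(memory_set, offset, n)` call mirrors what this PR already does in `_trim`. This is not the actual implementation:

```python
from abc import ABC, abstractmethod
from typing import Any

from pydantic import BaseModel


class CompactionStrategy(BaseModel, ABC):
    """Each concrete strategy declares exactly the arguments it needs,
    instead of sharing a free-form arguments dict."""

    @abstractmethod
    def compact(self, memory_set: Any, store: Any) -> None:
        """Shrink the memory set using the backing store."""

    @staticmethod
    def trim(n: int) -> "TrimStrategy":
        """Factory so users can write CompactionStrategy.trim(n)."""
        return TrimStrategy(n=n)


class TrimStrategy(CompactionStrategy):
    n: int

    def compact(self, memory_set: Any, store: Any) -> None:
        # Drop the n oldest items, mirroring the delete call used by _trim in this PR.
        store.delete(memory_set=memory_set, offset=0, n=self.n)


strategy = CompactionStrategy.trim(5)  # usage as proposed in the comment above
```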
    id: str
    value: Any
    compacted: bool = False
    created_time: DatetimeRange
Suggested change:
-    created_time: DatetimeRange
+    created_time: DatetimeRange | datetime
    size: int = 0
    capacity: int
    reduce_setup: ReduceSetup
    item_ids: List[str] = Field(default_factory=list)
Why do we need to store ids of all the items?
    capacity: int
    reduce_setup: ReduceSetup
    item_ids: List[str] = Field(default_factory=list)
    reduced: bool = False
And why do we need to know whether a memory set has been reduced/compacted?
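For readers following along, the fields being questioned, pieced together from the diff fragments above into one place; the `ReduceSetup` stub here is simplified (the PR's class presumably also selects a ReduceStrategy):

```python
from typing import Any, Dict, List

from pydantic import BaseModel, Field


class ReduceSetup(BaseModel):
    # Simplified stub for the sketch; not the full class from the PR.
    arguments: Dict[str, Any] = Field(default_factory=dict)


class MemorySet(BaseModel):
    size: int = 0
    capacity: int
    reduce_setup: ReduceSetup
    item_ids: List[str] = Field(default_factory=list)  # questioned: why store every item id?
    reduced: bool = False  # questioned: why track whether the set was already compacted?
```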
    )

    # Connection configuration
    persist_directory: str | None = Field(
what is this directory for?
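If this follows the usual Chroma pattern, it is the on-disk location handed to a persistent client so that stored embeddings survive process restarts. A sketch assuming the standard `chromadb` client API (the path and collection name are illustrative):

```python
import chromadb

persist_directory = "/tmp/flink-agents-long-term-memory"  # illustrative path

if persist_directory is not None:
    # Data is written to this directory and reloaded on restart.
    client = chromadb.PersistentClient(path=persist_directory)
else:
    # No directory configured: keep everything in memory for the process lifetime.
    client = chromadb.EphemeralClient()

collection = client.get_or_create_collection(name="long_term_memory")
```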
        if memory_set.size >= memory_set.capacity:
            # trigger reduce operation to manage memory set size.
            self._reduce(memory_set)
This can be extremely slow. We should proactively do the compaction.
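One possible shape for running compaction proactively in the background instead of on the `add()` path. Everything here (the watermark, the single-worker executor, and the assumption that a memory set has a stable identifier) is illustrative, not part of the PR:

```python
import threading
from concurrent.futures import ThreadPoolExecutor
from typing import Any


class ProactiveCompactor:
    """Start compaction once a set nears capacity, so a later add() never
    has to compact synchronously."""

    def __init__(self, store: Any, high_watermark: float = 0.8) -> None:
        self._store = store  # object exposing the PR's _reduce(memory_set) method
        self._high_watermark = high_watermark
        self._executor = ThreadPoolExecutor(max_workers=1)
        self._in_flight: set = set()
        self._lock = threading.Lock()

    def maybe_compact(self, memory_set: Any, set_key: str) -> None:
        # set_key: a stable identifier for the memory set (assumed to exist).
        if memory_set.size < memory_set.capacity * self._high_watermark:
            return
        with self._lock:
            if set_key in self._in_flight:
                return  # a compaction for this set is already running
            self._in_flight.add(set_key)
        self._executor.submit(self._run, memory_set, set_key)

    def _run(self, memory_set: Any, set_key: str) -> None:
        try:
            self._store._reduce(memory_set)
        finally:
            with self._lock:
                self._in_flight.discard(set_key)
```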
        self.client.delete_collection(name=name)

    @override
    def add(self, memory_set: MemorySet, memory_item: str | ChatMessage) -> None:
I have a feeling that adding items to long-term memory can take time because of the embedding step. We should probably also provide async APIs.
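A minimal sketch of what async variants could look like, wrapping the existing blocking calls with a thread offload so the embedding work does not block the event loop; the mixin name and the `search` parameters are assumptions:

```python
import asyncio
from typing import Any


class AsyncLongTermMemoryMixin:
    """Assumes the class it is mixed into provides the synchronous add() and
    search() methods from this PR."""

    async def add_async(self, memory_set: Any, memory_item: Any) -> None:
        # add() embeds the item, which may be slow; run it off the event loop.
        await asyncio.to_thread(self.add, memory_set, memory_item)

    async def search_async(self, memory_set: Any, query: str, **kwargs: Any) -> Any:
        return await asyncio.to_thread(self.search, memory_set, query, **kwargs)
```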
        return self.slice(memory_set=memory_set, offset=offset, n=n)

    @override
    def search(
Same here.
    def _trim(self, memory_set: MemorySet) -> None:
        reduce_setup: ReduceSetup = memory_set.reduce_setup
        n = reduce_setup.arguments.get("n")
        self.delete(memory_set=memory_set, offset=0, n=n)

    def _summarize(self, memory_set: MemorySet) -> None:
Are these methods specialized for this class?
Hi, @alnzng. There's a design issue related to the vector store that I'd appreciate your help reviewing.

As described in the design doc #339, the long-term memory of flink-agents is also based on a vector store. Currently, I provide an implementation based on Chroma. In this implementation, I directly use the Chroma client rather than the flink-agents vector store abstraction.

@xintongsong believes that we can directly build upon the Flink-Agents vector store. I think this makes sense, but it requires adding some interfaces to the vector store abstraction. These interfaces may not be achievable for every vector store; I will conduct research and refinement afterward.
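Purely as an illustration of the kind of interfaces being discussed, here is a sketch of the capabilities long-term memory would need from a vector store beyond embedding and similarity search. Every method name below is made up for the sketch and is not the existing Flink-Agents VectorStore API:

```python
from abc import ABC, abstractmethod
from typing import Any, List, Sequence


class LongTermMemoryStoreSupport(ABC):
    @abstractmethod
    def upsert(self, collection: str, ids: Sequence[str], documents: Sequence[str]) -> None:
        """Insert or update documents (embedding them) in a named collection."""

    @abstractmethod
    def get(self, collection: str, ids: Sequence[str]) -> List[Any]:
        """Fetch stored documents by id, e.g. to slice a memory set."""

    @abstractmethod
    def delete(self, collection: str, ids: Sequence[str]) -> None:
        """Remove documents by id, e.g. when trimming a memory set."""

    @abstractmethod
    def drop_collection(self, collection: str) -> None:
        """Remove a whole collection, e.g. when a memory set is deleted."""
```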
Linked issue: #331
Purpose of change
Introduce the long-term memory interface in Python, and provide an implementation based on Chroma.
This is the first PR of three to introduce long-term memory in Python:
Tests
Unit test
API
Yes, this adds long-term-memory-related APIs.
Documentation
doc-needed