feat: add LlamaIndex memory integration example (Python + TypeScript) #545

m1lestones wants to merge 2 commits into plastic-labs:main
Conversation
Walkthrough

Adds a LlamaIndex example demonstrating Honcho-backed persistent memory, with parallel Python and TypeScript implementations, documentation, and tools to save and retrieve conversation turns and expose memory-query tooling to LlamaIndex agents.
Sequence Diagram

```mermaid
sequenceDiagram
    actor User
    participant App as LlamaIndex App
    participant Honcho as Honcho Client
    participant Agent as ReAct Agent
    participant LLM as OpenAI LLM
    User->>App: send message
    App->>Honcho: save_memory(user_id, message, "user")
    Honcho-->>App: saved
    App->>Honcho: get_context(ctx, tokens=2000)
    Honcho-->>App: conversation history
    App->>App: build system prompt + history
    App->>Agent: init with system prompt + query_memory tool
    Agent->>Agent: process message (may call query_memory)
    Agent->>Honcho: peer.chat(query) if invoked
    Honcho-->>Agent: memory results
    Agent->>LLM: request with prompt + context
    LLM-->>Agent: response
    Agent-->>App: response
    App->>Honcho: save_memory(user_id, response, "assistant")
    Honcho-->>App: saved
    App-->>User: return response
```
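The turn lifecycle in the diagram can be sketched in plain Python. This is a hypothetical stub, not code from the PR: `app`, its `honcho` and `agent` attributes, and `agent.respond` are stand-ins for the real Honcho SDK client and LlamaIndex ReAct agent.

```python
def run_turn(app, user_id: str, user_input: str) -> str:
    """One conversation turn following the sequence diagram above.

    `app` bundles stubbed honcho/agent backends; in the real example these
    would be the Honcho SDK client and a LlamaIndex ReAct agent.
    """
    # 1. Persist the user message before the agent runs.
    app.honcho.save_memory(user_id, user_input, "user")
    # 2. Pull recent history and fold it into the system prompt.
    history = app.honcho.get_context(user_id, tokens=2000)
    prompt = "You are a helpful assistant.\n" + "\n".join(
        f"{role}: {content}" for role, content in history
    )
    # 3. Run the agent, then persist its reply.
    response = app.agent.respond(prompt, user_input)
    app.honcho.save_memory(user_id, response, "assistant")
    return response
```

The ordering matters: saving the user message first means `get_context` on the *next* turn already includes it, which is exactly what the diagram shows.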
Estimated code review effort: 🎯 3 (Moderate) | ⏱️ ~25 minutes
🚥 Pre-merge checks: ✅ 2 passed | ❌ 1 failed (1 warning)
Actionable comments posted: 7
🧹 Nitpick comments (1)
examples/llamaindex/typescript/tools/client.ts (1)
19-24: Consider memoizing the Honcho client instance.
`getClient()` creates a new SDK client on every call. In this example flow, that happens multiple times per turn; a shared instance keeps setup overhead lower.

♻️ Proposed refactor

```diff
+let cachedClient: Honcho | null = null;
+
 export function getClient(): Honcho {
+  if (cachedClient) return cachedClient;
   const apiKey = process.env.HONCHO_API_KEY;
   if (!apiKey) throw new Error("HONCHO_API_KEY is required.");
   const workspaceId = process.env.HONCHO_WORKSPACE_ID ?? "default";
-  return new Honcho({ apiKey, workspaceId });
+  cachedClient = new Honcho({ apiKey, workspaceId });
+  return cachedClient;
 }
```

🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed. In `@examples/llamaindex/typescript/tools/client.ts` around lines 19 - 24, getClient() currently constructs a new Honcho SDK client on every call; change it to return a cached singleton by adding a module-level variable (e.g., let cachedClient: Honcho | null = null) and only instantiate new Honcho({ apiKey, workspaceId }) when cachedClient is null, then assign and return cachedClient; ensure you still validate HONCHO_API_KEY and HONCHO_WORKSPACE_ID as before and reference the same getClient and Honcho symbols.
🤖 Prompt for all review comments with AI agents
Verify each finding against the current code and only fix it if needed.
Inline comments:
In `@examples/llamaindex/python/main.py`:
- Around line 78-85: Wrap the interactive loop's external calls in error
handling so one SDK/provider exception doesn't exit the REPL: surround the call
to chat(_user_id, _user_input, _session_id) (and the subsequent print) with a
try/except, catch broad exceptions, log or print a concise error message
including exception info, and continue the loop (preserving _user_id/_session_id
and prompting again) so the session remains alive after transient failures.
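The guarded loop described above can be sketched as a minimal, hypothetical Python version. The `chat` callable and the `prompt`/`echo` hooks are stand-ins for the example's real REPL, not code from this PR:

```python
def run_repl(chat, user_id: str, session_id: str, prompt=input, echo=print) -> None:
    """Interactive loop that survives transient chat failures.

    `chat` plays the role of the example's chat(user_id, message, session_id)
    function; `prompt`/`echo` default to input/print and are injectable so the
    loop can be exercised in tests.
    """
    while True:
        user_input = prompt("You: ").strip()
        if user_input.lower() in ("quit", "exit"):
            break
        if not user_input:
            continue
        try:
            echo(f"Agent: {chat(user_id, user_input, session_id)}\n")
        except Exception as exc:  # broad on purpose: keep the session alive
            echo(f"Error during chat turn: {exc!r} -- please try again.")
```

The broad `except Exception` is deliberate here: a transient SDK or provider error prints a concise message and the loop prompts again, preserving the session.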
In `@examples/llamaindex/python/tools/get_context.py`:
- Around line 19-20: Update the docstring that describes the returned list to
reflect the actual shape produced by to_openai(): state that entries may include
roles "user", "assistant", or "system" and that entries can optionally include a
"name" field, not just {"role","content"}; reference the helper that converts
messages (to_openai()) and update the description in the function (get_context)
so callers know the list may contain system-role items and optional name keys
and that an empty list is returned when there are no messages.
In `@examples/llamaindex/python/tools/query_memory.py`:
- Around line 30-31: The current guard if not query: in the function handling
queries allows whitespace-only strings; update the validation to treat strings
containing only whitespace as empty by using a trimmed check (e.g., check
query.strip() or equivalent) before raising ValueError("query must not be
empty"), so any whitespace-only input triggers the same ValueError; locate the
validation where query is inspected and replace or augment the condition
accordingly.
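As a standalone sketch, the trimmed check might look like this (the function name `validate_query` is illustrative, not from the PR):

```python
def validate_query(query: str) -> str:
    """Return the trimmed query, rejecting empty or whitespace-only input."""
    cleaned = query.strip()
    if not cleaned:
        # `if not query:` alone would let "   " through; strip first.
        raise ValueError("query must not be empty")
    return cleaned
```

Returning the cleaned string (rather than just validating) also lets the caller forward the trimmed query downstream in one step.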
In `@examples/llamaindex/python/tools/save_memory.py`:
- Around line 25-36: The code currently treats any non-"assistant" role as a
user message, risking silent corruption; update the logic around role, sender,
and message creation (variables: role, sender, assistant_peer, user_peer,
session.add_messages) to validate role explicitly—allow only "assistant" or
"user" (or your canonical enum), raise a ValueError on invalid values, and only
map "assistant"->assistant_peer and "user"->user_peer before calling
session.add_messages to prevent misattributed writes.
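A minimal sketch of the explicit role mapping described above; the peer arguments are placeholders here, whereas the real code would pass Honcho peer objects:

```python
VALID_ROLES = frozenset({"user", "assistant"})

def resolve_peer(role: str, user_peer, assistant_peer):
    """Map a message role to the peer that should own the message.

    Raises ValueError for anything outside the canonical roles, so a typo
    can never be silently written as a user message.
    """
    if role not in VALID_ROLES:
        raise ValueError(f"invalid role {role!r}; expected one of {sorted(VALID_ROLES)}")
    return assistant_peer if role == "assistant" else user_peer
```

Failing loudly on an unknown role is the point: the original fallback (`else: user_peer`) would misattribute, say, `"system"` messages to the user without any signal.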
In `@examples/llamaindex/README.md`:
- Line 9: Update the Features wording to clarify that context injection differs
by implementation: change the sentence referencing `prefix_messages` to indicate
that non-TypeScript implementations use `prefix_messages` while the TypeScript
client uses `chatHistory` for injecting conversation history into the LLM;
mention both symbols (`prefix_messages`, `chatHistory`) and phrase it like
"Conversation history is retrieved from Honcho and formatted for the LLM before
every request (uses `prefix_messages` in most SDKs; TypeScript client uses
`chatHistory`)."
In `@examples/llamaindex/typescript/main.ts`:
- Around line 75-90: The readline interface `rl` is not guaranteed to be closed
if `chat(...)` throws; wrap the interactive loop in a try/finally so
`rl.close()` always runs: create `rl` as now, then put the while loop and calls
to `chat(userId, userInput, sessionId)` inside a try block and call `rl.close()`
in the finally block (keeping the existing early-close on "quit"/"exit"
behavior, but still ensure `rl.close()` in finally for error-safe cleanup).
- Around line 27-92: Export the chat function as a named export and prevent
automatic CLI execution on import by wrapping the main() invocation in a
module-entry guard; specifically, add a named export for chat (export async
function chat...) and replace the unconditional main().catch(console.error) call
with a runtime check (e.g., if (require && require.main === module) {
main().catch(console.error); } for CommonJS or if (import.meta &&
import.meta.main) { main().catch(console.error); } for ESM) so importing this
module does not run the CLI.
---
Nitpick comments:
In `@examples/llamaindex/typescript/tools/client.ts`:
- Around line 19-24: getClient() currently constructs a new Honcho SDK client on
every call; change it to return a cached singleton by adding a module-level
variable (e.g., let cachedClient: Honcho | null = null) and only instantiate new
Honcho({ apiKey, workspaceId }) when cachedClient is null, then assign and
return cachedClient; ensure you still validate HONCHO_API_KEY and
HONCHO_WORKSPACE_ID as before and reference the same getClient and Honcho
symbols.
🪄 Autofix (Beta)
Fix all unresolved CodeRabbit comments on this PR:
- Push a commit to this branch (recommended)
- Create a new PR with the fixes
ℹ️ Review info
⚙️ Run configuration
Configuration used: Organization UI
Review profile: CHILL
Plan: Pro
Run ID: 80a33bc3-5ab4-475f-a12a-9783acf12aac
📒 Files selected for processing (15)
- examples/llamaindex/README.md
- examples/llamaindex/python/main.py
- examples/llamaindex/python/pyproject.toml
- examples/llamaindex/python/tools/__init__.py
- examples/llamaindex/python/tools/client.py
- examples/llamaindex/python/tools/get_context.py
- examples/llamaindex/python/tools/query_memory.py
- examples/llamaindex/python/tools/save_memory.py
- examples/llamaindex/typescript/main.ts
- examples/llamaindex/typescript/package.json
- examples/llamaindex/typescript/tools/client.ts
- examples/llamaindex/typescript/tools/getContext.ts
- examples/llamaindex/typescript/tools/queryMemory.ts
- examples/llamaindex/typescript/tools/saveMemory.ts
- examples/llamaindex/typescript/tsconfig.json
```
    A list of message dicts: ``[{"role": "user" | "assistant", "content": "..."}]``.
    Returns an empty list if the session has no messages yet.
```
Return contract is too narrow for actual to_openai() output.
to_openai() can include "system" role entries and optional "name" fields, so this docstring currently over-promises a stricter shape than returned.
✏️ Suggested doc fix
```diff
-        A list of message dicts: ``[{"role": "user" | "assistant", "content": "..."}]``.
-        Returns an empty list if the session has no messages yet.
+        OpenAI-format message dicts with ``role``/``content`` (and optional ``name``).
+        Depending on available memory artifacts, the list may include ``"system"``
+        messages (e.g., summary/peer metadata) before conversation turns.
```
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.
In `@examples/llamaindex/python/tools/get_context.py` around lines 19 - 20, Update
the docstring that describes the returned list to reflect the actual shape
produced by to_openai(): state that entries may include roles "user",
"assistant", or "system" and that entries can optionally include a "name" field,
not just {"role","content"}; reference the helper that converts messages
(to_openai()) and update the description in the function (get_context) so
callers know the list may contain system-role items and optional name keys and
that an empty list is returned when there are no messages.
- **Persistent Memory**: Every conversation turn is saved to Honcho and automatically injected into the agent's system prompt on the next turn.
- **Natural Language Recall**: The agent can query Honcho's Dialectic API to answer questions like "What are my hobbies?" or "What did we talk about last time?"
- **Context Injection**: Conversation history is retrieved from Honcho and formatted for the LLM before every request via `prefix_messages`.
Clarify TypeScript context-injection wording in Features.
Line 9 currently implies `prefix_messages` is used by all implementations, but the TypeScript example uses `chatHistory`.
📝 Proposed fix

```diff
-- **Context Injection**: Conversation history is retrieved from Honcho and formatted for the LLM before every request via `prefix_messages`.
+- **Context Injection**: Conversation history is retrieved from Honcho and formatted for the LLM before every request via `prefix_messages` (Python) or `chatHistory` (TypeScript).
```
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.
In `@examples/llamaindex/README.md` at line 9, Update the Features wording to
clarify that context injection differs by implementation: change the sentence
referencing `prefix_messages` to indicate that non-TypeScript implementations
use `prefix_messages` while the TypeScript client uses `chatHistory` for
injecting conversation history into the LLM; mention both symbols
(`prefix_messages`, `chatHistory`) and phrase it like "Conversation history is
retrieved from Honcho and formatted for the LLM before every request (uses
`prefix_messages` in most SDKs; TypeScript client uses `chatHistory`)."
```typescript
async function chat(
  userId: string,
  message: string,
  sessionId: string
): Promise<string> {
  const ctx: HonchoContext = createContext(userId, sessionId);

  const base =
    "You are a helpful assistant with persistent memory powered by Honcho. " +
    "You remember users across conversations. " +
    "When a user asks what you remember about them, use the query_memory tool.";

  const history = await getContext(ctx, 2000);
  const systemContent =
    history.length > 0
      ? `${base}\n\n## Conversation History\n${history
          .map(
            (m) =>
              `${m.role.charAt(0).toUpperCase() + m.role.slice(1)}: ${m.content}`
          )
          .join("\n")}`
      : base;

  const llm = new OpenAI({ model: "gpt-4.1-mini" });
  const agent = new ReActAgent({
    tools: [makeQueryMemoryTool(ctx)],
    llm,
    verbose: false,
  });

  await saveMemory(userId, message, "user", sessionId);

  const result = await agent.chat({
    message,
    chatHistory: [{ role: "system", content: systemContent }],
  });
  const response = result.message.content as string;

  await saveMemory(userId, response, "assistant", sessionId);

  return response;
}

async function main() {
  console.log("LlamaIndex HonchoMemoryAgent — type 'quit' to exit\n");
  const userId = "demo-user";
  const sessionId = "demo-session";

  const rl = readline.createInterface({
    input: process.stdin,
    output: process.stdout,
  });

  while (true) {
    const userInput = (await rl.question("You: ")).trim();
    if (!userInput) continue;
    if (["quit", "exit"].includes(userInput.toLowerCase())) {
      rl.close();
      break;
    }
    const response = await chat(userId, userInput, sessionId);
    console.log(`Agent: ${response}\n`);
  }
}

main().catch(console.error);
```
🧩 Analysis chain
🏁 Script executed:
```shell
#!/bin/bash
# Verify the import-side-effect issue in examples/llamaindex/typescript/main.ts
rg -nP '^\s*async function chat\(' examples/llamaindex/typescript/main.ts
rg -nP '^\s*main\(\)\.catch\(console\.error\);' examples/llamaindex/typescript/main.ts
# Expected: both patterns match in current code.
```

Repository: plastic-labs/honcho
Length of output: 118
🏁 Script executed:
```shell
#!/bin/bash
# Check if README mentions importing chat from main.ts or this file
find . -name README.md -o -name readme.md | head -20
```

Repository: plastic-labs/honcho
Length of output: 437
🏁 Script executed:
```shell
#!/bin/bash
# Search for any references to chat import or this module in docs
rg -i "llamaindex.*chat|import.*chat.*main" --type md
```

Repository: plastic-labs/honcho
Length of output: 126
🏁 Script executed:
```shell
#!/bin/bash
# Check if any other files import from this main.ts
rg "from.*examples/llamaindex/typescript/main|from.*\./main" --type ts --type tsx --type js --type jsx
```

Repository: plastic-labs/honcho
Length of output: 90
🏁 Script executed:
```shell
#!/bin/bash
# Verify import.meta.main support in TypeScript/Node.js context
# Check the package.json and tsconfig to understand the environment
cat examples/llamaindex/typescript/package.json 2>/dev/null || echo "package.json not found"
cat examples/llamaindex/typescript/tsconfig.json 2>/dev/null || echo "tsconfig.json not found"
```

Repository: plastic-labs/honcho
Length of output: 861
Export chat and guard CLI bootstrap to avoid import side effects.
The README documents importing `chat` as a named export, but the function is not exported. Additionally, line 92 unconditionally invokes `main().catch(console.error)`, which runs the CLI as a side effect of module import instead of exposing a reusable function.
Proposed fix

```diff
-async function chat(
+export async function chat(
   userId: string,
   message: string,
   sessionId: string
 ): Promise<string> {
@@
-main().catch(console.error);
+if (import.meta.main) {
+  main().catch(console.error);
+}
```

🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.
In `@examples/llamaindex/typescript/main.ts` around lines 27 - 92, Export the chat
function as a named export and prevent automatic CLI execution on import by
wrapping the main() invocation in a module-entry guard; specifically, add a
named export for chat (export async function chat...) and replace the
unconditional main().catch(console.error) call with a runtime check (e.g., if
(require && require.main === module) { main().catch(console.error); } for
CommonJS or if (import.meta && import.meta.main) { main().catch(console.error);
} for ESM) so importing this module does not run the CLI.
```typescript
  const rl = readline.createInterface({
    input: process.stdin,
    output: process.stdout,
  });

  while (true) {
    const userInput = (await rl.question("You: ")).trim();
    if (!userInput) continue;
    if (["quit", "exit"].includes(userInput.toLowerCase())) {
      rl.close();
      break;
    }
    const response = await chat(userId, userInput, sessionId);
    console.log(`Agent: ${response}\n`);
  }
}
```
Close readline in a finally block for error-safe cleanup.
If chat(...) throws, rl.close() is skipped. Wrap the loop in try/finally so cleanup always runs.
💡 Proposed fix

```diff
   const rl = readline.createInterface({
     input: process.stdin,
     output: process.stdout,
   });
-  while (true) {
-    const userInput = (await rl.question("You: ")).trim();
-    if (!userInput) continue;
-    if (["quit", "exit"].includes(userInput.toLowerCase())) {
-      rl.close();
-      break;
-    }
-    const response = await chat(userId, userInput, sessionId);
-    console.log(`Agent: ${response}\n`);
+  try {
+    while (true) {
+      const userInput = (await rl.question("You: ")).trim();
+      if (!userInput) continue;
+      if (["quit", "exit"].includes(userInput.toLowerCase())) {
+        break;
+      }
+      const response = await chat(userId, userInput, sessionId);
+      console.log(`Agent: ${response}\n`);
+    }
+  } finally {
+    rl.close();
   }
 }
```
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.
In `@examples/llamaindex/typescript/main.ts` around lines 75 - 90, The readline
interface `rl` is not guaranteed to be closed if `chat(...)` throws; wrap the
interactive loop in a try/finally so `rl.close()` always runs: create `rl` as
now, then put the while loop and calls to `chat(userId, userInput, sessionId)`
inside a try block and call `rl.close()` in the finally block (keeping the
existing early-close on "quit"/"exit" behavior, but still ensure `rl.close()` in
finally for error-safe cleanup).
…PL error handling
Actionable comments posted: 1
🤖 Prompt for all review comments with AI agents
Verify each finding against the current code and only fix it if needed.
Inline comments:
In `@examples/llamaindex/python/main.py`:
- Around line 27-28: Add a whitespace-only input guard at the start of the
chat(user_id: str, message: str, session_id: str) -> str function: call message
= message.strip() (or a separate local variable) and if the result is empty,
raise a ValueError (with a brief message like "message must not be empty or
whitespace") so the function does not save/send blank content; apply the same
strip+ValueError pattern to the other handler referenced around the same area
(the second chat-like call at lines ~66-67) to ensure consistent validation.
ℹ️ Review info
⚙️ Run configuration
Configuration used: Organization UI
Review profile: CHILL
Plan: Pro
Run ID: aec3604c-e9f3-4ad8-8254-e971a98155bd
📒 Files selected for processing (3)
- examples/llamaindex/python/main.py
- examples/llamaindex/python/tools/query_memory.py
- examples/llamaindex/python/tools/save_memory.py
🚧 Files skipped from review as they are similar to previous changes (2)
- examples/llamaindex/python/tools/query_memory.py
- examples/llamaindex/python/tools/save_memory.py
```python
def chat(user_id: str, message: str, session_id: str) -> str:
    """Run one conversation turn with persistent Honcho memory.
```
Guard chat() against whitespace-only input.
chat() is reusable outside the REPL, and currently accepts " " which gets saved and sent to the agent. Add a local strip() validation at function entry.
Suggested patch

```diff
 def chat(user_id: str, message: str, session_id: str) -> str:
@@
-    ctx = HonchoContext(user_id=user_id, session_id=session_id)
+    cleaned_message = message.strip()
+    if not cleaned_message:
+        raise ValueError("message must not be empty or whitespace")
+
+    ctx = HonchoContext(user_id=user_id, session_id=session_id)
@@
-    save_memory(user_id, message, "user", session_id)
-    response = str(agent.chat(message))
+    save_memory(user_id, cleaned_message, "user", session_id)
+    response = str(agent.chat(cleaned_message))
```

Also applies to: 66-67
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.
In `@examples/llamaindex/python/main.py` around lines 27 - 28, Add a
whitespace-only input guard at the start of the chat(user_id: str, message: str,
session_id: str) -> str function: call message = message.strip() (or a separate
local variable) and if the result is empty, raise a ValueError (with a brief
message like "message must not be empty or whitespace") so the function does not
save/send blank content; apply the same strip+ValueError pattern to the other
handler referenced around the same area (the second chat-like call at lines
~66-67) to ensure consistent validation.
Closing this as part of a broader prioritization shift and in an effort to minimize maintenance burden. Thanks for putting in the work on this!
Summary
- Adds `examples/llamaindex/` with both Python and TypeScript implementations of Honcho memory for LlamaIndex agents
- Python: `ReActAgent.from_tools()` with `prefix_messages` for dynamic system prompt injection
- TypeScript: `new ReActAgent({ tools, llm })` with `chatHistory` for system context
- Follows the existing `examples/openai-agents/` example

What's included
How it works
- Conversation history is injected via `prefix_messages` (Python) or `chatHistory` (TypeScript) before every LLM call.
- `make_query_memory_tool(ctx)` / `makeQueryMemoryTool(ctx)` wraps a `FunctionTool` that calls Honcho's Dialectic API, closing over the user context.
- `chat()` persists the user message before the agent runs and the assistant response after.
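The closure-over-context pattern for the memory tool can be sketched in plain Python. The Dialectic call is stubbed behind an assumed `ctx.query` interface; in the actual example the returned function would be wrapped in a LlamaIndex `FunctionTool`:

```python
def make_query_memory_tool(ctx):
    """Build a query_memory callable closed over the user context.

    `ctx.query` stands in for the Honcho Dialectic API call; the real
    example wraps the returned function in a LlamaIndex FunctionTool so
    the agent can invoke it by name.
    """
    def query_memory(query: str) -> str:
        cleaned = query.strip()
        if not cleaned:
            raise ValueError("query must not be empty")
        return ctx.query(cleaned)  # delegate to the (stubbed) memory backend
    return query_memory
```

Closing over `ctx` means the tool signature the agent sees is just `query_memory(query)`; the user and session identity never leak into the tool's arguments.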
Python:
- `pip install llama-index llama-index-llms-openai honcho-ai python-dotenv`
- `cd python && python main.py`

TypeScript:

- `cd typescript && bun install && bun run main.ts`

🤖 Generated with Claude Code