HaluMem is the first operation level hallucination evaluation benchmark tailored to agent memory systems.
benchmark ai memory memos hallucination long-term-memory memzero llm hallucination-evaluation llm-memory mem0 memory-system memobase
-
Updated
Nov 9, 2025 - Python