A RAG-based personal knowledge base using natural language for both input and querying, built with Go.
- Vector Database: Milvus
- Metadata Storage: Redis
- Embedding Generation: VoyageAI
- Reranking: VoyageAI Rerank
- LLM: Anthropic Claude Sonnet
- Email Service: Mailjet (interface-based design)
- Backend Language: Go
The personal knowledge base consists of these major components:
- Document Processing Pipeline: Handles incoming natural language text
- Storage System: Manages both vector embeddings and metadata
- Query Processing System: Processes natural language queries with reranking for improved accuracy
- Response Generation System: Creates natural language responses
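The four components above could be expressed as Go interfaces along these lines. This is only a sketch: the interface names, method signatures, and the `stubGenerator` type are illustrative, not the actual definitions under `internal/`.

```go
package main

import (
	"fmt"
	"strings"
)

// Hypothetical interfaces sketching the four components; the real
// definitions under internal/ may differ.

// DocumentProcessor handles incoming natural language text.
type DocumentProcessor interface {
	Process(title, content string) (docID string, err error)
}

// Storage manages both vector embeddings and metadata.
type Storage interface {
	StoreVectors(docID string, vectors [][]float32) error
	StoreMetadata(docID string, meta map[string]string) error
}

// QueryProcessor retrieves relevant chunks for a query.
type QueryProcessor interface {
	Retrieve(query string, topK int) (chunks []string, err error)
}

// ResponseGenerator creates natural language responses.
type ResponseGenerator interface {
	Generate(query string, context []string) (string, error)
}

// stubGenerator is a toy ResponseGenerator standing in for Claude.
type stubGenerator struct{}

func (stubGenerator) Generate(query string, context []string) (string, error) {
	return fmt.Sprintf("Q: %s | context: %s", query, strings.Join(context, "; ")), nil
}

func main() {
	var g ResponseGenerator = stubGenerator{}
	out, _ := g.Generate("what is stored?", []string{"chunk A", "chunk B"})
	fmt.Println(out)
}
```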
- Docker and Docker Compose
- Go 1.21 or later
- VoyageAI API key
- Anthropic Claude API key
- Copy the example environment file and fill in your API keys:
```bash
cp example.env .env
```
- Edit `.env` with your VoyageAI and Anthropic API keys
Start all services using Docker Compose:
```bash
docker-compose up -d
```

Navigate to `localhost:3000` to interact with hippocamp using the web UI. Here you can add new documents and ask questions using natural language queries.
Add a document:

```bash
curl -X POST http://localhost:8080/api/documents \
  -H "Content-Type: application/json" \
  -d '{
    "title": "Example Document",
    "content": "This is the content of the document that will be processed and stored in the knowledge base."
  }'
```

List all documents:

```bash
curl http://localhost:8080/api/documents
```

Get a document by ID:

```bash
curl http://localhost:8080/api/documents/{document_id}
```

Query the knowledge base:

```bash
curl -X POST http://localhost:8080/api/query \
  -H "Content-Type: application/json" \
  -d '{
    "text": "What information do you have about example topics?"
  }'
```

Delete a document:

```bash
curl -X DELETE http://localhost:8080/api/documents/{document_id}
```

The system uses VoyageAI's reranking models to improve retrieval quality. This is configured through the following environment variables:
- `RERANKER_ENABLED`: Set to `true` to enable reranking (default: `true`)
- `RERANKER_MODEL`: The reranking model to use (default: `rerank-2`). Available options:
  - `rerank-2`: Best quality, 16K token context
  - `rerank-2-lite`: Good quality with faster speed, 8K token context
  - `rerank-1`: Legacy model (not recommended)
  - `rerank-lite-1`: Legacy model (not recommended)
- `RERANKER_TOP_K`: Number of top results to return after reranking (default: `10`)
Reranking improves retrieval quality by using a more sophisticated cross-encoder model that considers both the query and document together for relevance assessment.
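After the cross-encoder scores each candidate chunk, the system keeps only the best `RERANKER_TOP_K` results. A minimal sketch of that selection step in Go (the `RerankResult` shape is an assumption; VoyageAI's actual response fields may differ):

```go
package main

import (
	"fmt"
	"sort"
)

// RerankResult is a hypothetical shape for one reranker score.
type RerankResult struct {
	Index int     // position in the candidate chunk list
	Score float64 // cross-encoder relevance score
}

// topK sorts results by descending score and keeps the best k,
// mirroring what RERANKER_TOP_K controls.
func topK(results []RerankResult, k int) []RerankResult {
	sorted := append([]RerankResult(nil), results...)
	sort.Slice(sorted, func(i, j int) bool {
		return sorted[i].Score > sorted[j].Score
	})
	if k < len(sorted) {
		sorted = sorted[:k]
	}
	return sorted
}

func main() {
	results := []RerankResult{{0, 0.12}, {1, 0.87}, {2, 0.55}}
	for _, r := range topK(results, 2) {
		fmt.Printf("chunk %d score %.2f\n", r.Index, r.Score)
	}
}
```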
When documents are added, they are:
- Split into chunks with configurable size and overlap
- Embedded using VoyageAI
- Stored in Milvus for vector search
- Recorded in Redis along with their metadata
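The chunking step above can be sketched as follows. This toy version measures size and overlap in bytes; the real splitter's parameters and units may differ (e.g. tokens, with sentence-boundary awareness):

```go
package main

import "fmt"

// chunkText splits text into overlapping chunks. size and overlap
// are byte counts here for simplicity; a production splitter would
// work on runes or tokens and respect sentence boundaries.
func chunkText(text string, size, overlap int) []string {
	if size <= 0 || overlap < 0 || overlap >= size {
		return nil
	}
	step := size - overlap
	var chunks []string
	for start := 0; start < len(text); start += step {
		end := start + size
		if end > len(text) {
			end = len(text)
		}
		chunks = append(chunks, text[start:end])
		if end == len(text) {
			break
		}
	}
	return chunks
}

func main() {
	// With size 4 and overlap 2, consecutive chunks share 2 bytes.
	for i, c := range chunkText("abcdefghij", 4, 2) {
		fmt.Printf("chunk %d: %q\n", i, c)
	}
}
```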
When queries are processed:
- The query is embedded using the same VoyageAI model
- Similar chunks are retrieved from Milvus
- Chunks are used as context for Claude to generate a response
- The response is returned with citations to the source material
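The retrieval step can be illustrated with a brute-force cosine-similarity search. Milvus does this at scale with approximate indexes; this is only an in-memory toy showing the idea:

```go
package main

import (
	"fmt"
	"math"
	"sort"
)

// cosine computes cosine similarity between two embedding vectors.
func cosine(a, b []float32) float64 {
	var dot, na, nb float64
	for i := range a {
		dot += float64(a[i]) * float64(b[i])
		na += float64(a[i]) * float64(a[i])
		nb += float64(b[i]) * float64(b[i])
	}
	if na == 0 || nb == 0 {
		return 0
	}
	return dot / (math.Sqrt(na) * math.Sqrt(nb))
}

// nearest returns chunk indices ordered by similarity to the query
// embedding, most similar first.
func nearest(query []float32, chunks [][]float32) []int {
	idx := make([]int, len(chunks))
	for i := range idx {
		idx[i] = i
	}
	sort.Slice(idx, func(i, j int) bool {
		return cosine(query, chunks[idx[i]]) > cosine(query, chunks[idx[j]])
	})
	return idx
}

func main() {
	query := []float32{1, 0}
	chunks := [][]float32{{0, 1}, {1, 0.1}, {0.5, 0.5}}
	fmt.Println(nearest(query, chunks)) // most similar chunk index first
}
```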
The application is configured via environment variables:
- `VOYAGEAI_API_KEY`: API key for VoyageAI embedding service
- `ANTHROPIC_API_KEY`: API key for Anthropic Claude
- Milvus and Redis connection settings
- Chunking parameters
- Server port and logging level
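A `.env` might look like the following. The API-key and `RERANKER_*` names come from this README; the Milvus, Redis, chunking, server, and logging variable names are illustrative guesses (check the example env file for the real ones):

```env
VOYAGEAI_API_KEY=your-voyageai-key
ANTHROPIC_API_KEY=your-anthropic-key

RERANKER_ENABLED=true
RERANKER_MODEL=rerank-2
RERANKER_TOP_K=10

# The names below are illustrative; see the example env file for the real ones.
MILVUS_HOST=localhost
MILVUS_PORT=19530
REDIS_ADDR=localhost:6379
CHUNK_SIZE=1000
CHUNK_OVERLAP=200
SERVER_PORT=8080
LOG_LEVEL=info
```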
Run the tests:

```bash
go test ./...
```

Project structure:

- `config/`: Configuration loading from environment
- `internal/api/`: HTTP server and API endpoints
- `internal/model/`: Data models
- `internal/processor/`: Document and query processing logic
- `internal/storage/`: Vector and metadata storage interfaces