VCAL-core

VCAL-core is a lightweight, in-process HNSW vector index written in safe Rust with optional SIMD and atomic snapshots.
It’s designed as a minimal building block for semantic caches (e.g., LLM prompt deduplication) and embedded ANN search.

MSRV: 1.56 (edition 2021). Dev-dependencies may require a newer stable toolchain.

Why VCAL-core?

Ultra-light: minimal dependencies, no unsafe in public API.
Fast enough: competitive k-NN for small/mid indexes (edge or cache use).
Embeddable: no daemon — runs directly inside your process.
Deterministic: single-threaded core; wrap in RwLock for concurrency.
Persistent: safe paired snapshots and simple TTL/LRU eviction.
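
The "wrap in RwLock" advice can be sketched with the standard library alone. ToyIndex and SharedIndex below are hypothetical stand-ins used only for illustration, not vcal-core types; the only real APIs involved are std::sync::Arc and std::sync::RwLock. The point is that read-only calls share the lock while mutating calls take it exclusively.

```rust
use std::collections::HashMap;
use std::sync::{Arc, RwLock};

/// Hypothetical stand-in for a single-threaded index; NOT the vcal-core API.
#[derive(Default)]
pub struct ToyIndex {
    vectors: HashMap<u64, Vec<f32>>,
}

impl ToyIndex {
    /// Mutating operations take `&mut self`, so callers need the write lock.
    pub fn insert(&mut self, id: u64, v: Vec<f32>) {
        self.vectors.insert(id, v);
    }

    /// Read-only operations take `&self`, so the read lock is enough.
    pub fn contains(&self, id: u64) -> bool {
        self.vectors.contains_key(&id)
    }
}

/// Cloneable handle that shares one index between threads.
#[derive(Clone, Default)]
pub struct SharedIndex(Arc<RwLock<ToyIndex>>);

impl SharedIndex {
    pub fn insert(&self, id: u64, v: Vec<f32>) {
        // Exclusive access: readers wait only while the insert runs.
        self.0.write().expect("lock poisoned").insert(id, v);
    }

    pub fn contains(&self, id: u64) -> bool {
        // Shared access: concurrent readers do not block each other.
        self.0.read().expect("lock poisoned").contains(id)
    }
}
```

Each thread would hold its own clone of the handle; a cache that is read far more often than it is written benefits most from this split between read and write locks.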

Key Features

HNSW index with greedy descent + ef_search
Pluggable metrics: Cosine (default), Dot
Optional SIMD (--features simd + RUSTFLAGS="-C target-cpu=native")
Snapshots: binary persistence with the serde feature
- Atomic paired saves prevent corruption (.index.A / .index.B)
- Automatic recovery from latest intact snapshot
Eviction:
- evict_ttl(ttl_secs) — remove expired entries
- evict_lru_until(max_vectors, max_bytes) — respect soft caps
Stats: stats() → (vector_count, approx_bytes)
Simple API: insert, delete, contains, and search

Install

[dependencies]
vcal-core = { version = "0.1.1", features = ["serde"] }

Optional features:

serde — enable snapshot persistence
simd — AVX2-optimized inner loops (x86_64 only)

Quick Example

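use vcal_core::{HnswBuilder, Cosine};

// `?` needs an enclosing function that returns Result. Box<dyn Error> is used
// here on the assumption that the crate's error types implement std::error::Error.
fn main() -> Result<(), Box<dyn std::error::Error>> {
    let mut idx = HnswBuilder::<Cosine>::default().dims(128).build();
    idx.insert(vec![0.1; 128], 1001)?;
    let hits = idx.search(&vec![0.1; 128], 5)?;
    Ok(())
}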

Persistence (v0.1.1)

use vcal_core::Index;
use std::fs::File;

let idx = Index::new(...)?;
let f = File::create("vcal.index")?;
idx.save(f)?; // alternates between paired files safely

Because saves alternate between two files, an interrupted write can damage at most one of them. On restart, load() automatically picks the most recent intact .index snapshot.
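
To make the paired-save idea concrete, here is a minimal sketch using only the standard library. It does not reproduce vcal-core's on-disk format: the slot layout (sequence number, FNV-1a checksum, payload), the slot naming helper, and the function names save and load_latest are all assumptions made for illustration.

```rust
use std::fs;
use std::io::{self, Write};
use std::path::{Path, PathBuf};

/// FNV-1a 64-bit hash, used here only as a cheap integrity check.
fn fnv1a(bytes: &[u8]) -> u64 {
    let mut h: u64 = 0xcbf29ce484222325;
    for &b in bytes {
        h ^= b as u64;
        h = h.wrapping_mul(0x100000001b3);
    }
    h
}

/// Decode exactly 8 little-endian bytes.
fn le_u64(b: &[u8]) -> u64 {
    let mut a = [0u8; 8];
    a.copy_from_slice(b);
    u64::from_le_bytes(a)
}

/// Hypothetical slot naming: `<base>.A` and `<base>.B`.
fn slot(base: &Path, name: &str) -> PathBuf {
    let mut s = base.as_os_str().to_owned();
    s.push(".");
    s.push(name);
    PathBuf::from(s)
}

/// Assumed layout: [seq: u64 LE][checksum: u64 LE][payload].
/// Returns None when the slot is missing, truncated, or fails the checksum.
fn read_slot(path: &Path) -> Option<(u64, Vec<u8>)> {
    let raw = fs::read(path).ok()?;
    if raw.len() < 16 {
        return None;
    }
    let seq = le_u64(&raw[0..8]);
    let sum = le_u64(&raw[8..16]);
    let payload = raw[16..].to_vec();
    if fnv1a(&payload) == sum { Some((seq, payload)) } else { None }
}

/// Return the intact snapshot with the highest sequence number, if any.
pub fn load_latest(base: &Path) -> Option<(u64, Vec<u8>)> {
    let a = read_slot(&slot(base, "A"));
    let b = read_slot(&slot(base, "B"));
    match (a, b) {
        (Some(a), Some(b)) => Some(if a.0 >= b.0 { a } else { b }),
        (a, b) => a.or(b),
    }
}

/// Write `payload` into the slot that is missing, damaged, or older,
/// so the newest intact snapshot is never overwritten.
pub fn save(base: &Path, payload: &[u8]) -> io::Result<u64> {
    let a = read_slot(&slot(base, "A")).map(|s| s.0);
    let b = read_slot(&slot(base, "B")).map(|s| s.0);
    let (target, seq) = match (a, b) {
        (None, None) => ("A", 1),
        (Some(x), None) => ("B", x + 1),
        (None, Some(y)) => ("A", y + 1),
        (Some(x), Some(y)) => if x < y { ("A", y + 1) } else { ("B", x + 1) },
    };
    let mut buf = Vec::with_capacity(16 + payload.len());
    buf.extend_from_slice(&seq.to_le_bytes());
    buf.extend_from_slice(&fnv1a(payload).to_le_bytes());
    buf.extend_from_slice(payload);
    let mut f = fs::File::create(slot(base, target))?;
    f.write_all(&buf)?;
    f.sync_all()?; // flush to disk before reporting success
    Ok(seq)
}
```

If a crash interrupts save, the half-written slot fails its checksum and load_latest falls back to the other one. A production format would likely use a stronger checksum and avoid re-reading whole payloads just to learn the sequence numbers.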

Eviction

idx.evict_ttl(3600);                        // Remove expired entries
idx.evict_lru_until(Some(1000), None);      // Keep up to 1000 vectors
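
The semantics of the two eviction calls can be illustrated with a toy cache. ToyCache and Entry are hypothetical and do not mirror vcal-core internals; in particular, this sketch assumes that TTL is measured from the last use, that byte size is approximated by the vector payload alone, and that the caller passes the current time explicitly (which keeps the example deterministic).

```rust
use std::collections::HashMap;

/// Hypothetical bookkeeping entry; not the vcal-core internal layout.
struct Entry {
    vector: Vec<f32>,
    last_used: u64, // seconds, supplied by the caller
}

#[derive(Default)]
pub struct ToyCache {
    entries: HashMap<u64, Entry>,
}

impl ToyCache {
    pub fn insert(&mut self, id: u64, vector: Vec<f32>, now: u64) {
        self.entries.insert(id, Entry { vector, last_used: now });
    }

    /// Mark an entry as used; returns false if the id is unknown.
    pub fn touch(&mut self, id: u64, now: u64) -> bool {
        match self.entries.get_mut(&id) {
            Some(e) => { e.last_used = now; true }
            None => false,
        }
    }

    pub fn contains(&self, id: u64) -> bool {
        self.entries.contains_key(&id)
    }

    pub fn len(&self) -> usize {
        self.entries.len()
    }

    /// Payload bytes only; map overhead is ignored in this sketch.
    pub fn approx_bytes(&self) -> usize {
        self.entries
            .values()
            .map(|e| e.vector.len() * std::mem::size_of::<f32>())
            .sum()
    }

    /// Drop entries idle for more than `ttl_secs`; returns how many were removed.
    pub fn evict_ttl(&mut self, ttl_secs: u64, now: u64) -> usize {
        let before = self.entries.len();
        self.entries
            .retain(|_, e| now.saturating_sub(e.last_used) <= ttl_secs);
        before - self.entries.len()
    }

    /// Remove least-recently-used entries until both caps hold.
    /// `None` means "no cap" for that dimension.
    pub fn evict_lru_until(
        &mut self,
        max_vectors: Option<usize>,
        max_bytes: Option<usize>,
    ) -> usize {
        let mut order: Vec<(u64, u64)> = self
            .entries
            .iter()
            .map(|(id, e)| (e.last_used, *id))
            .collect();
        order.sort(); // oldest first, ties broken by id
        let mut removed = 0;
        for (_, id) in order {
            let over_count = max_vectors.map_or(false, |m| self.entries.len() > m);
            let over_bytes = max_bytes.map_or(false, |m| self.approx_bytes() > m);
            if !over_count && !over_bytes {
                break;
            }
            self.entries.remove(&id);
            removed += 1;
        }
        removed
    }
}
```

Sorting all entries on every call is O(n log n), which is acceptable for a sketch; a real implementation would more likely maintain an ordered structure incrementally.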

Observability

vcal-core itself is metrics-agnostic.
For Prometheus and Grafana integration, use vcal-server, which exposes /metrics.

Performance Tips

Build in release mode:
cargo build --release
Use native SIMD:
RUSTFLAGS="-C target-cpu=native" cargo build --release --features simd
Normalize embeddings for cosine metric.
Typical parameters:
m = 16–32, ef_search = 64–256.
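
The normalization tip can be made concrete with a few lines of plain Rust. The helper names l2_normalize, dot, and cosine are illustrative and are not part of vcal-core; whether the crate normalizes internally is not stated here, so treat this as a pre-processing step applied before insert and search. Once vectors have unit length, cosine similarity reduces to a dot product.

```rust
/// Scale `v` to unit L2 norm in place.
/// Returns false and leaves `v` unchanged if the norm is zero or not finite.
pub fn l2_normalize(v: &mut [f32]) -> bool {
    let norm = v.iter().map(|x| x * x).sum::<f32>().sqrt();
    if !norm.is_finite() || norm == 0.0 {
        return false;
    }
    for x in v.iter_mut() {
        *x /= norm;
    }
    true
}

/// Plain dot product of two equal-length slices.
pub fn dot(a: &[f32], b: &[f32]) -> f32 {
    a.iter().zip(b).map(|(x, y)| x * y).sum()
}

/// Cosine similarity computed the long way, for comparison.
pub fn cosine(a: &[f32], b: &[f32]) -> f32 {
    dot(a, b) / (dot(a, a).sqrt() * dot(b, b).sqrt())
}
```

For example, normalizing [3.0, 4.0] yields approximately [0.6, 0.8], and its dot product with the unit vector [1.0, 0.0] matches the cosine similarity of the original pair.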

Design Principles

No background threads or implicit I/O.
No public unsafe.
The snapshot format is stable across the 0.x line and has been versioned since v0.1.0.
Optimized for embedded and server-local caches, not massive-scale ANN.

For Python and chatbot integration, see INSTALL.md.

License

Apache-2.0. See the LICENSE-Apache-2.0 and NOTICE files.
