GitHub - mcp-tool-shop-org/knowledge-core: Role-aware retrieval engine — governed knowledge substrate for Role OS

Role-aware retrieval engine — governed knowledge substrate for Role OS.

What This Is

A retrieval engine where role law governs what gets retrieved, ranked, and cited. Not generic RAG — governed input with hard role boundaries.

Same task, different roles, materially different evidence. Proven across 5 pilot roles with 86 tests.

Architecture

One canonical corpus with metadata, embeddings, and versioning
Per-role overlays that control source policy, vocabulary, reranking, and synthesis
Structured retrieval bundles with full provenance, rejection trails, and trust posture
Fallback governance for degraded scenarios (stale, conflicted, weak evidence)

Pipeline

task + overlay + corpus → query builder → BM25 + semantic → merge → metadata filter → rerank → bundle

Key Contracts

Contract	Description
`RetrievalBundle`	Governed output of every retrieval operation
`RoleOverlay`	Declarative per-role retrieval configuration
`PacketKnowledge`	How Role OS packets carry knowledge bundles
`EvidenceProvenance`	How evidence items trace back to retrieval sources
`FallbackDecision`	Explicit governance for degraded retrieval scenarios

Install

npm install @roleos/knowledge-core

Requirements: Node.js >= 18. Optional: Ollama with nomic-embed-text for semantic search.

Quick Start

import { CorpusStore, ingestFixtureCorpus, retrieve } from "@roleos/knowledge-core";

// 1. Create store and ingest corpus
const store = new CorpusStore(":memory:");
ingestFixtureCorpus(store, "./corpus.json");

// 2. Load a role overlay
const overlay = JSON.parse(readFileSync("./roles/security-reviewer.json", "utf-8"));

// 3. Retrieve with role governance
const bundle = await retrieve({
  store,
  roleId: "security-reviewer",
  taskText: "Review auth middleware for injection risks",
  overlay,
  lexicalOnly: true,
});

// bundle.selected — evidence chunks, scored and ranked
// bundle.rejected — why candidates were excluded
// bundle.provenance — trust and freshness posture
// bundle.warnings — degradation signals

Pilot Roles (Wave 1)

Role	Retrieval Profile
Product Strategist	User value, tradeoffs, strategic precedent
Security Reviewer	Threat models, CVEs, exploit patterns
Competitive Analyst	Market data, honest disadvantages, substitutes
Docs Architect	Structure patterns, navigation, information hierarchy
Critic Reviewer	Quality standards, rejection precedent, contract compliance

How It Works

Query Builder — expands task text with overlay vocabulary (boost phrases, preferred terms, synonyms)
Lexical Search — BM25 via SQLite FTS5 for exact term matching
Semantic Search — embedding similarity via Ollama (graceful fallback to lexical-only)
Merge + Dedupe — union candidate pool from both lanes
Metadata Filter — enforces forbidden sources, role exclusions, stale penalties, trust/doc-type boosts
Role Reranker — transparent weighted scoring with source diversity pressure
Bundle Assembly — produces governed RetrievalBundle with full audit trail

Trust & Security

This library operates locally only as a retrieval engine.

Data touched: local SQLite database, document chunks, embeddings
Data NOT touched: no user credentials, no PII, no external APIs (except optional Ollama localhost)
No network egress except optional embedding generation via local Ollama (localhost:11434)
No telemetry is collected or sent
No secrets handling — does not read, store, or transmit credentials

Status

Phase	Description	Tests
1. Contract Spine	Types, validation, overlays, fixtures	58
2. Retrieval Engine	Full pipeline, 5 roles proven distinct	86
3. Prompt Consumption	Role OS prompt integration	Role OS: 38
4. Artifact Evidence	Chain-of-custody proof	Role OS: 22
5. Production Adoption	Live in persistent run engine	Role OS: 11

License

MIT

Built by MCP Tool Shop

Name		Name	Last commit message	Last commit date
Latest commit History 6 Commits
.github/workflows		.github/workflows
knowledge		knowledge
site		site
src		src
test		test
.gitignore		.gitignore
CHANGELOG.md		CHANGELOG.md
LICENSE		LICENSE
README.es.md		README.es.md
README.fr.md		README.fr.md
README.hi.md		README.hi.md
README.it.md		README.it.md
README.ja.md		README.ja.md
README.md		README.md
README.pt-BR.md		README.pt-BR.md
README.zh.md		README.zh.md
SCORECARD.md		SCORECARD.md
SECURITY.md		SECURITY.md
SHIP_GATE.md		SHIP_GATE.md
package-lock.json		package-lock.json
package.json		package.json
tsconfig.json		tsconfig.json

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

What This Is

Architecture

Pipeline

Key Contracts

Install

Quick Start

Pilot Roles (Wave 1)

How It Works

Trust & Security

Status

License

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

What This Is

Architecture

Pipeline

Key Contracts

Install

Quick Start

Pilot Roles (Wave 1)

How It Works

Trust & Security

Status

License

About

Topics

Resources

License

Code of conduct

Contributing

Security policy

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages