Agents Research & Knowledge Base
A comprehensive research repository on AI agent design, engineering, and implementation.
Started: 2026-03-07 | Last updated: 2026-04-10
#
Document
Description
01
What Are AI Agents?
First principles: definitions, the agent loop, workflows vs agents distinction (Anthropic), autonomy spectrum, failure modes, and the three eras of agent design
#
Document
Description
02
Context Engineering
The core discipline โ "the art of providing the right information in the right format at the right time" (Schmid). Context as a system, the seven principles, token budgets, pre-fetching, error compaction
04
Tool Use & Function Calling
ACI (Agent-Computer Interface), tool design principles, poka-yoke, tools as structured outputs, MCP protocol, and the tool explosion problem
05
Memory Systems
CoALA memory taxonomy (working, episodic, semantic, procedural), storage backends, retrieval strategies, consolidation patterns, and the stateless-reducer tension
08
Planning & Reasoning
Chain-of-thought, tree-of-thought, reflexion, hierarchical decomposition, adaptive re-planning, metacognition, and knowing when to stop
10
Language Selection for Agents
Language analysis for agent-generated code: static vs dynamic typing, GC vs manual memory, Rust, Go, C#, Java, TypeScript, compile times, and the context engineering framing
11
Self-Reflection & Verification
Self-correction, verification strategies, reflection loops, and when self-repair helps vs hurts
#
Document
Description
03
Architecture Patterns
Workflows (prompt chaining, routing, parallelization) and Agents (ReAct, Plan-Execute, Autonomous) โ with Anthropic's simplicity-first principle and anti-patterns
06
Multi-Agent Systems
Communication topologies, human-as-tool-call, the framework landscape (2026), and why most teams move away from frameworks
#
Document
Description
07
Evaluation & Reliability
Benchmarking, LLM-as-judge, compounding error math, sandboxed testing, guardrails, and the reliability ladder
Implementation guides for building specific agent types:
Blueprint
Description
Status
Generic Agent
Full implementation blueprint with agent loop, context assembler, tool executor, guardrails, and tools library
โ
Complete
Research Agent
General-purpose research agent (.NET 10 + Microsoft Agent Framework)
โ
Complete
Coding Agent (Forge)
SWE-bench-style coding agent with verification, guardrails, and session management
๐ง In Progress
Life Agent
Audio lifelogging and life-augmentation agent
๐ง In Progress
MCP Server
Model Context Protocol server implementation
๐ง In Progress
Reference materials and taxonomies:
Document
Description
Agent & Copilot Taxonomy
Comprehensive classification by autonomy level, domain, architecture, and design decision matrix
Research Paper Catalogue
91 papers organized by theme with intellectual landscape analysis, core tensions, schools of thought, and timeline view
Additional Knowledge Base
๐ Research Papers (126 papers)
126 academic papers downloaded from arXiv and converted to Markdown via Docling . Full conversions are in papers/docling/ . See the Research Paper Catalogue for the complete organized listing with intellectual landscape analysis.
Below are highlights from the foundational papers:
Foundational Agent Frameworks
Tool Use & Interface Design
Deep Research & Verification
๐ Research Notes & Design Documents
Internal design documents and experiment observations from building agents:
If you're new to agent building , read in this order:
What Are Agents? โ Foundations and the workflows-vs-agents distinction
Context Engineering โ The most important skill ("agent failures are context failures")
Architecture Patterns โ Know your options; start with the simplest
Tool Use โ How agents take action; invest more here than in prompts
Memory Systems โ How agents remember
Planning & Reasoning โ How agents think
Multi-Agent Systems โ Scaling with teams (but try single-agent first)
Evaluation โ Making agents reliable
Implementation Blueprint โ Build one
Language Selection โ Choosing the right language for agent-generated code
๐ Key Insights (Quick Reference)
Agents = LLM + Tools + Loop + Memory + Goal โ but most production "agents" are actually workflows (predefined code paths with LLM steps)
Context engineering > prompt engineering โ context is a system , not a string (Schmid)
Agent failures are context failures, not model failures โ Anthropic spent more time optimizing tools than prompts for SWE-bench
ACI (Agent-Computer Interface) > UI โ tool descriptions, error messages, and output formats are the agent's interface
The model is the engine, context is the fuel โ garbage in, garbage out, regardless of model capability
Own your prompts, own your context window, own your control flow (12-Factor Agents)
Build for failure first, success second โ $0.95^{10}$ per-step accuracy = 60% end-to-end
Start as a copilot, graduate to autonomous โ trust must be earned through evidence from evals
Use the simplest architecture that achieves your reliability goals โ don't use a framework where a while-loop suffices (Anthropic)
Tool descriptions are a form of context engineering โ they're the primary way you communicate intent to the model
Memory is the scaffolding that turns a stateless function into a stateful agent
The evaluator's rubric IS the specification
Working memory is the most underappreciated memory type
The handoff summary is the most important piece of multi-agent communication
Metacognition turns a reactive system into a self-monitoring system โ the best tool is sometimes request_clarification
Relevance over recency โ Not everything recent is relevant
Compression without loss โ Fit more signal into less tokens
Structure signals intent โ How you format context changes how the model uses it
Examples > instructions โ Show, don't tell
Single responsibility tools โ One tool, one job
Rich error messages โ Compact errors into context; the error IS the model's feedback
Verify after modify โ Always check your work
Graceful degradation โ Agents that say "I can't" are better than agents that hallucinate
Human contact is a tool call โ Not a failure state, but a smart decision (12-Factor)
Poka-yoke your tools โ Make incorrect usage impossible (absolute paths > relative paths)
agents/
โโโ README.md โ You are here
โโโ LICENSE โ MIT License
โโโ .github/
โ โโโ copilot-instructions.md โ Copilot custom instructions
โ โโโ skills/
โ โโโ convert-paper/SKILL.md โ Paper conversion skill
โ โโโ forge-improve/SKILL.md โ Forge session analysis skill
โ โโโ research-agent-investigation/SKILL.md
โโโ research/
โ โโโ fundamentals/
โ โ โโโ 01-what-are-agents.md
โ โโโ techniques/
โ โ โโโ 02-context-engineering.md
โ โ โโโ 04-tool-use-function-calling.md
โ โ โโโ 05-memory-systems.md
โ โ โโโ 08-planning-reasoning.md
โ โ โโโ 10-language-selection-for-agents.md
โ โ โโโ 11-self-reflection-verification.md
โ โโโ patterns/
โ โ โโโ 03-architecture-patterns.md
โ โ โโโ 06-multi-agent-systems.md
โ โโโ evaluation/
โ โ โโโ 07-evaluation-reliability.md
โ โโโ copilot-customization-guide.md
โโโ knowledge-base/
โ โโโ agent-taxonomy.md
โ โโโ paper-catalogue.md โ 126 papers with intellectual landscape
โ โโโ audio-lifelogging-research.md
โ โโโ human-wellness-research.md
โ โโโ long-running-life-augmentation-agents.md
โโโ papers/
โ โโโ pdfs/ โ Source PDFs from arXiv
โ โโโ docling/ โ Markdown conversions (126 papers)
โโโ blueprints/
โ โโโ generic-agent/
โ โ โโโ 09-implementation-blueprint.md
โ โโโ research-agent/ โ .NET 10 + Microsoft Agent Framework
โ โโโ coding-agent/ โ Forge coding agent (in progress)
โ โโโ life-agent/ โ Audio lifelogging agent (in progress)
โ โโโ mcp-server/ โ MCP server implementation (in progress)
โโโ scripts/
โโโ convert_papers.py โ arXiv download + Docling pipeline
Phase 1: Research Foundation โ
Phase 2: Academic Research Integration โ
Phase 3: Implementation Blueprints (In Progress)