did the sandbox integration by FireFistisDead · Pull Request #1585 · mofa-org/mofa

FireFistisDead · 2026-04-07T17:59:09Z

📋 Summary

This PR adds sandboxed execution support for the Shell tool so agent-driven commands run in a more isolated and controlled environment, and also adds Semantic Caching for gateway chat inference to reduce latency and cost for repetitive prompts.

Together, these changes improve both:

Security (sandboxed shell execution)
Efficiency (semantic cache hits bypassing expensive inference)

🔗 Related Issues

Closes #1584 #1611
Related to #1584

🧠 Context

Shell Sandbox

The Shell tool previously executed commands directly on the host. In an agent framework, that is risky because model-generated actions can access sensitive files, environment variables, or broader system resources.

Semantic Cache

Standard cache behavior is usually exact-match only. In production, users often ask semantically similar questions with slightly different wording, which still triggers full inference unnecessarily. Semantic caching reduces repeated inference by matching prompt meaning, not only exact text.

🛠️ Changes

Sandbox execution (Shell tool)

Added sandbox execution mode for the Shell tool.
Added Linux sandbox support with isolated execution and resource limits.
Added non-Linux sandbox fallback through the WASM runtime for supported module-based execution.
Added sandbox: true support in agent.yml shell tool config.
Added SDK helpers so shell tool config is applied end to end.
Added tests for sandbox config parsing, directory restrictions, and sandbox downgrade prevention.

Semantic caching (Gateway)

Added semantic cache middleware for gateway chat flow.
Added embedding-based prompt similarity lookup before agent execution.
Added cache write-back after successful non-cached execution.
Added configurable threshold/top_k/provider/model wiring in server config.
Added response metadata fields to indicate cache usage and similarity score.
Added tests for:
- semantic hit behavior
- agent-boundary isolation
- disabled-cache behavior

🧪 How You Tested

Ran focused Shell tests:
- cargo test -p mofa-plugins shell::tests -- --nocapture
Ran the full mofa-plugins library test suite:
- cargo test -p mofa-plugins --lib -- --nocapture
Ran semantic cache tests in gateway:
- cargo test -p mofa-gateway middleware::semantic_cache::tests:: -- --nocapture
Ran gateway compile checks:
- cargo check -p mofa-gateway
Attempted full workspace suite:
- cargo test --workspace --all-features

⚠️ Breaking Changes

No breaking changes
Breaking change

🧹 Checklist

Code Quality

Code follows Rust idioms and project conventions
cargo fmt run
cargo clippy passes without warnings

Testing

Tests added/updated
cargo test passes locally without any error

Documentation

Public APIs documented
README / docs updated if needed

PR Hygiene

PR is small and focused
Branch is up to date with main
No unrelated commits
Commit messages explain why, not only what

🚀 Deployment Notes

No migration is required.

For sandboxed Shell execution, set sandbox: true in Shell tool config in agent.yml.
For semantic caching, enable semantic cache in gateway/server config and tune threshold/top_k/provider/model as needed.

Linux sandbox mode depends on bubblewrap and optional resource-limit tooling.

🧩 Additional Notes for Reviewers

Sandbox mode cannot be downgraded per call once enabled globally.
Fallback behavior is fail-closed, not silent host-execution fallback.
Semantic cache hits are bounded by configured similarity threshold and scoped to avoid cross-agent leakage.

FireFistisDead · 2026-04-07T18:00:01Z

@BH3GEI @lijingrs you can check the pr when you are available

Copilot

Pull request overview

Adds sandbox-aware execution to the built-in shell tool and wires agent.yml tool configuration through the SDK so built-in tools can be created/configured automatically from YAML.

Changes:

Introduces sandbox execution paths for ShellCommandTool (Linux bubblewrap + non-Linux WASM fallback), plus config parsing and related tests.
Adds SDK helpers to build a built-in ToolPlugin / ToolExecutor / LLMAgentBuilder from AgentYamlConfig tool settings.
Updates example agent.yml to demonstrate shell sandbox configuration.

Reviewed changes

Copilot reviewed 5 out of 5 changed files in this pull request and generated 9 comments.

Show a summary per file

File	Description
examples/config/agent.yml	Adds example shell tool config showing `sandbox: true` and resource limits.
crates/mofa-sdk/src/llm_tools.rs	Adds helpers to construct built-in tool plugin/executor and attach it to an LLM builder from YAML.
crates/mofa-sdk/src/lib.rs	Re-exports the new YAML/built-in-tools helper functions from `mofa_sdk::llm`.
crates/mofa-plugins/src/tools/shell.rs	Implements sandbox config, directory restrictions, timeouts, and sandboxed execution for the shell tool (plus tests).
crates/mofa-plugins/src/tools/mod.rs	Adds a config-aware built-in tool plugin factory (currently only applies per-tool JSON config).

Copilot · 2026-04-07T18:05:21Z