Skip to content

feat: require agent evolver self-validation#22

Merged
StoneHanaMori merged 1 commit intomainfrom
feat/evolver-harness-self-validation
Apr 28, 2026
Merged

feat: require agent evolver self-validation#22
StoneHanaMori merged 1 commit intomainfrom
feat/evolver-harness-self-validation

Conversation

@StoneHanaMori
Copy link
Copy Markdown
Collaborator

Summary

  • Add center-harness self-validation for Agentic Evolver before finalizing changed or newly created skills.
  • Require static checks and small safe smoke tests when possible.
  • Record validation results in each skill’s history evidence file.
  • Keep the existing client-side validation flow unchanged.

Motivation

Previously, Skill validation mainly happened after distribution to validation clients. This PR adds an
earlier Codex / Claude Code-style quality gate inside the Agentic Evolver harness, so obvious skill issues
can be detected and fixed before the candidate skill leaves the central workspace.

@StoneHanaMori StoneHanaMori requested a review from Copilot April 28, 2026 15:14
@StoneHanaMori StoneHanaMori self-assigned this Apr 28, 2026
@StoneHanaMori StoneHanaMori merged commit 26486a5 into main Apr 28, 2026
1 check failed
@StoneHanaMori StoneHanaMori deleted the feat/evolver-harness-self-validation branch April 28, 2026 15:14
Copy link
Copy Markdown

Copilot AI left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Pull request overview

Adds an “inner-loop” self-validation gate to the Agentic Evolver center harness, ensuring skills are checked (and results recorded) before they’re finalized and leave the central workspace.

Changes:

  • Update EVOLVE_AGENTS.md to require self-validation (static checks + minimal safe smoke tests) and to record outcomes in history evidence.
  • Update the evolve workspace bootstrap (AGENTS.md template in agent_workspace.py) to point agents at the new self-validation requirement.
  • Add tests that assert the guide and bootstrap mention the new self-validation process.

Reviewed changes

Copilot reviewed 3 out of 3 changed files in this pull request and generated 4 comments.

File Description
tests/test_agent_evolver_self_validation.py Adds doc/bootstrapping “guardrail” tests to ensure self-validation instructions stay present.
evolve_server/engines/agent_workspace.py Updates the embedded AGENTS.md bootstrap template to require self-validation before finalizing skill changes.
evolve_server/engines/EVOLVE_AGENTS.md Introduces a full self-validation step and reiterates validation requirements near the end of the guide.

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.



def test_evolve_agents_md_requires_center_harness_self_validation():
text = Path("evolve_server/engines/EVOLVE_AGENTS.md").read_text(encoding="utf-8")
Comment on lines +18 to +22
text = agent_workspace._EVOLVE_AGENTS_MD

assert "self-validation" in text
assert "EVOLVE_AGENTS.md" in text
assert "Before finalizing" in text
- Before finalizing any changed skill, complete the self-validation required
by `EVOLVE_AGENTS.md`; if validation fails, keep editing or revert the
change rather than leaving a known-failing skill in `skills/`.
- Record self-validation results in the paired `history/v<N>_evidence.md` file.
Comment on lines +398 to +399
- ALWAYS complete center harness self-validation before finalizing a changed
skill, and record the result in the paired `history/v<N>_evidence.md` file.
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants