Skip to content

[P0] Clarify eval engine support and remove unsupported headless examples #29

@spazyCZ

Description

@spazyCZ

Objective

Align the eval engine contract, examples, and compatibility text so supported and unsupported eval runtimes are described consistently.

Priority

P0 — Must Fix for v1.0

Details

The spec currently defines claude-code and codex as supported eval engines, with copilot and cursor reserved for future use. However, some eval execution-flow examples still show a Copilot headless command. This leaves the supported-engine contract internally inconsistent.

Acceptance Criteria

  • Eval examples use only supported engines or clearly mark future placeholders as non-normative
  • Unsupported-engine behavior is defined consistently
  • The package format and compatibility sections agree on eval engine support

Notes

Source: work/eval-design-discussion.md

Metadata

Metadata

Assignees

No one assigned

    Labels

    documentationImprovements or additions to documentationenhancementNew feature or request

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions