Name	Name	Last commit message	Last commit date
parent directory ..
.agentv	.agentv
evals	evals
scripts	scripts
CHANGELOG.md	CHANGELOG.md
README.md	README.md
bun.lock	bun.lock
package.json	package.json

Name

Last commit message

Last commit date

Code Grader SDK Helper

Demonstrates how a TypeScript code_grader evaluator can use defineCodeGrader from @agentv/eval for a declarative, zero-boilerplate approach.

Files

evals/dataset.eval.yaml: Example test that uses a code_grader evaluator.
scripts/verify-attachments.ts: Code grader script using defineCodeGrader.
evals/example.txt, evals/python.instructions.md: Attachment fixtures.

Setup

From repository root:

bun install  # Links workspace dependencies
bun run build  # Builds @agentv/core package

Run

Standalone Test

Test the SDK-based code grader directly with a mock payload:

cd examples/features/code-grader-sdk
cat << 'EOF' | bun run scripts/verify-attachments.ts
{
  "question": "Please echo this request",
  "criteria": "The CLI echoes the prompt and lists attachment names.",
  "expected_output": [{"role": "assistant", "content": "Attachments detected (2): example.txt, python.instructions.md."}],
  "answer": "Attachments detected (2): example.txt, python.instructions.md.",
  "guideline_files": ["evals/python.instructions.md"],
  "input_files": ["evals/example.txt"],
  "input": []
}
EOF

Full Evaluation

From the repository root:

cd examples/features
bun agentv eval code-grader-sdk/evals/dataset.eval.yaml --target local_cli

This requires a CLI target named local_cli configured in .agentv/targets.yaml.

API

The defineCodeGrader helper:

Reads JSON from stdin automatically
Converts snake_case to camelCase
Validates input and output with Zod schemas
Handles errors gracefully

import { defineCodeGrader } from '@agentv/eval';

export default defineCodeGrader(({ answer, criteria }) => ({
  score: answer.includes(criteria) ? 1.0 : 0.0,
  hits: ['Check passed'],
  misses: [],
}));

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

README.md

Code Grader SDK Helper

Files

Setup

Run

Standalone Test

Full Evaluation

API

FilesExpand file tree

code-grader-sdk

Directory actions

More options

Directory actions

More options

Latest commit

History

code-grader-sdk

Folders and files

parent directory

README.md

Code Grader SDK Helper

Files

Setup

Run

Standalone Test

Full Evaluation

API