add Pydantic-typed state schemas with validation on reads and writes

### Description

`StateManager._state: dict[str, Any]` is the only state contract — workflows have no way to declare a schema for their state. Consequences:

- **Silent typos in templates.** `{state.user_inpt}` (typo) renders as empty string per `SafeFormatDict` (issue #110), and the LLM produces a confused answer. No error.
- **No validation on writes.** A step writing `state.classification = "complaint"` followed by another step reading it as `int` produces a runtime error far from the cause.
- **No editor support.** Without a schema, IDE autocomplete on `state.foo` is impossible; the LSP work in #66 (semantic lint) hits this wall.
- **Resume-from-checkpoint surprises.** A schema change between v1 and v2 of a workflow can resurrect old state that no longer matches what the steps expect — silent corruption, hard-to-diagnose bugs.

### Proposal

**1. Optional Pydantic state schema per workflow:**

```yaml
name: customer-support
version: "1.2"
state_schema:
  module: examples.schemas
  model: SupportState
state:
  user_input: ""
  classification: null
```

Or inline JSON Schema:

```yaml
state_schema:
  type: object
  properties:
    user_input: { type: string }
    classification: { type: string, enum: [question, complaint, other] }
    confidence: { type: number, minimum: 0, maximum: 1 }
  required: [user_input]
  additionalProperties: false
```

**2. Schema enforcement:**

When `state_schema` is declared:

- At workflow load: validate `state` initial values against the schema; reject the workflow if invalid.
- On every `StateManager.set(key, value)`: validate the resulting state against the schema; raise `StateSchemaError` on violation.
- On every `StateManager.get(key)`: type-check the returned value matches the schema field; raise on mismatch (typically caught earlier by the set, but defends against direct dict manipulation).
- On checkpoint resume: validate restored state matches current schema; raise with a clear message if the schema has evolved (with optional `migration_callback`).

**3. Strict mode for templates:**

When a state schema is declared, `SafeFormatDict` operates in strict mode (per #110): missing keys raise `TemplateError` instead of returning empty string. The schema declares which keys exist; anything else is a typo.

**4. Pydantic-first ergonomics:**

```python
# examples/schemas.py
from pydantic import BaseModel
from typing import Literal

class SupportState(BaseModel):
    user_input: str
    classification: Literal["question", "complaint", "other"] | None = None
    confidence: float = 0.0
    resolution_proposed: bool = False
```

Workflow author writes the model once; AgentLoom uses it for validation, the lint pass uses it for static checks, the LSP/IDE uses it for autocomplete.

**5. Step output schemas:**

Steps that write to state declare what they produce:

```yaml
- id: classify
  type: llm_call
  prompt: "..."
  output: state.classification
  output_schema:
    type: pydantic
    model: examples.schemas.Classification   # produces field-compatible value
```

Combined with #117 (response schema for LLM calls), this makes the entire state flow type-checked end-to-end.

**6. Lint integration:**

When #66 (`agentloom lint`) lands, it cross-references:

- Every `{state.X}` template reference against the schema.
- Every `output: state.Y` against the schema (Y must be a writable field).
- Every router expression `state.X == ...` against the schema (X must exist; comparison type must match).

Today's silent runtime failures become validation errors at lint time.

**7. Migration support:**

```yaml
state_schema:
  module: examples.schemas
  model: SupportState
  migrations:
    - from_version: "1.1"
      to_version: "1.2"
      callback: examples.migrations.support_v11_to_v12
```

When resuming a checkpoint with `version=1.1` into a workflow at `version=1.2`, the migration callback runs to coerce old state to new shape. Without it, the resume fails clearly instead of corrupting silently.

### Scope

- `src/agentloom/core/models.py` — `WorkflowDefinition.state_schema`, `StepDefinition.output_schema`.
- `src/agentloom/core/state.py` — schema validation on `set`/`get`/`from_checkpoint`.
- `src/agentloom/core/templates.py` — strict mode triggered by schema presence.
- `src/agentloom/core/parser.py` — load and resolve Pydantic model references at workflow load.
- `src/agentloom/exceptions.py` — `StateSchemaError`.
- `src/agentloom/cli/lint.py` (after #66) — cross-reference schemas with template references.
- `src/agentloom/checkpointing/base.py` — version field + migration callback dispatch.
- `examples/typed_state/` — example workflow with Pydantic state.

### Regression tests

- `test_state_initial_validates_against_schema`
- `test_state_set_invalid_value_raises`
- `test_state_get_after_external_mutation_raises_on_type_mismatch`
- `test_template_missing_key_raises_when_schema_present`
- `test_template_missing_key_silent_when_no_schema` (backward compat)
- `test_checkpoint_resume_with_matching_schema_succeeds`
- `test_checkpoint_resume_with_evolved_schema_runs_migration`
- `test_checkpoint_resume_with_evolved_schema_no_migration_raises`
- `test_step_output_schema_validates_written_value`

### Notes

- **Backward compatible:** workflows without `state_schema` keep working with current dict-based, lenient behavior.
- **Pairs with #129 (templating):** if templating gains conditionals/loops, those reference state too — same schema applies.
- **Pairs with #66 (lint):** schema is the substrate that makes lint useful beyond syntactic checks.
- **Pairs with #117 (structured output):** step output schemas slot naturally into `output_schema`.
- **Migrations are optional:** the cheapest correct behavior is "raise on schema mismatch" — that alone prevents silent corruption. Migration callbacks are for users who choose to maintain compatibility actively.
- This is the largest single quality-of-life improvement for non-trivial workflows. Pays for itself the first time a typo gets caught at lint instead of as confused LLM output.


Provide feedback

Saved searches

Use saved searches to filter your results more quickly

add Pydantic-typed state schemas with validation on reads and writes #128

Description

Proposal

Scope

Regression tests

Notes

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

add Pydantic-typed state schemas with validation on reads and writes #128

Description

Description

Proposal

Scope

Regression tests

Notes

Metadata

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

Issue actions