Fix: Eval hash mismatch due to parameter truncation in DB storage by rlundeen2 · Pull Request #1523 · microsoft/PyRIT

rlundeen2 · 2026-03-19T23:16:23Z

Bug: Running await printer.print_summary_async(scenario_result) in 1_configuring_scenarios.ipynb prints "official evaluation has not been run yet for this specific configuration" — even when evals have been run.

Root cause: Long scorer params (e.g., system prompt templates) are truncated to 80 characters when stored in the DB via ComponentIdentifier.to_dict(max_value_length=80). The identity .hash is correctly preserved through the round-trip, but eval_hash is recomputed from the truncated params by EvaluationIdentifier, producing a different hash than what was stored during the eval run. This causes the metrics lookup to fail silently.

Fix: Store eval_hash inside the ComponentIdentifier serialization (to_dict/from_dict) so it survives DB round-trips without recomputation from truncated params.

ComponentIdentifier: Added stored_eval_hash field and KEY_EVAL_HASH. to_dict(eval_hash=...) includes it in the JSON; from_dict() restores it.
EvaluationIdentifier: Uses stored_eval_hash when available instead of recomputing from (potentially truncated) params.
ScenarioResultEntry/ScoreEntry/AttackResultEntry: Compute eval_hash from untruncated identifiers before truncation and pass to to_dict().
atomic_attack.py: Same fix for the enriched identifier persistence path.

No DB schema migration needed — eval_hash is stored inside the existing JSON columns. Old data without it falls back to recomputation (same as prior behavior).

pyrit/identifiers/component_identifier.py

pyrit/score/scorer.py

pyrit/memory/memory_models.py

hannahwestra25 · 2026-03-20T15:17:35Z

added a few comments. They can be addressed in a follow up PR if you want to get the release started / out

Store eval_hash inside ComponentIdentifier serialization (to_dict/from_dict) so it survives DB round-trips without recomputation from truncated params. - ComponentIdentifier: added stored_eval_hash field and KEY_EVAL_HASH - EvaluationIdentifier: uses stored_eval_hash when available - ScenarioResultEntry/ScoreEntry/AttackResultEntry: compute eval_hash before truncation - atomic_attack.py: same fix for enriched identifier persistence - Tests: round-trip, double round-trip, and regression tests Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>

pyrit/score/scorer.py

pyrit/identifiers/component_identifier.py

…icrosoft#1523) Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>

rlundeen2 added 3 commits March 19, 2026 15:56

fixing eval hash

8272883

copilot

5fcd6e2

adding hash to atomic attack

41a56e7

jsong468 reviewed Mar 19, 2026

View reviewed changes

pyrit/identifiers/component_identifier.py Show resolved Hide resolved

rlundeen2 force-pushed the users/rlundeen/2026_03_19_eval_hash_bug branch from 409d1b7 to 7084a21 Compare March 20, 2026 01:17

rlundeen2 commented Mar 20, 2026

View reviewed changes

pyrit/identifiers/component_identifier.py Outdated Show resolved Hide resolved

rlundeen2 commented Mar 20, 2026

View reviewed changes

pyrit/score/scorer.py Outdated Show resolved Hide resolved

rlundeen2 commented Mar 20, 2026

View reviewed changes

pyrit/score/scorer.py Outdated Show resolved Hide resolved

hannahwestra25 reviewed Mar 20, 2026

View reviewed changes

pyrit/score/scorer.py Outdated Show resolved Hide resolved

hannahwestra25 reviewed Mar 20, 2026

View reviewed changes

pyrit/memory/memory_models.py Outdated Show resolved Hide resolved

rlundeen2 force-pushed the users/rlundeen/2026_03_19_eval_hash_bug branch from 7084a21 to 94dbf6a Compare March 20, 2026 16:40

rlundeen2 force-pushed the users/rlundeen/2026_03_19_eval_hash_bug branch from 94dbf6a to 2e09b7b Compare March 20, 2026 17:12

rlundeen2 added 2 commits March 20, 2026 12:04

refactor

8a9097f

unswallowing exceptions

297ffb9

hannahwestra25 reviewed Mar 20, 2026

View reviewed changes

pyrit/score/scorer.py Outdated Show resolved Hide resolved

hannahwestra25 reviewed Mar 20, 2026

View reviewed changes

pyrit/identifiers/component_identifier.py Show resolved Hide resolved

hannahwestra25 approved these changes Mar 20, 2026

View reviewed changes

pr feedback

7a1e485

rlundeen2 merged commit 7822646 into microsoft:main Mar 20, 2026
30 checks passed

riyosha pushed a commit to riyosha/PyRIT that referenced this pull request Mar 24, 2026

Fix: Eval hash mismatch due to parameter truncation in DB storage (m…

9465aba

…icrosoft#1523) Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>

jbolor21 pushed a commit to jbolor21/jbolor-PyRIT that referenced this pull request Mar 25, 2026

Fix: Eval hash mismatch due to parameter truncation in DB storage (m…

3a737cb

…icrosoft#1523) Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix: Eval hash mismatch due to parameter truncation in DB storage#1523

Fix: Eval hash mismatch due to parameter truncation in DB storage#1523
rlundeen2 merged 7 commits intomicrosoft:mainfrom
rlundeen2:users/rlundeen/2026_03_19_eval_hash_bug

rlundeen2 commented Mar 19, 2026 •

edited

Loading

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

hannahwestra25 commented Mar 20, 2026

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

rlundeen2 commented Mar 19, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

hannahwestra25 commented Mar 20, 2026

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

rlundeen2 commented Mar 19, 2026 •

edited

Loading