feat: reduce JSON report size (#437) by ssrhaso · Pull Request #440 · AI-SDC/SACRO-ML

ssrhaso · 2026-04-10T11:36:19Z

Summary

Reduce JSON report file size by removing large derived arrays and externalising individual record data.

Strip fpr, tpr, and roc_thresh arrays from JSON serialisation via a _json_exclude_keys class attribute on the Attack base class. PDF generation is unaffected as it reads from the in-memory dict before serialisation.
Externalise LiRA individual record scores to a compressed .npz file, storing a relative filename in JSON for portability. Includes y_pred_proba and y_test to allow ROC recomputation from stored data.
Add backwards compatibility guard in LogLogROCModule for JSON files without ROC arrays.
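For readers unfamiliar with the pattern, the first and third points can be sketched roughly as below. This is a minimal illustration, not the code in this PR: the `Attack` class here is a stand-in, and `_strip_excluded`, `to_json`, `roc_points`, and the `instance_0` key are hypothetical names chosen for the example. Only `_json_exclude_keys` and the three array names come from the description above.

```python
import json


class Attack:
    """Hypothetical stand-in for the Attack base class."""

    # Keys dropped from every (possibly nested) dict before JSON serialisation.
    _json_exclude_keys = frozenset({"fpr", "tpr", "roc_thresh"})

    def _strip_excluded(self, obj):
        """Return a copy of obj with excluded keys removed at any depth."""
        if isinstance(obj, dict):
            return {
                key: self._strip_excluded(value)
                for key, value in obj.items()
                if key not in self._json_exclude_keys
            }
        if isinstance(obj, list):
            return [self._strip_excluded(item) for item in obj]
        return obj

    def to_json(self, report):
        """Serialise a report without mutating the in-memory dict."""
        return json.dumps(self._strip_excluded(report))


def roc_points(metrics):
    """Guard for files without ROC arrays: return (fpr, tpr) or None."""
    if "fpr" not in metrics or "tpr" not in metrics:
        return None
    return metrics["fpr"], metrics["tpr"]


# Hypothetical report layout, for illustration only.
report = {
    "instance_0": {
        "AUC": 0.71,
        "fpr": [0.0, 1.0],
        "tpr": [0.0, 1.0],
        "roc_thresh": [1.0, 0.0],
    }
}
```

Because the stripping works on a copy, anything that reads the in-memory dict (such as PDF generation) still sees the full arrays, while a consumer of the JSON has to tolerate their absence.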

For a CIFAR10-scale dataset, this reduces a single attack JSON from approximately 1.2MB to under 100KB.
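Most of that saving comes from moving per-record arrays out of the JSON. A rough sketch of the externalisation idea follows, assuming NumPy is available. The function names, the `individual_file` key, the `scores` array name, and `report.json` are hypothetical; only the `.npz` format, the relative filename, and the inclusion of `y_pred_proba` and `y_test` come from the description above.

```python
import json
import tempfile
from pathlib import Path

import numpy as np


def save_report(out_dir, summary, scores, y_pred_proba, y_test):
    """Write report.json plus a compressed sidecar with per-record arrays."""
    out_dir = Path(out_dir)
    out_dir.mkdir(parents=True, exist_ok=True)
    npz_name = "lira_individual.npz"
    np.savez_compressed(
        out_dir / npz_name,
        scores=scores,
        y_pred_proba=y_pred_proba,
        y_test=y_test,
    )
    # Store only the filename so the output directory can be moved as a unit.
    report = dict(summary, individual_file=npz_name)
    json_path = out_dir / "report.json"
    json_path.write_text(json.dumps(report))
    return json_path


def load_individual(json_path):
    """Resolve the sidecar relative to the JSON file and load its arrays."""
    json_path = Path(json_path)
    report = json.loads(json_path.read_text())
    with np.load(json_path.parent / report["individual_file"]) as data:
        return {name: data[name] for name in data.files}
```

With `y_test` and `y_pred_proba` recovered this way, a ROC curve can be recomputed on demand (for example with `sklearn.metrics.roc_curve`) rather than stored in the JSON.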

Closes #437

codecov · 2026-04-10T11:46:02Z

Codecov Report

✅ All modified and coverable lines are covered by tests.
✅ Project coverage is 99.52%. Comparing base (1863c65) to head (2b70e0e).

Additional details and impacted files

@@           Coverage Diff           @@
##             main     #440   +/-   ##
=======================================
  Coverage   99.51%   99.52%           
=======================================
  Files          23       23           
  Lines        2692     2713   +21     
=======================================
+ Hits         2679     2700   +21     
  Misses         13       13


ssrhaso · 2026-04-10T11:47:01Z

@rpreen Please let me know what you think of this when you get a chance!

jim-smith · 2026-04-16T15:53:45Z

@ssrhaso does this affect the meta-attack?


ssrhaso · 2026-04-17T09:23:40Z

> @ssrhaso does this affect the meta-attack?

@jim-smith As far as I can tell after checking thoroughly, no, it does not.

MetaAttack reads per-record scores from the in-memory sub-attack object, not from the JSON, so the key-stripping doesn't affect it.

Spotted one related side-effect while checking: the new .npz write in LIRAAttack._save_attack_metrics was gated on report_individual only, so MetaAttack sub-runs (which set write_report=False) would have dropped stray lira_individual.npz files in each sub_dir. Pushed a fix to also gate on write_report.
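To make the gating concrete, here is a simplified sketch of the behaviour after the fix, assuming NumPy is available. `LiraSketch` and its `save_attack_metrics` method are hypothetical stand-ins rather than the real `LIRAAttack._save_attack_metrics`, and the `individual` and `score` keys are made up for the example. Only the two flags and the `lira_individual.npz` filename come from the comment above.

```python
import tempfile
from pathlib import Path

import numpy as np


class LiraSketch:
    """Hypothetical stand-in for the attack; not the real LIRAAttack."""

    def __init__(self, output_dir, report_individual=False, write_report=True):
        self.output_dir = Path(output_dir)
        self.report_individual = report_individual
        self.write_report = write_report

    def save_attack_metrics(self, scores):
        metrics = {}
        if self.report_individual:
            # Per-record scores always stay available in memory.
            metrics["individual"] = {"score": np.asarray(scores)}
            # Only touch the disk when a report is actually being written.
            if self.write_report:
                self.output_dir.mkdir(parents=True, exist_ok=True)
                np.savez_compressed(
                    self.output_dir / "lira_individual.npz", score=scores
                )
        return metrics
```

A sub-run configured with `report_individual=True, write_report=False` then returns the scores to its caller without leaving a file behind.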

feat: reduce JSON report size (#437)

2053db9

ssrhaso self-assigned this Apr 10, 2026

ssrhaso requested a review from rpreen April 10, 2026 11:46

ssrhaso added 2 commits April 17, 2026 10:15

Merge remote-tracking branch 'origin/main' into 437-reduce-json-repor…

7e6057a

…t-size # Conflicts: # sacroml/attacks/attack.py # sacroml/attacks/likelihood_attack.py

fix: gate LiRA .npz write behind write_report

2b70e0e

Avoids stray .npz files when callers request individual scores in-memory but no report on disk (e.g. MetaAttack sub-runs).

ssrhaso wants to merge 3 commits into main from 437-reduce-json-report-size
