feat: MetaAttack model by shamykyzer · Pull Request #441 · AI-SDC/SACRO-ML

shamykyzer · 2026-04-12T09:53:47Z

Closes #428

Added a MetaAttack class that runs multiple privacy attacks on the same target and combines their per-record results into one DataFrame.

Runs LiRA, QMIA, and/or Structural attacks (with optional repeated runs)
Extracts each record's vulnerability score from every attack
Aggregates scores in two levels:
- Within-attack: mean, std, and consistency across repeated runs
- Cross-attack: MIA ensemble mean (arithmetic + geometric), structural flag, and a count of how many attacks flagged each record
Outputs a vulnerability_matrix.csv and a standard JSON report

meta = MetaAttack(
    attacks=[("lira", {"n_shadow_models": 100}, 5), ("qmia", {}), ("structural", {})],
)
meta.attack(target)
df = meta.vulnerability_df  # one row per record, one column group per attack

available via the factory: factory.attack(target, "meta", attacks=[...])

…sues, remove CatBoost refs

…gitignore

…8-meta-attack

Add MetaAttack(Attack) with validated constructor, _parse_attacks(), and abstract method stubs. Register as "meta" in the attack factory. Supports (name, params, n_reps) tuples with validation against supported attacks (lira, qmia, structural). Loads k-anonymity threshold from ACRO config when not explicitly provided. Includes design spec and staged implementation plan.

Add _run_sub_attack() and the orchestration loop in _attack(). Each sub-attack runs in an isolated subdirectory under output_dir to prevent shadow model and report collisions between runs. MIA attacks (LiRA, QMIA) get report_individual=True injected automatically. Structural always computes record-level results. Sub-attack objects are collected for score extraction in Stage 3.

Add _extract_mia_scores() and _extract_structural_scores() with a field-mapping dict (_MIA_SCORE_FIELDS) for LiRA/QMIA score paths. Wire extraction into _attack() loop: scores collected immediately after each sub-attack run into mia_scores and structural_scores dicts, keyed by attack name with one list per repetition.

- Guard against sub-attack not running: check return value from attack() and raise RuntimeError with clear message if empty - Reject empty attacks list in _parse_attacks with ValueError - Use copy.deepcopy(params) instead of shallow dict(params) to prevent nested mutable values leaking between repetitions - Add logging.basicConfig to match peer attack file conventions

Implement _build_dataframe() with: - Level 1 (within-attack): mean, std, consistency per MIA attack across n_reps; mean k / majority vote for structural reps - Level 2 (cross-attack): arithmetic and geometric mean of MIA per-attack means; binary structural flag; n_vulnerable count - NaN padding for structural columns on test records - Epsilon-stabilised geometric mean to handle log(0) Wire into _attack(): DataFrame stored on self.vulnerability_df after all sub-attacks complete and scores are extracted.

- Clip MIA scores to [0, 1] during extraction to handle LiRA Carlini modes that produce unbounded log-likelihood ratios - Document LiRA score convention: score = CDF under out-distribution, high values = evidence for membership (not against) - Replace 'v is True' identity check with truthiness test 'if v:' to handle numpy bools correctly - Round averaged k-anonymity to int for multi-rep structural runs (fractional k is not meaningful)

Complete the MetaAttack pipeline: - _compute_global_metrics: uses mia_mean as membership predictor with get_metrics() for AUC/TPR/Advantage; falls back to summary dict for structural-only configs - _construct_metadata: enriches report with thresholds and key metrics - _get_attack_metrics_instances: standard report structure with sub-attack summary and full DataFrame under "individual" - CSV export: saves vulnerability_matrix.csv alongside JSON report - _attack() now returns a proper report dict (no more NotImplementedError)

10 test cases covering: - Validation: unsupported attack, invalid tuple, empty list, bad n_reps - Integration: QMIA + structural basic run, DataFrame shape and columns - Structural NaN for test records - Repeated runs: std column exists, consistency in [0, 1] - Threshold effects: lower threshold flags more records - Global metrics: AUC and TPR in [0, 1] - Report structure: standard nested JSON keys - Factory integration: factory.attack(target, "meta", ...) works - CSV export: vulnerability_matrix.csv written and loadable

Demonstrates end-to-end usage: synthetic data, Target construction, MetaAttack with QMIA (2 reps) + structural, DataFrame inspection, summary statistics, and top-10 most vulnerable records.

codecov · 2026-04-12T10:06:39Z

Codecov Report

❌ Patch coverage is 96.04863% with 13 lines in your changes missing coverage. Please review.
✅ Project coverage is 99.10%. Comparing base (1863c65) to head (c08312a).

Files with missing lines	Patch %	Lines
sacroml/attacks/meta_attack.py	93.37%	12 Missing ⚠️
sacroml/attacks/utils.py	96.29%	1 Missing ⚠️

Additional details and impacted files

@@            Coverage Diff             @@
##             main     #441      +/-   ##
==========================================
- Coverage   99.51%   99.10%   -0.42%     
==========================================
  Files          23       25       +2     
  Lines        2692     3009     +317     
==========================================
+ Hits         2679     2982     +303     
- Misses         13       27      +14

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

🚀 New features to boost your workflow:

❄️ Test Analytics: Detect flaky tests, report on failures, and find test suite problems.

…ack)

ssrhaso and others added 25 commits March 28, 2026 06:13

feat: initial skeleton files with docstring and signatures

2b8dece

Merge remote-tracking branch 'origin/main' into HEAD

1b9cd38

feat: QMIA hinge score and label remapping utilities

1beb007

feat: QMIA CatBoost attack with multiclass and (x,y) conditioning

2f4a779

test: QMIA hinge score, multiclass, and attack tests

c73e796

chore: clean up unused files and ignore catboost artifacts

7101aa7

feat: QMIA benchmark scripts with formatted comparison tables

6dd69d7

docs: add QMIA usage and benchmark documentation to README

17b49e6

feat: add QMIAAttack quantile regression membership inference attack

b66b3bb

refactor: switch QMIA to HistGradientBoostingRegressor, fix review is…

50bfe11

…sues, remove CatBoost refs

fix: remove stale CatBoost references from README, factory test, and …

6594c23

…gitignore

fix: resolve ruff lint errors in benchmarks, tests, and summarize script

21f3ffa

style: pre-commit fixes

c50ec07

Merge branch 'main' into 306-quantile-regression

89e759f

fix: tighten factory test assertions, fix conftest RNG seed

891cff1

Merge remote-tracking branch 'origin/306-quantile-regression' into 42…

4d5635e

…8-meta-attack

docs: add MetaAttack example script

0afe169

Demonstrates end-to-end usage: synthetic data, Target construction, MetaAttack with QMIA (2 reps) + structural, DataFrame inspection, summary statistics, and top-10 most vulnerable records.

shamykyzer self-assigned this Apr 12, 2026

shamykyzer added the in-progress label Apr 12, 2026

shamykyzer changed the title ~~428 meta attack~~ MetaAttack Apr 12, 2026

shamykyzer changed the title ~~MetaAttack~~ feat: MetaAttack model Apr 12, 2026

style: pre-commit fixes

b5c008c

shamykyzer and others added 12 commits April 15, 2026 20:05

fix: harden QMIA regressor, calibration drift, margin rank

61f0027

docs: fix smart quotes in QMIA README example

ad8e788

docs: update citation.cff

bcb64e1

style: pre-commit fixes

1ec4e61

refactor: drop extract_true_label_probs, resolve ruff errors

3eec3bb

test: tighten QMIA public FPR tolerance

fb313a3

fix: warn when QMIA skips non-sklearn target remapping

7e245ad

fix: reject non-finite predict_proba in QMIA

42b06b8

perf: enable HGBR early stopping for QMIA fits with n >= 1000

d34b9a1

Merge branch '306-quantile-regression' into 428-meta-attack

eab2b5e

fix: resolve merge conflict in utils.py (merge main into 428-meta-att…

7673f58

…ack)

style: fix ruff lint errors in meta_attack and test

c08312a

shamykyzer marked this pull request as ready for review April 17, 2026 09:23

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: MetaAttack model#441

feat: MetaAttack model#441
shamykyzer wants to merge 38 commits intomainfrom
428-meta-attack

shamykyzer commented Apr 12, 2026

Uh oh!

codecov bot commented Apr 12, 2026 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

shamykyzer commented Apr 12, 2026

Uh oh!

codecov bot commented Apr 12, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Codecov Report

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

codecov bot commented Apr 12, 2026 •

edited

Loading