feat: add QMIAAttack quantile regression membership inference attack (#306)#435

Open
ssrhaso wants to merge 15 commits into main from 306-quantile-regression

Conversation

Contributor

@ssrhaso ssrhaso commented Apr 1, 2026

Summary

Implements the quantile regression membership inference attack from [Bertran et al., NeurIPS 2023](https://arxiv.org/abs/2307.03694), as described in #306.

Trains a single HistGradientBoostingRegressor at quantile level (1 − α) on non-member hinge scores to learn per-sample membership thresholds. A record is predicted as a member when its observed score exceeds the predicted threshold.

No shadow models required.

New files

  • sacroml/attacks/qmia_attack.py - QMIAAttack class (18 tests, 100% coverage)
  • sacroml/attacks/utils.py - hinge score, margin conversion, label helpers
  • sacroml/attacks/report.py - create_qmia_report() with QMIA_INTRODUCTION
  • sacroml/attacks/factory.py - registered as "qmia"
  • tests/attacks/test_qmia_attack.py - 18 tests
  • tests/attacks/test_factory.py - factory integration test
  • tests/attacks/test_report.py - JSON sanitisation test
  • examples/sklearn/benchmark_qmia_*.py - benchmark scripts
  • CHANGELOG.md, README.md - updated

Incidental fixes

  • attribute_attack.py: close matplotlib figures after saving (resource leak)
  • target.py: skip serialising data arrays when dataset module is provided
  • test_sklearn.py: fix LabelEncoder deprecation warning
  • test_structural_attack.py: suppress ConvergenceWarning in MLP tests
  • conftest.py: set matplotlib Agg backend, add session cleanup
  • utils.py: refactor check_and_update_dataset to use dict lookup

Test results

  • 157 passed, 0 failures
  • qmia_attack.py: 100% line coverage

@ssrhaso ssrhaso force-pushed the 306-quantile-regression branch from 247033d to 21f3ffa on April 1, 2026 at 11:26

codecov bot commented Apr 1, 2026

Codecov Report

❌ Patch coverage is 97.18310% with 4 lines in your changes missing coverage. Please review.
✅ Project coverage is 99.39%. Comparing base (a9524af) to head (891cff1).

Files with missing lines Patch % Lines
sacroml/attacks/utils.py 88.57% 4 Missing ⚠️
Additional details and impacted files
@@            Coverage Diff             @@
##             main     #435      +/-   ##
==========================================
- Coverage   99.51%   99.39%   -0.12%     
==========================================
  Files          23       24       +1     
  Lines        2687     2818     +131     
==========================================
+ Hits         2674     2801     +127     
- Misses         13       17       +4     

Contributor

I am not happy with the greatly relaxed bounds on the various attack metrics, for example going from:

assert metrics["TPR"] == pytest.approx(0.91, abs=0.01)
assert metrics["FPR"] == pytest.approx(0.41, abs=0.01)

to

assert 0.5 <= metrics["TPR"] <= 1.0
assert 0.0 <= metrics["FPR"] <= 1.0

no longer checks that the algorithm behaves reproducibly and consistently with previous versions, because the tolerance widens from +/- 1% (which allows for minor differences in floating-point precision across platforms) to +/- 25%.

I know this is partly to do with using randomly created data, but by specifying the random seeds it should still be possible to get reproducible behaviour and keep the acceptable difference in behaviour to within +/- 1% of whatever the new value is.
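As a minimal sketch of the seeding point being made here (the function and metric names are hypothetical stand-ins for the real fixtures and attack code):

```python
import numpy as np

def simulated_attack_metrics(random_state: int) -> dict:
    # Stand-in for an attack run: any pipeline that consumes randomness.
    rng = np.random.default_rng(random_state)
    scores = rng.normal(size=1000)
    labels = rng.integers(0, 2, size=1000)
    preds = scores > 0.0
    tpr = float((preds & (labels == 1)).sum() / (labels == 1).sum())
    fpr = float((preds & (labels == 0)).sum() / (labels == 0).sum())
    return {"TPR": tpr, "FPR": fpr}

# With the seed fixed, repeated runs are identical, so a test can pin
# the expected value with a tight absolute tolerance -- the equivalent
# of pytest.approx(expected, abs=0.01).
first = simulated_attack_metrics(random_state=1)
second = simulated_attack_metrics(random_state=1)
assert first == second
assert abs(first["TPR"] - second["TPR"]) <= 0.01
```

Once the fixture is seeded, the only residual variation is floating-point precision across platforms, which a +/- 1% tolerance comfortably absorbs.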

Contributor


Hello Jim,

Sorry for the oversight. I have tightened the QMIA test assertions to +/- 1% using pytest.approx, following the same pattern as test_factory, and replaced the loose bounds with exact expected values. I also added TPR@FPR threshold assertions, because they test QMIA's ability to detect members at a controlled false positive rate. I think these are worth keeping, but let me know if you think otherwise.

I went more in-depth and just wanted to confirm my understanding, as I do not want to change anything outside of the QMIA scope.

Does test_factory check correctness with tight bounds while the other tests just verify that the code runs and produces valid metrics? And wouldn't it have been better to add random_state to those fixtures and tighten them from the start?

test_lira_attack and test_worst_case_attack use loose assertions like assert 0 <= metrics["TPR"] <= 1 because their fixtures do not set random_state, so the metric values differ between runs.

test_factory, by contrast, runs the full end-to-end pipeline and is the only test with tight assertions, because its get_target fixture sets random_state=1, making the results reproducible.

The QMIA fixtures do have random_state set, but I kept the loose format to stay consistent with the other individual attack tests. Should I tighten those as well, like I did in the factory test?

Please let me know if I have misunderstood anything.

Thanks!
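For reference, a TPR-at-fixed-FPR check of the kind mentioned above can be sketched as follows. The helper name, toy data, and target rate are hypothetical, not the actual test code.

```python
import numpy as np
from sklearn.metrics import roc_curve

def tpr_at_fpr(y_true: np.ndarray, scores: np.ndarray, target_fpr: float) -> float:
    # Largest TPR achievable at or below the target false-positive rate.
    fpr, tpr, _ = roc_curve(y_true, scores)
    return float(tpr[fpr <= target_fpr].max())

rng = np.random.default_rng(1)
# Seeded toy scores: members (label 1) score higher on average.
y = np.concatenate([np.ones(200), np.zeros(200)])
s = np.concatenate([rng.normal(1.0, 1.0, 200), rng.normal(0.0, 1.0, 200)])

value = tpr_at_fpr(y, s, target_fpr=0.1)
assert 0.0 <= value <= 1.0  # a seeded fixture could pin this to +/- 0.01
```

With the fixture seeded, the assertion at the end could be tightened to an exact expected value via pytest.approx, in line with the review comment above.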

@shamykyzer shamykyzer linked an issue Apr 1, 2026 that may be closed by this pull request
@shamykyzer shamykyzer requested a review from jim-smith April 2, 2026 11:24


Development

Successfully merging this pull request may close these issues.

Implement quantile regression and other non-shadow attacks

3 participants