Official implementation of "Bootstrap Wasserstein Alignment for Stable Feature Attribution in Low-Data Regimes" (ICML 2025).
BWA is a geometric framework that stabilizes feature attributions in low-data regimes (
Figure 1: BWA recovers digit structure while Euclidean mean produces noise
- Theorem: Prove Euclidean averaging causes norm collapse in low-data regimes (Lemma 3.1)
- Method: BWA framework using Wasserstein barycenters for geometric consensus
- Empirical: 78% sign accuracy on synthetic data (vs 45% Euclidean) and 35% higher sparsity than SmoothGrad on MNIST
- Uncertainty: Calibrated estimates with 94% empirical coverage
| Method | Sign Accuracy | Norm Preservation |
|---|---|---|
| Vanilla Mean | 45.2% | 0.082 |
| Bootstrap Median | 58.7% | 0.126 |
| Bootstrapped SHAP | 61.3% | 0.143 |
| BWA (Ours) | 78.4% | 0.487 |
| Metric | Vanilla IG | SmoothGrad | BWA |
|---|---|---|---|
| Gini Sparsity | 0.412 | 0.556 | 0.684 |
| ∥e∥₂ Preservation | 0.158 | — | 0.892 |
# Clone repository
git clone https://github.com/mujahidmahfuz/bootstrap-wasserstein-alignment.git
cd bootstrap-wasserstein-alignment
# Install dependencies
pip install -r requirements.txt
# Install BWA package
pip install -e .