Fix NaN t-statistics across 7 locations for consistent undefined inference by igerber · Pull Request #118 · igerber/diff-diff

igerber · 2026-01-31T20:41:36Z

Summary

Replace else 0.0 with else np.nan when SE is non-finite or zero in t-stat calculations across sun_abraham.py (4 locations), triple_diff.py (1 location), and diagnostics.py (2 locations)
Add CI guards returning (np.nan, np.nan) for 4 downstream confidence interval computations (sun_abraham.py: 3, triple_diff.py: 1, diagnostics.py: 1)
Matches the CallawaySantAnna pattern established in PR Add base_period parameter to CallawaySantAnna for pre-treatment effects #97
Add 9 new tests across 3 test files verifying NaN behavior and t_stat = effect/SE consistency
Update docs/methodology/REGISTRY.md with NaN edge case documentation for SunAbraham, TripleDifference, and new PlaceboTests section

Methodology references (required if estimator / math changes)

Method name(s): SunAbraham, TripleDifference, permutation_test, leave_one_out_test
Paper / source link(s):
- Sun, L., & Abraham, S. (2021). Journal of Econometrics, 225(2), 175-199.
- Ortiz-Villavicencio, M., & Sant'Anna, P.H.C. (2025). arXiv:2505.09942.
Any intentional deviations from the source (and why): Defensive enhancement — uses NaN instead of 0.0 for undefined t-statistics (when SE ≤ 0 or non-finite). R's fixest::sunab() may produce Inf/NaN without warning in these edge cases. This matches the CallawaySantAnna NaN convention already in the codebase.

Validation

Tests added/updated:
- tests/test_sun_abraham.py — TestSunAbrahamTStatNaN (4 tests: per-effect, overall, bootstrap, aggregated)
- tests/test_triple_diff.py — TestTripleDifferenceTStatNaN (2 tests: NaN when SE zero, consistency across methods)
- tests/test_diagnostics.py — TestDiagnosticsTStatNaN (3 tests: permutation NaN, LOO NaN, permutation consistency)
Full test suite: 1101 passed
Pattern verification: grep -n "if.*se.*>.*0.*else 0" diff_diff/*.py returns zero matches

Security / privacy

Confirm no secrets/PII in this PR: Yes

Generated with Claude Code

…rence Replace `else 0.0` with `else np.nan` when SE is non-finite or zero in t-stat calculations across sun_abraham.py, triple_diff.py, and diagnostics.py. Add CI guards returning (NaN, NaN) for 4 downstream confidence interval computations. Matches the CallawaySantAnna pattern established in PR #97. Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>

github-actions · 2026-01-31T20:45:20Z

Overall assessment: ⚠️ Needs changes

Executive summary

P1: SunAbraham bootstrap inference can emit NaN t-stats while still returning non‑NaN p‑values/CIs, which conflicts with the new Registry rule for NaN propagation.
P3: Added test for overall p‑value propagation is a no‑op (or True), so it won’t catch regressions.
Analytical (non‑bootstrap) NaN guards look consistent across SunAbraham and TripleDifference.

Methodology

P1 | Impact: Registry says p_value/CI are NaN when t_stat is NaN, but in the bootstrap path t_stat is NaN while p_value/CI remain populated from the bootstrap distribution; this is a documented behavior mismatch for SunAbraham and can mislead users expecting NaN propagation. | Concrete fix: either (a) gate bootstrap overall_att_p_value/overall_att_ci and event‑study p_value/conf_int to (np.nan, np.nan) when the corresponding bootstrap SE is non‑finite/zero, or (b) update the Registry note to explicitly scope the NaN propagation rule to analytical inference only. (diff_diff/sun_abraham.py:L624-L639, docs/methodology/REGISTRY.md:L323-L326)

Code Quality

No issues found.

Performance

No issues found.

Maintainability

No issues found.

Tech Debt

No issues found.

Security

No issues found.

Documentation/Tests

P3 | Impact: assert np.isnan(results.overall_p_value) or True always passes, so the test doesn’t validate NaN propagation and could mask regressions. | Concrete fix: remove or True and assert the expected behavior explicitly (or conditionalize on bootstrap vs analytical inference). (tests/test_sun_abraham.py:L953)

…ertion - Clarify SunAbraham NaN scope in REGISTRY.md: distinguish analytical vs bootstrap inference for p_value/CI when SE/t_stat is NaN - Fix no-op assertion `assert ... or True` in test_sun_abraham.py to properly verify p_value is NaN under analytical inference Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>

igerber · 2026-01-31T20:58:44Z

/ai-review

github-actions · 2026-01-31T21:02:47Z

🔁 AI review rerun (requested by @igerber)

Head SHA: aa660a5e95b4f73f1a747034a1b1ee7aea5a746d

Overall assessment: ✅ Looks good

Executive summary

NaN guard changes align with the Methodology Registry updates for SunAbraham, TripleDifference, and PlaceboTests; no method-spec mismatches found.
Pattern scan found no remaining se > 0 else 0.0 in diff_diff/*.py.
Minor test-coverage gap: new tests are conditional and may not deterministically exercise the zero/NaN SE branch (see P3).

Methodology

No findings.

Code Quality

No findings.

Performance

No findings.

Maintainability

No findings.

Tech Debt

No findings.

Security

No findings.

Documentation/Tests

Severity: P3 | Impact: Several new tests only assert NaN behavior if SE happens to be zero/non‑finite, so the specific regression this PR targets may not be exercised deterministically. | Concrete fix: Add fixtures that force degenerate SE (e.g., constant permutation/LOO distributions or explicit SE overrides/mocking) for each estimator to assert the NaN path. (tests/test_diagnostics.py:L690-L799, tests/test_sun_abraham.py:L902-L1060, tests/test_triple_diff.py:L980-L1030)

igerber merged commit 45b4a29 into main Jan 31, 2026
7 checks passed

igerber deleted the fix/nan-tstat-consistency branch January 31, 2026 21:57

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix NaN t-statistics across 7 locations for consistent undefined inference#118

Fix NaN t-statistics across 7 locations for consistent undefined inference#118
igerber merged 2 commits intomainfrom
fix/nan-tstat-consistency

igerber commented Jan 31, 2026

Uh oh!

github-actions bot commented Jan 31, 2026

Uh oh!

igerber commented Jan 31, 2026

Uh oh!

github-actions bot commented Jan 31, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

igerber commented Jan 31, 2026

Summary

Methodology references (required if estimator / math changes)

Validation

Security / privacy

Uh oh!

github-actions bot commented Jan 31, 2026

Methodology

Code Quality

Performance

Maintainability

Tech Debt

Security

Documentation/Tests

Uh oh!

igerber commented Jan 31, 2026

Uh oh!

github-actions bot commented Jan 31, 2026

Methodology

Code Quality

Performance

Maintainability

Tech Debt

Security

Documentation/Tests

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant