NaN-handling for ensemble CRPS #118

Draft
frazane wants to merge 6 commits into main from nan-handling-crps

Conversation

@frazane (Owner) commented Mar 30, 2026

No description provided.

```python
)
# NaN members are zeroed so their contributions to β_0 and β_1 are 0.
β_0 = B.sum(fct, axis=-1) / M_eff
β_1 = B.sum(fct * B.arange(0, M), axis=-1) / (M_eff * (M_eff - 1.0))
```
Collaborator:
I assume we would need to replace `B.arange(0, M)` with `B.arange(0, M_eff)`, since we're only using the non-NaNs to calculate the score?
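This can be checked with a small numpy sketch (toy values, mimicking the zero-at-the-tail masking described in the diff comment): because the NaN members are zeroed and sorted to the end, the extra weights in `arange(0, M)` only ever multiply zeros, so the two variants coincide at least in this configuration.

```python
import numpy as np

# Toy check of the arange(0, M) vs arange(0, M_eff) question, assuming the
# ensemble is sorted with the NaN member zeroed at the tail.
members = np.array([0.1, 0.3, 0.7, 0.9])       # non-NaN members, sorted
M_eff = members.size
padded = np.concatenate([members, [0.0]])      # NaN member zeroed at the end
M = padded.size

beta1_full = np.sum(padded * np.arange(0, M)) / (M_eff * (M_eff - 1.0))
beta1_eff = np.sum(members * np.arange(0, M_eff)) / (M_eff * (M_eff - 1.0))

# The tail weight (M - 1) multiplies a zeroed member, so both agree here.
assert np.isclose(beta1_full, beta1_eff)
```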

@sallen12 (Collaborator) commented Apr 1, 2026:
We could check whether this is necessary by testing whether the CRPS is the same with the `"fair"` and `"pwm"` estimators when `nan_policy="omit"`.
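The suggested check can be sketched in plain numpy (function names here are illustrative, not the scoringrules API). The fair and PWM estimators are algebraically the same quantity on the compacted ensemble, so they should agree after dropping NaNs:

```python
import numpy as np

def crps_fair(fct, obs):
    # Fair CRPS via the direct double sum over ensemble members.
    M = fct.size
    e_1 = np.mean(np.abs(fct - obs))
    e_2 = np.sum(np.abs(fct[:, None] - fct[None, :])) / (M * (M - 1.0))
    return e_1 - 0.5 * e_2

def crps_pwm(fct, obs):
    # PWM CRPS from probability-weighted moments of the sorted ensemble.
    x = np.sort(fct)
    M = x.size
    beta_0 = x.mean()
    beta_1 = np.sum(x * np.arange(M)) / (M * (M - 1.0))
    return np.mean(np.abs(x - obs)) + beta_0 - 2.0 * beta_1

# Drop NaNs by hand (what nan_policy="omit" effectively does), then compare.
fct = np.array([0.1, np.nan, 0.3, 0.7, 0.9])
clean = fct[~np.isnan(fct)]
assert np.isclose(crps_fair(clean, 0.5), crps_pwm(clean, 0.5))
```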

```python
if nan_mask is not None:
    # Assumes the ensemble is sorted with NaN members zeroed at the end.
    M_eff = B.sum(B.where(nan_mask, B.asarray(0.0), B.asarray(1.0)), axis=-1)
alpha = B.arange(1, M + 1) - 0.5  # shape (M,)
```
@sallen12 (Collaborator) commented Apr 1, 2026:

Similarly here, possibly we should use `M_eff` instead of `M`. This becomes a bit messier, though, since `alpha` would need to have different lengths along different dimensions. Not sure what the best solution is yet.

This could be checked by testing whether these functions give the same output as the numba functions, since the corresponding gufunc uses `M_eff`.

Collaborator:

If I understand correctly, the same issue arises when implementing this for the `int` estimator.

```python
    Default is False.
estimator : str
    Indicates the CRPS estimator to be used.
nan_policy : {'propagate', 'omit', 'raise'}, default 'propagate'
```
Collaborator:

I like these three options.

```python
    return e_1 - 0.5 * e_2
if nan_mask is not None:
    M_eff = B.sum(B.where(nan_mask, B.asarray(0.0), B.asarray(1.0)), axis=-1)
    e_1 = (
```
@sallen12 (Collaborator) commented Apr 1, 2026:

I assume I am overlooking something, but is there a reason for not defining a `B.nansum` function and then just using it instead of `B.sum`? This would avoid needing to specify `nan_mask`, and would keep the code essentially the same as before. If the nan policy is `"propagate"`, we could just fill the resulting vector with NaNs in the correct places, e.g. something along the lines of

```python
s = B.where(B.any(B.isnan(fct), axis=-1), B.nan, s)
```

Alternatively, we could pass `nan_policy` as an argument and use `B.nansum` or `B.sum` depending on whether it is `"propagate"` or `"omit"`.

One issue arises when all ensemble members are NaN, in which case `np.nansum` returns zero, I think. But this is handled already using

```python
B.where(M_eff <= 1, B.asarray(float("nan")), result)
```
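The nansum idea can be sketched with plain numpy standing in for the backend (a hypothetical function, not the PR's code): `np.nansum` plays the role of the proposed `B.nansum`, with the propagate fill and the all-NaN guard applied at the end.

```python
import numpy as np

def crps_fair_nansum(fct, obs, nan_policy="omit"):
    # np.nansum stands in for the proposed B.nansum; fct has shape (..., M).
    fct = np.asarray(fct)
    obs = np.asarray(obs)
    M_eff = np.sum(~np.isnan(fct), axis=-1)
    with np.errstate(invalid="ignore", divide="ignore"):
        e_1 = np.nansum(np.abs(fct - obs[..., None]), axis=-1) / M_eff
        e_2 = np.nansum(
            np.abs(fct[..., None] - fct[..., None, :]), axis=(-2, -1)
        ) / (M_eff * (M_eff - 1.0))
        s = e_1 - 0.5 * e_2
    if nan_policy == "propagate":
        s = np.where(np.any(np.isnan(fct), axis=-1), np.nan, s)
    # All-NaN (or single-member) slices: nansum returns 0, so mask explicitly.
    return np.where(M_eff <= 1, np.nan, s)
```

With `nan_policy="omit"` this matches the score of the compacted ensemble; with `"propagate"` any slice containing a NaN becomes NaN.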

@sallen12 (Collaborator) commented Apr 1, 2026:

An alternative, straightforward approach would be to use the weighted ensemble functions and set the weight of the NaN ensemble members to zero, e.g. by setting

```python
ens_w = B.where(B.isnan(fct), 0.0, ens_w)
```

This would arguably not be the most computationally efficient approach, but it would require only very small changes to the code, and could also readily be used for the energy score, variogram score, kernel scores, etc.
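The zero-weight trick can be illustrated with a minimal nrg-form weighted CRPS in plain numpy (a hypothetical function, not the actual weighted ensemble code). Note that the NaN members also need a finite dummy fill, since `0 * nan` is still NaN; with that, the result reproduces the score of the compacted ensemble:

```python
import numpy as np

def crps_nrg_weighted(fct, obs, w):
    # Minimal nrg-form weighted CRPS (illustrative only).
    w_sum = np.sum(w, axis=-1)
    e_1 = np.sum(w * np.abs(fct - obs[..., None]), axis=-1) / w_sum
    e_2 = np.sum(
        w[..., None] * w[..., None, :] * np.abs(fct[..., None] - fct[..., None, :]),
        axis=(-2, -1),
    ) / w_sum**2
    return e_1 - 0.5 * e_2

fct = np.array([0.1, np.nan, 0.3, 0.7, 0.9])
obs = np.asarray(0.5)
# Zero weight on NaN members, plus a finite dummy fill so the zero weight
# can actually kill the contribution.
w = np.where(np.isnan(fct), 0.0, 1.0)
fct_filled = np.where(np.isnan(fct), 0.0, fct)

clean = fct[~np.isnan(fct)]
ref = crps_nrg_weighted(clean, obs, np.ones(clean.size))
val = crps_nrg_weighted(fct_filled, obs, w)
assert np.isclose(val, ref)
```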

```python
        / M_eff
    )
    result = e_1 - 0.5 * e_2
    return B.where(M_eff == 0, B.asarray(float("nan")), result)
```
Collaborator:

Minor comment, but `M_eff == 0` is used here, whereas `M_eff <= 1` is typically used above. Is there a reason why these are different?
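One plausible reason, worth confirming against the actual estimators: a fair-style normalisation divides by `M_eff - 1` and is already undefined with a single valid member, while an nrg-style one divides by `M_eff**2` and stays finite there. A toy illustration:

```python
import numpy as np

# With one valid member, an nrg-style e_2 (divide by M_eff**2) is zero and
# the score stays finite, while a fair-style e_2 divides by
# M_eff * (M_eff - 1) = 0 and yields NaN. That would justify M_eff <= 1 for
# the fair estimator but only M_eff == 0 for the nrg one.
x, y = np.array([0.4]), 0.5
M_eff = x.size
pair_sum = np.sum(np.abs(x[:, None] - x[None, :]))  # 0.0 for one member

crps_nrg = np.mean(np.abs(x - y)) - 0.5 * pair_sum / M_eff**2
with np.errstate(invalid="ignore"):
    e2_fair = pair_sum / (M_eff * (M_eff - 1.0))    # 0/0 -> nan
```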

```python
    return e_1 - 0.5 * e_2
if nan_mask is not None:
    M_eff = B.sum(B.where(nan_mask, B.asarray(0.0), B.asarray(1.0)), axis=-1)
    fw = B.where(nan_mask, B.asarray(0.0), fw)
```
@sallen12 (Collaborator) commented Apr 1, 2026:

I think this could actually be simplified by just setting `fw = B.where(nan_mask, B.asarray(0.0), fw)`, renormalising, and then calculating the `owcrps` with these updated weights. This would simply put zero weight on the ensemble members that are NaNs. The same should also hold for the `vrcrps` below.

```diff
 for i, forecast in enumerate(fct):
     if i == 0:
-        i = M - 1
+        i = M
```
Collaborator:

Good spot!

```python
def apply_nan_policy_ens_uv(obs, fct, nan_policy="propagate", backend=None):
    """Apply NaN policy to univariate ensemble forecasts (fct shape: ..., M).

    For 'propagate': no-op, returns (obs, fct, None).
```
Collaborator:

Perhaps it's not immediately clear from the docstring that the function will return `(obs, fct, nan_mask)`. I was initially a bit confused about what the extra `None` was for with `'propagate'`.

@frazane (Owner, Author):

Yep, type hints are missing.

"NaN values encountered in input. "
"Use nan_policy='propagate' or nan_policy='omit' to handle NaN values."
)
return obs, fct, None
Collaborator:

Why are the obs and forecasts still returned in this case?

@frazane (Owner, Author):

Because we have to return them when the policy is "omit", and we cannot change the expected number of return values at each call site depending on the policy. The signature must stay the same.
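The fixed-arity contract being described might look like this (a hypothetical sketch, not the PR's implementation): every policy returns the same `(obs, fct, nan_mask)` triple, so call sites always unpack three values.

```python
import numpy as np

def apply_nan_policy_ens_uv(obs, fct, nan_policy="propagate"):
    # Hypothetical sketch: always return (obs, fct, nan_mask) so callers can
    # unpack three values regardless of the policy in effect.
    mask = np.isnan(fct)
    if not mask.any():
        return obs, fct, None
    if nan_policy == "raise":
        raise ValueError("NaN values encountered in input.")
    if nan_policy == "omit":
        return obs, np.where(mask, 0.0, fct), mask  # zeroed members + mask
    return obs, fct, None  # 'propagate': leave the NaNs in place

# Callers unpack identically for every policy:
obs, fct, mask = apply_nan_policy_ens_uv(0.5, np.array([0.1, np.nan, 0.3]), "omit")
```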

The array backend uses a roll-based masking approach for the circular kernel,
which masks out pairs involving NaN members but also loses the wrap-around pair
when NaN sits at the tail. This means the array-backend result does NOT match
the clean-ensemble result (a known limitation). The numba gufuncs correctly
Collaborator:

Ah yes, interesting point. I don't see a nice way round this.

```python
_FCT_CLEAN = np.array([0.1, 0.3, 0.7, 0.9, 1.1])
_FCT_WITH_NAN = np.array([0.1, np.nan, 0.3, np.nan, 0.7, 0.9, 1.1])
# NaN members of _FCT_WITH_NAN replaced by _OBS — same shape, no NaN, used for
# building 2-row batches where one row is clean and the other contains NaN.
```
Collaborator:

I don't understand this yet. Setting the NaNs to the obs will not change the first term of the CRPS, since those differences will all be zero, but it will change the second term.
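A quick numeric check of this observation (toy values, not the PR's test arrays): appending an obs-valued member adds only zero entries to the first term's sum, but the divisor and the pairwise second term both change, so the score moves.

```python
import numpy as np

def crps_fair(x, y):
    # Fair CRPS via the direct double sum.
    M = x.size
    e_1 = np.mean(np.abs(x - y))
    e_2 = np.sum(np.abs(x[:, None] - x[None, :])) / (M * (M - 1.0))
    return e_1 - 0.5 * e_2

fct = np.array([0.1, 0.3, 0.7, 0.9])
obs = 0.5
filled = np.append(fct, obs)  # a "NaN" member replaced by the observation

# The new member contributes |obs - obs| = 0 to the first sum, yet M and the
# pairwise sum both change, so the two scores differ.
print(crps_fair(fct, obs), crps_fair(filled, obs))
```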
