Add UR2CUTE model for intermittent demand forecasting #1401

FH-Prevail · 2025-10-24T19:03:53Z

Summary

This PR adds UR2CUTE (Using Repetitively 2 CNNs for Unsteady Timeseries Estimation), a specialized neural forecasting model designed for intermittent time series data, to the NeuralForecast library.

Model Description

UR2CUTE employs a two-step hurdle approach combining:

Classification CNN: Predicts occurrence of demand (zero vs. non-zero)
Regression CNN: Estimates magnitude of demand when non-zero

This dual-phase architecture significantly improves forecasting accuracy for intermittent demand patterns, which are common in retail, inventory management, and supply chain forecasting.

Changes

Files Added

neuralforecast/models/ur2cute.py - Complete UR2CUTE implementation (~545 lines)

Files Modified

neuralforecast/models/__init__.py - Added UR2CUTE to model exports
README.md - Added UR2CUTE to the list of available models

Implementation Details

Architecture

CNNClassifier (Classification Branch):

Conv1D(1→64) → Conv1D(64→64) → MaxPool → Dropout(0.4)
→ FC(32) → FC(h) → Sigmoid → Probabilities [batch, h]

CNNRegressor (Regression Branch):

Conv1D(1→32) → Conv1D(32→32) → MaxPool → Dropout(0.2)
→ FC(46) → FC(h) → ReLU → Quantities [batch, h]

Combined Prediction:

forecast[t] = regression_output[t] if classification_prob[t] > threshold else 0

Key Features

Inherits from BaseModel: Full integration with NeuralForecast framework
Custom Training Loop: Overrides training_step() and validation_step() for combined loss
Combined Loss Function: total_loss = α × BCE_loss + (1-α) × regression_loss
Flexible Thresholding: Configurable probability threshold for zero/non-zero classification
Comprehensive Logging: Tracks classification accuracy, BCE loss, and regression loss separately
PyTorch Lightning: Automatic GPU support, early stopping, checkpointing

Model Configuration

class UR2CUTE(BaseModel):
    EXOGENOUS_FUTR = False   # Can be extended in future
    EXOGENOUS_HIST = False   # Can be extended in future
    EXOGENOUS_STAT = False   # Can be extended in future
    MULTIVARIATE = False     # Univariate forecasting
    RECURRENT = False        # Direct multi-step forecasting

Usage Example

from neuralforecast import NeuralForecast
from neuralforecast.models import UR2CUTE
import pandas as pd

# Prepare intermittent demand data
df = pd.DataFrame({
    'unique_id': ['product_1'] * 100,
    'ds': pd.date_range('2020-01-01', periods=100, freq='W'),
    'y': [0, 5, 0, 0, 12, 0, 0, 0, 7, 0, ...]  # Many zeros
})

# Create and train model
model = UR2CUTE(
    h=12,                              # Forecast 12 periods ahead
    input_size=24,                     # Use 24 periods history
    classification_threshold=0.5,       # Probability threshold
    classification_weight=0.3,          # Loss weighting (30% classification, 70% regression)
    dropout_classification=0.4,
    dropout_regression=0.2,
    max_steps=1000,
    learning_rate=0.001,
)

# Fit and predict using NeuralForecast interface
nf = NeuralForecast(models=[model], freq='W')
nf.fit(df=df)
forecasts = nf.predict()

print(forecasts['UR2CUTE'])

Parameters

Parameter	Type	Default	Description
`h`	int	Required	Forecast horizon
`input_size`	int	-1	Historical window size (-1 = automatic)
`classification_threshold`	float	0.5	Probability threshold for zero/non-zero (0-1)
`dropout_classification`	float	0.4	Dropout rate for classification CNN
`dropout_regression`	float	0.2	Dropout rate for regression CNN
`classification_weight`	float	0.3	Weight for classification loss in combined loss
`learning_rate`	float	0.001	Optimizer learning rate
`max_steps`	int	1000	Maximum training iterations
`batch_size`	int	32	Training batch size

When to Use UR2CUTE

Ideal Use Cases ✅

Intermittent demand with >30% zero periods
Lumpy demand patterns (irregular, unpredictable)
Zero-inflated time series
Retail/inventory forecasting with sporadic sales
Spare parts demand
Seasonal products with off-seasons

Not Recommended ❌

Continuous demand with <10% zeros (use NBEATS, NHITS instead)
Smooth, predictable trends
Very short time series (<50 observations)

Technical Validation

Compatibility

✅ Follows BaseModel API conventions
✅ Compatible with NeuralForecast's windowing system
✅ Works with standard loss functions (MAE, MSE, etc.)
✅ Supports validation splits and early stopping
✅ GPU acceleration via PyTorch Lightning
✅ Multiple time series support
✅ Familiar sklearn-style .fit() and .predict() interface

Code Quality

✅ Comprehensive docstrings (NumPy format)
✅ Type hints where appropriate
✅ Clear variable naming
✅ Follows PEP 8 style guidelines
✅ No external dependencies beyond NeuralForecast requirements

Scientific Background

Original Paper:
Mirshahi, S., Brandtner, P., & Komínková Oplatková, Z. (2024).
Intermittent Time Series Demand Forecasting Using Dual Convolutional Neural Networks.
MENDEL — Soft Computing Journal, 30(1).

Key Findings from Paper:

UR2CUTE outperforms traditional methods (Croston, SBA, TSB) on intermittent demand
Superior performance compared to XGBoost, Random Forest, Prophet on sparse data
Particularly effective at predicting zero-demand periods
Improved R² and reduced MAE/RMSE on real-world retail datasets

Testing Recommendations

Basic Integration Test

import pandas as pd
import numpy as np
from neuralforecast import NeuralForecast
from neuralforecast.models import UR2CUTE

# Generate intermittent data
np.random.seed(42)
df = pd.DataFrame({
    'unique_id': 'test_series',
    'ds': pd.date_range('2020-01-01', periods=150, freq='W'),
    'y': np.random.poisson(3, 150) * np.random.binomial(1, 0.3, 150)
})

# Train model
model = UR2CUTE(h=12, input_size=24, max_steps=50)
nf = NeuralForecast(models=[model], freq='W')
nf.fit(df=df)

# Predict
forecasts = nf.predict()
assert 'UR2CUTE' in forecasts.columns
assert len(forecasts) == 12
assert (forecasts['UR2CUTE'] >= 0).all()  # Non-negative predictions

print("✓ UR2CUTE integration test passed!")

Comparison Test

from neuralforecast.models import UR2CUTE, NBEATS, NHITS

models = [
    UR2CUTE(h=12, input_size=24, max_steps=100),
    NBEATS(h=12, input_size=24, max_steps=100),
    NHITS(h=12, input_size=24, max_steps=100),
]

nf = NeuralForecast(models=models, freq='W')
nf.fit(df=df, val_size=24)
forecasts = nf.predict()

# All models should produce forecasts
assert all(model in forecasts.columns for model in ['UR2CUTE', 'NBEATS', 'NHITS'])
print("✓ Multi-model comparison test passed!")

Performance Considerations

Training Time: Similar to NBEATS/NHITS (depends on max_steps)
Memory Usage: Two CNN networks increase memory ~1.5x vs single network
Inference Speed: Fast - direct multi-step forecasting (no autoregression)
GPU Benefit: Significant speedup on GPU due to CNN operations

Future Enhancements (Not in this PR)

Potential future improvements:

Exogenous Variables Support: Extend to support futr_exog, hist_exog, stat_exog
Multivariate Extension: Support multivariate intermittent forecasting
Attention Mechanisms: Add attention layers for long sequences
Quantile Predictions: Extend to probabilistic forecasting
Transfer Learning: Pre-trained weights for common intermittent patterns

Breaking Changes

None - this is a new model addition with no changes to existing functionality.

Checklist

Additional Notes

Design Decisions

Why override training_step()?
The two-step approach requires computing separate losses (BCE for classification, MAE/MSE for regression) which isn't easily handled by a single loss function. Overriding allows explicit control over the combined loss calculation.
Why separate CNNs?
Separate architectures allow each branch to specialize: the classifier focuses on occurrence patterns while the regressor focuses on magnitude patterns. This matches the original paper's design.
Why no exogenous support initially?
To keep the initial implementation clean and focused on the core two-step architecture. Exogenous support can be added in a future enhancement.

Maintainability

Code is well-commented and documented
Clear separation between classification and regression logic
Easy to extend or modify individual components
Follows established NeuralForecast patterns

Related Issues

This addresses the need for specialized models for intermittent/sparse time series, which is a common pain point in retail and inventory forecasting applications.

Authors

Original UR2CUTE: Sina Mirshahi, Patrick Brandtner, Zuzana Komínková Oplatková
NeuralForecast Integration: [Your Name/GitHub Username]

License

This implementation maintains compatibility with NeuralForecast's Apache 2.0 license.

Request for Review

Please review:

API compatibility: Does it properly integrate with BaseModel?
Code quality: Is the code clean, readable, and well-documented?
Use case: Is this a valuable addition for intermittent forecasting?
Documentation: Is the usage clear for end users?
Testing: Are there any additional tests you'd recommend?

Thank you for considering this contribution!

- Add UR2CUTE (Using Repetitively 2 CNNs for Unsteady Timeseries Estimation) - Two-step hurdle approach: Classification CNN + Regression CNN - Specialized for intermittent/sparse time series (>30% zeros) - Inherits from BaseModel with full NeuralForecast integration - Comprehensive documentation and docstrings - Reference: Mirshahi et al. (2024) MENDEL Journal"

CLAassistant · 2025-10-24T19:04:00Z

Thank you for your submission! We really appreciate it. Like many open source projects, we ask that you all sign our Contributor License Agreement before we can accept your contribution.
1 out of 2 committers have signed the CLA.

✅ FH-Prevail
❌ sina

sina seems not to be a GitHub user. You need a GitHub account to be able to sign the CLA. If you have already a GitHub account, please add the email address used for this commit to your account.
_{You have signed the CLA already but the status is still pending? Let us recheck it.}

CLAassistant · 2025-10-24T19:04:00Z

Thank you for your submission! We really appreciate it. Like many open source projects, we ask that you sign our Contributor License Agreement before we can accept your contribution.

sina seems not to be a GitHub user. You need a GitHub account to be able to sign the CLA. If you have already a GitHub account, please add the email address used for this commit to your account.
_{You have signed the CLA already but the status is still pending? Let us recheck it.}

FH-Prevail and others added 2 commits November 13, 2025 14:52

Merge branch 'main' into main

ef1cf66

Fix bugs

d6dcec3

FH-Prevail closed this Nov 15, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Add UR2CUTE model for intermittent demand forecasting #1401

Add UR2CUTE model for intermittent demand forecasting #1401

Uh oh!

FH-Prevail commented Oct 24, 2025

Uh oh!

CLAassistant commented Oct 24, 2025 •

edited

Loading

Uh oh!

CLAassistant commented Oct 24, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Add UR2CUTE model for intermittent demand forecasting #1401

Add UR2CUTE model for intermittent demand forecasting #1401

Uh oh!

Conversation

FH-Prevail commented Oct 24, 2025

Summary

Model Description

Changes

Files Added

Files Modified

Implementation Details

Architecture

Key Features

Model Configuration

Usage Example

Parameters

When to Use UR2CUTE

Ideal Use Cases ✅

Not Recommended ❌

Technical Validation

Compatibility

Code Quality

Scientific Background

Testing Recommendations

Basic Integration Test

Comparison Test

Performance Considerations

Future Enhancements (Not in this PR)

Breaking Changes

Checklist

Additional Notes

Design Decisions

Maintainability

Related Issues

Authors

License

Request for Review

Uh oh!

CLAassistant commented Oct 24, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

CLAassistant commented Oct 24, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

CLAassistant commented Oct 24, 2025 •

edited

Loading