Add DTW, nDTW, and SDTW trajectory metrics by nalinraut · Pull Request #12 · AmeyaWagh/robometric-frame

nalinraut · 2026-02-19T14:48:06Z

Add Dynamic Time Warping based metrics for evaluating trajectories that may have different lengths or temporal alignment. These metrics are particularly useful for evaluating VLA models and policies using action chunking (e.g., ACT, Diffusion Policy).

New metrics:

DTWDistance: Raw DTW distance using dynamic programming (lower=better)
NormalizedDTW: Mapped to [0,1] using exp(-DTW/(|R|*d)) (higher=better)
SuccessWeightedDTW: nDTW weighted by task success (SDTW = nDTW * Success)

Key features:

Support for trajectories of different lengths (core advantage over MSE/ATE)
Tolerates temporal misalignment (hesitation, speed differences)
Optional custom normalization factor
Full torchmetrics.Metric compatibility with distributed training support
Comprehensive test suite and example usage

Reference: Ilharco et al., "General Evaluation for Instruction Conditioned Navigation using Dynamic Time Warping," arXiv:1907.05446, NeurIPS ViGIL Workshop, 2019.

Add Dynamic Time Warping based metrics for evaluating trajectories that may have different lengths or temporal alignment. These metrics are particularly useful for evaluating VLA models and policies using action chunking (e.g., ACT, Diffusion Policy). New metrics: - DTWDistance: Raw DTW distance using dynamic programming (lower=better) - NormalizedDTW: Mapped to [0,1] using exp(-DTW/(|R|*d)) (higher=better) - SuccessWeightedDTW: nDTW weighted by task success (SDTW = nDTW * Success) Key features: - Support for trajectories of different lengths (core advantage over MSE/ATE) - Tolerates temporal misalignment (hesitation, speed differences) - Optional custom normalization factor - Full torchmetrics.Metric compatibility with distributed training support - Comprehensive test suite and example usage Reference: Ilharco et al., "General Evaluation for Instruction Conditioned Navigation using Dynamic Time Warping," arXiv:1907.05446, NeurIPS ViGIL Workshop, 2019.

AmeyaWagh · 2026-02-22T03:35:00Z

src/robometric_frame/trajectory_quality/dtw.py

+from torchmetrics import Metric
+
+
+def _compute_dtw(predicted: Tensor, reference: Tensor) -> Tensor:


Does this need to be an independenct function? can this be part of the metric class?

AmeyaWagh · 2026-02-22T17:13:00Z

src/robometric_frame/trajectory_quality/dtw.py

+    accumulated[0, 0] = cost_matrix[0, 0]
+
+    # Initialize first column
+    for i in range(1, t_pred):


Can we avoid loops and use torch.linspace to index instead?

AmeyaWagh reviewed Feb 22, 2026

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add DTW, nDTW, and SDTW trajectory metrics#12

Add DTW, nDTW, and SDTW trajectory metrics#12
nalinraut wants to merge 1 commit intoAmeyaWagh:mainfrom
nalinraut:feature/dtw-metrics

nalinraut commented Feb 19, 2026

Uh oh!

AmeyaWagh Feb 22, 2026

Uh oh!

AmeyaWagh Feb 22, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

		from torchmetrics import Metric


		def _compute_dtw(predicted: Tensor, reference: Tensor) -> Tensor:

Conversation

nalinraut commented Feb 19, 2026

Uh oh!

AmeyaWagh Feb 22, 2026

Choose a reason for hiding this comment

Uh oh!

AmeyaWagh Feb 22, 2026

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants