Allow explicit compatibility tensor dtype casts by lesj0610 · Pull Request #167 · turboderp-org/exllamav3

lesj0610 · 2026-03-07T03:29:53Z

This is a small compatibility improvement for architecture-preserving checkpoint format variation. The goal is not to support arbitrary model-specific rewrites, but to make required state tensor loading more robust when modified checkpoints keep the same module graph while changing storage dtype.

Some modified checkpoints resave required state tensors at a lower precision than the receiving op expects. This can happen after resaves, merges, abliterations, repacks, or other tooling that keeps the architecture but rewrites tensor storage or dtype.

This adds an opt-in cast_dtype argument to SafetensorsCollection.get_tensor() and uses it for GatedDeltaNet A_log, which the fused path expects in float.

The cast is explicit, happens after materialization, and leaves unrelated tensors such as dt_bias on their current path.

Adds focused tests for cast and no-cast behavior.

Allow explicit compatibility tensor dtype casts

d98f8b0

lesj0610 force-pushed the fix/checkpoint-required-dtype-casts branch from b8bb790 to d98f8b0 Compare March 11, 2026 15:52

lesj0610 mentioned this pull request Mar 11, 2026

feat(deepseek): add DeepSeek MLA and VL2 support #158

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Allow explicit compatibility tensor dtype casts#167

Allow explicit compatibility tensor dtype casts#167
lesj0610 wants to merge 1 commit intoturboderp-org:devfrom
lesj0610:fix/checkpoint-required-dtype-casts

lesj0610 commented Mar 7, 2026 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Uh oh!

Conversation

lesj0610 commented Mar 7, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

lesj0610 commented Mar 7, 2026 •

edited

Loading