+In both modes an **unk fallback** advances one full UTF-8 character (not one
+byte) when no vocabulary piece covers the current position, preventing split
+multi-byte sequences from producing garbage tokens.
+
+Scores are written by `convert.py --tokenizer` (via
+`tok.sp_model.GetScore(i)`) and preserved through quantization by
+`ltx-quantize` (via `gguf_set_kv`).
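
The unk fallback described in this hunk can be illustrated with a short Python sketch of the greedy longest-match mode. This is not the project's implementation; `utf8_char_len`, `greedy_tokenize`, the toy vocabulary, and `unk_id=0` are all hypothetical names and values chosen for illustration. The point it shows is that the fallback reads the UTF-8 lead byte to decide how many bytes to skip, so a 3-byte character yields one unk token rather than three.

```python
def utf8_char_len(lead: int) -> int:
    """Byte length of the UTF-8 sequence introduced by lead byte `lead`."""
    if lead < 0x80:
        return 1              # 0xxxxxxx: ASCII
    if lead >> 5 == 0b110:
        return 2              # 110xxxxx
    if lead >> 4 == 0b1110:
        return 3              # 1110xxxx
    if lead >> 3 == 0b11110:
        return 4              # 11110xxx
    return 1                  # stray continuation/invalid byte: step one byte


def greedy_tokenize(text: str, vocab: dict[str, int], unk_id: int = 0) -> list[int]:
    """Greedy longest-match over raw bytes with a whole-character unk fallback."""
    data = text.encode("utf-8")
    pieces = {piece.encode("utf-8"): tok for piece, tok in vocab.items()}
    max_len = max((len(p) for p in pieces), default=0)

    ids, pos = [], 0
    while pos < len(data):
        # Try the longest candidate first, shrinking until a piece matches.
        for end in range(min(len(data), pos + max_len), pos, -1):
            tok = pieces.get(data[pos:end])
            if tok is not None:
                ids.append(tok)
                pos = end
                break
        else:
            # No piece covers this position: emit unk and skip one full
            # UTF-8 character, never a fraction of a multi-byte sequence.
            ids.append(unk_id)
            pos = min(pos + utf8_char_len(data[pos]), len(data))
    return ids


if __name__ == "__main__":
    # The euro sign is three bytes in UTF-8 and is absent from the toy vocab.
    print(greedy_tokenize("he€llo", {"he": 1, "llo": 2}))
```

A byte-at-a-time fallback on the same input would emit three unk tokens for the single euro sign, which is the "garbage tokens" case the text refers to.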
 
 ---
 
@@ -661,8 +676,8 @@ and where contributions are most welcome.
 | 2 | **VAE encoder** | Only the first `conv_in` layer is used; pseudo-encoding fallback | Implement full encoder stack for accurate I2V latent inversion |
 | 3 | **AdaLN-single** | Timestep embedding is computed but per-block scale/shift is not fully applied | Apply `ada_params` chunks as scale/shift in each block's norms |
 | 4 | **3-D RoPE** | Positional embeddings are not yet applied | Add rotary embeddings along (t, h, w) axes to Q and K tensors |
-| 5 | **T5 tokenizer** | Whitespace-split + per-char fallback | Replace with a proper SentencePiece unigram/BPE tokenizer |
-| 6 | **`ltx-quantize` metadata** | String arrays (tokenizer vocab) are skipped during quantization | Copy `GGUF_TYPE_ARRAY` entries in the KV copy loop |
+| 5 | **T5 tokenizer** | ~~Whitespace-split + per-char fallback~~ **Fixed**: full SentencePiece unigram Viterbi DP (when scores in GGUF) or greedy longest-match | — |
+| 6 | **`ltx-quantize` metadata** | ~~String arrays (tokenizer vocab) are skipped during quantization~~ **Fixed**: `gguf_set_kv` copies all KV pairs including arrays | — |
 | 7 | **Persistent scratch** | DiT allocates 1 GB of ggml scratch per forward call | Pre-allocate a single scratch context and reset between calls |
 | 8 | **Batch size > 1** | Only batch=1 is implemented | Add batch dimension to enable parallel generation |
 | 9 | **CFG single-pass** | CFG requires two full forward passes | Implement single-pass CFG by duplicating the batch |
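
Row 5 of the table mentions a unigram Viterbi DP that is used when piece scores are available. As a rough illustration only, the sketch below shows the general technique in Python: for each end position it keeps the best-scoring segmentation, then backtracks. The function name `viterbi_tokenize`, the `(id, score)` vocabulary layout, and the `unk_score` penalty are assumptions made for this example and are not taken from the project. Because it iterates over Python code points, the unk step covers one whole character by construction; a byte-oriented implementation would have to derive that length from the UTF-8 lead byte.

```python
import math


def viterbi_tokenize(
    text: str,
    vocab: dict[str, tuple[int, float]],  # piece -> (token id, log-prob score)
    unk_id: int = 0,
    unk_score: float = -10.0,             # assumed penalty for an unknown char
) -> list[int]:
    """Highest-scoring segmentation of `text` under a unigram piece model."""
    n = len(text)
    max_len = max((len(p) for p in vocab), default=0)

    best = [-math.inf] * (n + 1)          # best[i]: best score of text[:i]
    back = [(0, unk_id)] * (n + 1)        # back[i]: (start, token) of last piece
    best[0] = 0.0

    for end in range(1, n + 1):
        # Fallback candidate: treat the previous character as unk.
        best[end] = best[end - 1] + unk_score
        back[end] = (end - 1, unk_id)
        # Try every vocabulary piece that ends at `end`.
        for start in range(max(0, end - max_len), end):
            entry = vocab.get(text[start:end])
            if entry is None:
                continue
            tok, score = entry
            if best[start] + score > best[end]:
                best[end] = best[start] + score
                back[end] = (start, tok)

    # Walk the back-pointers from the end of the text to the beginning.
    ids, pos = [], n
    while pos > 0:
        start, tok = back[pos]
        ids.append(tok)
        pos = start
    return ids[::-1]


if __name__ == "__main__":
    toy = {"a": (1, -1.0), "b": (2, -1.0), "ab": (3, -1.5),
           "abc": (4, -5.0), "c": (5, -1.0)}
    print(viterbi_tokenize("abc", toy))
```

With these toy scores the DP prefers `ab` + `c` (total -2.5) over the single piece `abc` (-5.0), whereas a greedy longest-match pass would pick `abc`. That difference is why scores stored in the model file matter for reproducing SentencePiece unigram behaviour.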