Pull requests: turboderp-org/exllamav3
Pull requests list
fix(formatron): handle duplicate BPE token IDs and kbnf mask/accept inconsistency
#170 opened Mar 16, 2026 by lesj0610
test(eval): add flashinfer regression and benchmark coverage
#157 opened Mar 2, 2026 by lesj0610
feat(attn): add switchable flash-attn and flashinfer backends
#156 opened Mar 2, 2026 by lesj0610
Switch ExLlamaV3 to flashinfer and add MLA/Qwen3.5 support
#152 opened Feb 25, 2026 by lesj0610