Pull requests: vllm-project/llm-compressor
Pull requests list
[Transform] SpinQuant fix OOM
transforms
Related to transforms-based modifiers like SpinQuant and Quip
#1976
opened Oct 29, 2025 by
zhanglei1172
[AWQ] Allow users to disable quantization during AWQ
#1973
opened Oct 28, 2025 by
brian-dellabetta
Draft
Modernize entrypoints module with type hints and use generic types
ready
When a PR is ready for review
#1965
opened Oct 25, 2025 by
sugatmahanti
[Misc] Remove NeuralMagic copyright
ready
When a PR is ready for review
#1964
opened Oct 24, 2025 by
kylesayrs
Fixing untie to be used only as needed and automatic
#1963
opened Oct 24, 2025 by
HDCharles
[Oneshot] Add validation for empty dataset and enhance oneshot function parameters
#1957
opened Oct 21, 2025 by
ArkaSanka
[tests] Update lm_eval VL tests to qwen 3
ready
When a PR is ready for review
#1953
opened Oct 20, 2025 by
brian-dellabetta
3 tasks done
[Attention] Support FP4 attention quantization
nvfp4
For any PR / issue related to NVFP4 support
#1924
opened Oct 14, 2025 by
kylesayrs
[Cache] Fix environment variable handling for offline mode
ready
When a PR is ready for review
#1902
opened Oct 7, 2025 by
ralphbean
[Training] Fix tokenizer attribute of SessionMixin
ready
When a PR is ready for review
#1895
opened Oct 1, 2025 by
kylesayrs
[Dependencies] update lm_eval version pin
ready
When a PR is ready for review
#1862
opened Sep 24, 2025 by
brian-dellabetta
[Logging] clean up CompressionLogger verbosity
ready
When a PR is ready for review
#1861
opened Sep 23, 2025 by
brian-dellabetta
[MoE Calibration] Simplify MoE calibration interface
llama
For any PR / issue related to Llama herd support
qwen
For any PR / issue related to Qwen support
ready
When a PR is ready for review
#1851
opened Sep 22, 2025 by
sairampillai