Transforms benchmarks experiment #14

brian-dellabetta · 2025-10-28T16:18:04Z

This PR leverages the pre-built tasks LLMCompresorTask and LMEvalTask to create an example script to run several llm-compressor recipes at a time, in a single pipeline, in parallel. Each model is then run through an evaluation using lm_eval. Results available at https://spaces.redhat.com/spaces/vLLM/pages/714211727/Transforms+Benchmarks+v1

I had to make a few updates, pinning dependency versions so that they would install correctly in clearml, based on feedback from @Chibukach
I added some logic to make sure the LMEvalTask installs vllm with VLLM_USE_PRECOMIPLED=1. This was needed to run an experiment with a transforms feature on main that hadn't been released yet. I think we want to always do this, but I can move it to an input to the LMEvalTask constructor if that is preferred.

Signed-off-by: Brian Dellabetta <bdellabe@redhat.com>

brian-dellabetta added 11 commits October 7, 2025 17:03

initial commit

01bdcd7

Signed-off-by: Brian Dellabetta <bdellabe@redhat.com>

p2

0b60c68

Signed-off-by: Brian Dellabetta <bdellabe@redhat.com>

llmcompressor task updates

ad0e36e

Signed-off-by: Brian Dellabetta <bdellabe@redhat.com>

transforms benchmark v1

2e369f7

Signed-off-by: Brian Dellabetta <bdellabe@redhat.com>

hf hub dep version

c0bea4e

Signed-off-by: Brian Dellabetta <bdellabe@redhat.com>

set VLLM_USE_PRECOMPILED

49d1133

Signed-off-by: Brian Dellabetta <bdellabe@redhat.com>

typo

9cbc2be

Signed-off-by: Brian Dellabetta <bdellabe@redhat.com>

networkx pin

ad921a0

Signed-off-by: Brian Dellabetta <bdellabe@redhat.com>

quip uv

697675c

Signed-off-by: Brian Dellabetta <bdellabe@redhat.com>

spinquant experiments

62f2ceb

Signed-off-by: Brian Dellabetta <bdellabe@redhat.com>

spinquant experiments p2

e94c2b5

Signed-off-by: Brian Dellabetta <bdellabe@redhat.com>

brian-dellabetta requested review from Chibukach and anmarques October 28, 2025 16:18

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

Transforms benchmarks experiment #14

Transforms benchmarks experiment #14

Uh oh!

brian-dellabetta commented Oct 28, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Uh oh!

Transforms benchmarks experiment #14

Are you sure you want to change the base?

Transforms benchmarks experiment #14

Uh oh!

Conversation

brian-dellabetta commented Oct 28, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant