[MM][Bugfix] Add MoE verification for multi-modal models #3897
Conversation
👋 Hi! Thank you for contributing to the vLLM Ascend project. The following points will speed up your PR merge:
If CI fails, you can run linting and testing checks locally according to Contributing and Testing.
Code Review
This pull request fixes a bug in is_moe_model to correctly identify multi-modal MoE models. The change introduces a check for text_config in multi-modal model configurations. While the overall logic is sound, I've identified a couple of areas for improvement. Firstly, a new global variable is modified within a function, which should be a module-level constant. Secondly, the method for detecting MoE models can be made more precise and efficient by directly checking for the num_experts key, as mentioned in the PR description, instead of using a substring search. My review comments provide specific suggestions to address these points.
Signed-off-by: shen-shanshan <467638484@qq.com>

Force-pushed from c1d1b74 to fb67914
CC @wangxiyuan

The CI has passed @wangxiyuan
What this PR does / why we need it?
Fix #3891.
The empty `moe_comm_method` in the above issue is caused by an incorrect check for MoE models. Specifically, the method `is_moe_model` only checks whether a text-only model is a MoE model, without considering multi-modal models, e.g., VL and Omni. The config of a text-only model looks like:
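For illustration, here is a minimal sketch of what such a `config.json` might contain, shown as a Python dict (field names and values follow typical Qwen3-MoE-style configs and are assumptions, not copied from this PR):

```python
# Hypothetical excerpt of a text-only MoE model's config.json.
# Note that "num_experts" sits at the top level of the config.
text_only_config = {
    "architectures": ["Qwen3MoeForCausalLM"],
    "hidden_size": 2048,
    "num_experts": 128,       # MoE marker at the top level
    "num_experts_per_tok": 8,
}
```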
We can verify such a MoE model by checking whether there is a key named `num_experts`. This is different from VL or Omni models, whose config files look like:
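Again a hedged, illustrative sketch (the architecture name and fields are assumptions): in the multi-modal case the MoE fields are nested under `text_config`:

```python
# Hypothetical excerpt of a VL/Omni model's config.json.
# The MoE marker is nested inside "text_config", so a top-level
# lookup for "num_experts" would miss it.
multimodal_config = {
    "architectures": ["Qwen3VLMoeForConditionalGeneration"],
    "text_config": {
        "hidden_size": 2048,
        "num_experts": 128,   # MoE marker nested one level down
        "num_experts_per_tok": 8,
    },
    "vision_config": {
        "hidden_size": 1152,
    },
}
```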
We can verify these MoE models by checking whether there is a key named `num_experts` in the `text_config` dict.

Part of #3508.
Does this PR introduce any user-facing change?
How was this patch tested?
Update: 2025/11/03
Check the config dict recursively to find whether it has a key containing "expert", without checking the model architecture.
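A minimal sketch of what such a recursive check could look like (the helper name and the use of the HF config's `to_dict()` are assumptions for illustration, not the exact code in this PR):

```python
from typing import Any


def _config_has_expert_key(config: dict[str, Any]) -> bool:
    """Recursively check whether any key in a (possibly nested)
    config dict contains the substring "expert"."""
    for key, value in config.items():
        if "expert" in key:
            return True
        if isinstance(value, dict) and _config_has_expert_key(value):
            return True
    return False


# Usage sketch: hf_config is a transformers PretrainedConfig;
# to_dict() yields a plain dict, including nested text_config.
# is_moe = _config_has_expert_key(hf_config.to_dict())
```

This covers both the top-level `num_experts` of text-only models and the nested `text_config["num_experts"]` of VL/Omni models without hard-coding either layout.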
It is worth noting that we can't verify a model by checking whether it contains a `FusedMoE` module, because `is_moe_model` is called before model loading, e.g., when updating the ACLGraph config during platform initialization.