
Conversation

@hmellor (Member) commented Oct 29, 2025

  • Don't call basic test files individually
  • Split the long entrypoints test group into 2 shards
  • Move the unit tests into a unit directory so we don't have to manually ignore everything else
  • Move openai/tool_parsers into the unit tests because that's what they are

Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>
@mergify (bot) added the ci/build and llama (Related to Llama models) labels Oct 29, 2025
@gemini-code-assist (bot) left a comment


Code Review

This pull request focuses on reorganizing the test structure for better maintainability and efficiency. It involves moving unit tests into a dedicated directory, splitting a large test group into shards, and updating test pipeline configurations. The review identifies a critical issue related to potential test failures due to missing multi-GPU configurations.

Comment on lines +11 to -17


@pytest.mark.parametrize("tp_size", [1, 2])
@pytest.mark.parametrize("backend", ["mp", "ray"])
@create_new_process_for_each_test()
def test_collective_rpc(tp_size, backend, monkeypatch):
if torch.cuda.device_count() < tp_size:
pytest.skip(f"Not enough GPUs for tensor parallelism {tp_size}")

Severity: critical

The test_collective_rpc test is marked as a multi-GPU test, but the pytestmark is defined after the imports. This could lead to tests being run without the multi-gpu setup if pytest runs before the mark is applied. It's better to define pytestmark at the top of the file to ensure it's always applied. This is a critical issue as it can lead to tests passing incorrectly when they should be failing due to insufficient GPU resources.

pytestmark = pytest.mark.multi_gpu_test(num_gpus=2)


from ...utils import create_new_process_for_each_test


@pytest.mark.parametrize("tp_size", [1, 2])

@hmellor (Member, Author) replied

We don't do this anywhere else where we use pytestmark
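
A minimal standalone sketch (not code from this PR) of why the placement is not load-bearing: pytest reads pytestmark as an attribute of the already-imported module at collection time, so where the assignment sits relative to the imports does not change which tests receive the mark.

import pytest


def test_example():
    assert True


# Defined after the test function: pytest still reads the module attribute
# `pytestmark` when it collects this file, so the mark applies to every test
# in the module regardless of where the assignment appears.
pytestmark = pytest.mark.slow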

@chatgpt-codex-connector (bot) left a comment

💡 Codex Review

Here are some automated review suggestions for this pull request.


Comment on lines 4 to 18
import pytest
import torch

from vllm import LLM

from ...utils import create_new_process_for_each_test

pytestmark = pytest.mark.multi_gpu_test(num_gpus=2)


@pytest.mark.parametrize("tp_size", [1, 2])
@pytest.mark.parametrize("backend", ["mp", "ray"])
@create_new_process_for_each_test()
def test_collective_rpc(tp_size, backend, monkeypatch):
    if torch.cuda.device_count() < tp_size:
        pytest.skip(f"Not enough GPUs for tensor parallelism {tp_size}")
    if tp_size == 1 and backend == "ray":
        pytest.skip("Skip duplicate test case")


P1: Removing the device-count guard lets the multi-GPU test run on a single GPU

The test previously skipped when torch.cuda.device_count() < tp_size, but the new module-level pytestmark = pytest.mark.multi_gpu_test(num_gpus=2) does not perform that check—the multi_gpu_test skip logic lives in the decorator in tests/utils.py, not in a pytest marker. As a result, on environments with only one GPU this test will now run and attempt to create an LLM with tensor_parallel_size=2, causing a failure instead of a skip. To restore the behaviour, either keep the explicit device-count guard or apply the multi_gpu_test decorator (or its marks) so the skip happens before the LLM is constructed.
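
A hedged sketch of the second option, making the marker itself perform the skip via a conftest.py collection hook. The marker name multi_gpu_test and its num_gpus kwarg come from the snippet above; the hook body is an assumption, not vLLM's actual implementation (which, per the review, lives in the decorator in tests/utils.py).

import pytest
import torch


def pytest_collection_modifyitems(config, items):
    # Hypothetical conftest.py hook: skip any test carrying the
    # multi_gpu_test marker when the machine has too few GPUs, before any
    # fixture runs or an LLM is constructed.
    for item in items:
        marker = item.get_closest_marker("multi_gpu_test")
        if marker is None:
            continue
        num_gpus = marker.kwargs.get("num_gpus", 2)
        if torch.cuda.device_count() < num_gpus:
            item.add_marker(pytest.mark.skip(reason=f"Requires {num_gpus} GPUs"))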


@bbrowning (Contributor) commented

  • Move openai/tool_parsers into the unit tests because that's what they are

This is mostly correct, except for test_hermes_tool_parser.py, which actually spins up vLLM in some of its tests. Those tests may need to be split out from the rest here; generally, tool parser tests that spin up a server live under tests/tool_use instead. The hermes test file has some of both, so it could either be moved entirely for now or split into two files in the appropriate places.
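
To illustrate the split being proposed, a hedged sketch of the two kinds of test; the hand-rolled parsing, model id, and prompt below are assumptions for illustration, not code from test_hermes_tool_parser.py.

import json

import pytest


def test_parse_hermes_style_tool_call():
    # Unit-style: pure string parsing with no engine involved, so it fits the
    # new unit directory. The <tool_call> wrapper mirrors Hermes-style output;
    # the parsing here is deliberately hand-rolled, not the repo's parser.
    output = '<tool_call>{"name": "get_weather", "arguments": {"city": "Oslo"}}</tool_call>'
    payload = json.loads(output.removeprefix("<tool_call>").removesuffix("</tool_call>"))
    assert payload["name"] == "get_weather"


@pytest.mark.skip(reason="illustrative sketch; needs a GPU and model weights")
def test_tool_call_with_real_engine():
    # Integration-style: constructing an LLM (or starting an OpenAI-compatible
    # server) is the kind of test that belongs under tests/tool_use instead.
    from vllm import LLM

    llm = LLM(model="NousResearch/Hermes-2-Pro-Llama-3-8B")  # assumed model id
    outputs = llm.generate(["What's the weather in Oslo?"])
    assert outputs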

@hmellor (Member, Author) commented Oct 29, 2025

Thanks for the additional context. My reasoning for moving them there was that they were run in the Entrypoints Unit Tests job.

I'll move the ones that start the server into tests/tool_use

Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>

Labels: ci/build, llama (Related to Llama models), tool-calling