Skip to content

Conversation

@danehans
Copy link
Contributor

What type of PR is this?
/kind test
/area conformance-test

What this PR does / why we need it:

  • Adds a conformance test that tests routing to endpoints with data parallelism enabled.
  • Bumps the EPP image tag in conformance to v20251023-d788a2c.

Which issue(s) this PR fixes:

Fixes #1680

Does this PR introduce a user-facing change?:

Conformance: Adds test to exercise data parallelism routing. Bumps confomrance EPP image tag to `v20251023-d788a2c`.

@k8s-ci-robot k8s-ci-robot added the area/conformance-test Issues or PRs related to Conformance tests. label Oct 24, 2025
@k8s-ci-robot
Copy link
Contributor

@danehans: The label(s) kind/test cannot be applied, because the repository doesn't have them.

In response to this:

What type of PR is this?
/kind test
/area conformance-test

What this PR does / why we need it:

  • Adds a conformance test that tests routing to endpoints with data parallelism enabled.
  • Bumps the EPP image tag in conformance to v20251023-d788a2c.

Which issue(s) this PR fixes:

Fixes #1680

Does this PR introduce a user-facing change?:

Conformance: Adds test to exercise data parallelism routing. Bumps confomrance EPP image tag to `v20251023-d788a2c`.

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes-sigs/prow repository.

@k8s-ci-robot k8s-ci-robot added the cncf-cla: yes Indicates the PR's author has signed the CNCF CLA. label Oct 24, 2025
@k8s-ci-robot k8s-ci-robot requested a review from ahg-g October 24, 2025 21:39
@k8s-ci-robot
Copy link
Contributor

[APPROVALNOTIFIER] This PR is APPROVED

This pull-request has been approved by: danehans

The full list of commands accepted by this bot can be found here.

The pull request process is described here

Needs approval from an approver in each of these files:

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

@k8s-ci-robot k8s-ci-robot requested a review from elevran October 24, 2025 21:39
@k8s-ci-robot k8s-ci-robot added approved Indicates a PR has been approved by an approver from all required OWNERS files. size/L Denotes a PR that changes 100-499 lines, ignoring generated files. labels Oct 24, 2025
@netlify
Copy link

netlify bot commented Oct 24, 2025

Deploy Preview for gateway-api-inference-extension ready!

Name Link
🔨 Latest commit 78e6755
🔍 Latest deploy log https://app.netlify.com/projects/gateway-api-inference-extension/deploys/68ff8d445ad39a00082340c3
😎 Deploy Preview https://deploy-preview-1769--gateway-api-inference-extension.netlify.app
📱 Preview on mobile
Toggle QR Code...

QR Code

Use your smartphone camera to open QR code link.

To edit notification comments on pull requests, go to your Netlify project configuration.

@danehans
Copy link
Contributor Author

/cc @shmuelk @robscott @zetxqx

Signed-off-by: Daneyon Hansen <daneyon.hansen@solo.io>
targetPorts:
- number: 3000
- number: 3002
- number: 3004
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

While this is an interesting configuration, I don't think you could do this with a real vLLM server

spec:
containers:
- name: echoserver-3000
image: gcr.io/k8s-staging-gateway-api/echo-basic:v20240412-v1.0.0-394-g40c666fd
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Other than your use of non-contiguous ports here, why not not use llm-d-inference-sim which supports --data-parallel-size=N ?

{
name: "DP routes only to all pods (EPP returns all; ranks balanced internally)",
podIPsToBeReturnedByEPP: []string{podIPs[0], podIPs[1], podIPs[2]},
expectAllRequestsRoutedWithinPodNames: []string{podNames[0], podNames[1], podNames[2]},
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

You are only checking Pod IPs. Shouldn't youalso be checking Pod Ports?


// Filter selects pods that match the IP addresses specified in the request header.
// Filter selects pods whose IPs match any value in the "test-epp-endpoint-selection" header.
// Values may be "IP" or "IP:port"; ports (ranks) are ignored here because DP fan-out happens later.
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Why do you want to ignore the port? I would have thought you would add code that if a port was specified, it filters by both IP and port.

Without that how do you know that DP really worked and sent requests to a "non-base" of the model server?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

approved Indicates a PR has been approved by an approver from all required OWNERS files. area/conformance-test Issues or PRs related to Conformance tests. cncf-cla: yes Indicates the PR's author has signed the CNCF CLA. size/L Denotes a PR that changes 100-499 lines, ignoring generated files.

Projects

None yet

Development

Successfully merging this pull request may close these issues.

Conformance: Add a complience test for Gateway support of multiple targetPorts in an InferencePool

3 participants