Conformance: Adds Data Parallelism Test #1769

danehans · 2025-10-24T21:39:28Z

What type of PR is this?
/kind test
/area conformance-test

What this PR does / why we need it:

Adds a conformance test that tests routing to endpoints with data parallelism enabled.
Bumps the EPP image tag in conformance to v20251023-d788a2c.

Which issue(s) this PR fixes:

Does this PR introduce a user-facing change?:

Conformance: Adds test to exercise data parallelism routing. Bumps confomrance EPP image tag to `v20251023-d788a2c`.

k8s-ci-robot · 2025-10-24T21:39:33Z

@danehans: The label(s) kind/test cannot be applied, because the repository doesn't have them.

In response to this:

What type of PR is this?
/kind test
/area conformance-test

What this PR does / why we need it:

Adds a conformance test that tests routing to endpoints with data parallelism enabled.

Bumps the EPP image tag in conformance to v20251023-d788a2c.

Which issue(s) this PR fixes:

Fixes #1680

Does this PR introduce a user-facing change?:
Conformance: Adds test to exercise data parallelism routing. Bumps confomrance EPP image tag to `v20251023-d788a2c`.

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes-sigs/prow repository.

k8s-ci-robot · 2025-10-24T21:39:36Z

[APPROVALNOTIFIER] This PR is APPROVED

This pull-request has been approved by: danehans

The full list of commands accepted by this bot can be found here.

The pull request process is described here

Needs approval from an approver in each of these files:

~~OWNERS~~ [danehans]

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

netlify · 2025-10-24T21:40:01Z

✅ Deploy Preview for gateway-api-inference-extension ready!

Name	Link
🔨 Latest commit	`78e6755`
🔍 Latest deploy log	https://app.netlify.com/projects/gateway-api-inference-extension/deploys/68ff8d445ad39a00082340c3
😎 Deploy Preview	https://deploy-preview-1769--gateway-api-inference-extension.netlify.app
📱 Preview on mobile	Toggle QR Code... Use your smartphone camera to open QR code link.

To edit notification comments on pull requests, go to your Netlify project configuration.

danehans · 2025-10-24T21:40:30Z

/cc @shmuelk @robscott @zetxqx

Signed-off-by: Daneyon Hansen <daneyon.hansen@solo.io>

shmuelk · 2025-10-28T15:01:04Z

conformance/resources/base.yaml

+  targetPorts:
+    - number: 3000
+    - number: 3002
+    - number: 3004


While this is an interesting configuration, I don't think you could do this with a real vLLM server

shmuelk · 2025-10-28T15:02:41Z

conformance/resources/base.yaml

+    spec:
+      containers:
+      - name: echoserver-3000
+        image: gcr.io/k8s-staging-gateway-api/echo-basic:v20240412-v1.0.0-394-g40c666fd


Other than your use of non-contiguous ports here, why not not use llm-d-inference-sim which supports --data-parallel-size=N ?

shmuelk · 2025-10-28T15:04:19Z

conformance/tests/gateway_following_epp_routing_dp.go

+			{
+				name:                                  "DP routes only to all pods (EPP returns all; ranks balanced internally)",
+				podIPsToBeReturnedByEPP:               []string{podIPs[0], podIPs[1], podIPs[2]},
+				expectAllRequestsRoutedWithinPodNames: []string{podNames[0], podNames[1], podNames[2]},


You are only checking Pod IPs. Shouldn't youalso be checking Pod Ports?

shmuelk · 2025-10-28T15:07:01Z

pkg/epp/scheduling/framework/plugins/test/filter/request_header_based_filter.go


-// Filter selects pods that match the IP addresses specified in the request header.
+// Filter selects pods whose IPs match any value in the "test-epp-endpoint-selection" header.
+// Values may be "IP" or "IP:port"; ports (ranks) are ignored here because DP fan-out happens later.


Why do you want to ignore the port? I would have thought you would add code that if a port was specified, it filters by both IP and port.

Without that how do you know that DP really worked and sent requests to a "non-base" of the model server?

k8s-ci-robot added the area/conformance-test Issues or PRs related to Conformance tests. label Oct 24, 2025

k8s-ci-robot added the cncf-cla: yes Indicates the PR's author has signed the CNCF CLA. label Oct 24, 2025

k8s-ci-robot requested a review from ahg-g October 24, 2025 21:39

k8s-ci-robot requested a review from elevran October 24, 2025 21:39

k8s-ci-robot added approved Indicates a PR has been approved by an approver from all required OWNERS files. size/L Denotes a PR that changes 100-499 lines, ignoring generated files. labels Oct 24, 2025

k8s-ci-robot requested review from robscott, shmuelk and zetxqx October 24, 2025 21:40

Conformance: Adds Data Parallelism Test

78e6755

Signed-off-by: Daneyon Hansen <daneyon.hansen@solo.io>

danehans force-pushed the issue_1680 branch from 4b2a410 to 78e6755 Compare October 27, 2025 15:18

danehans mentioned this pull request Oct 27, 2025

Conformance: Support Pod Port in Data Parallelism Tests #1773

Open

shmuelk reviewed Oct 28, 2025

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

Conformance: Adds Data Parallelism Test #1769

Conformance: Adds Data Parallelism Test #1769

danehans commented Oct 24, 2025

Uh oh!

k8s-ci-robot commented Oct 24, 2025

Uh oh!

k8s-ci-robot commented Oct 24, 2025

Uh oh!

netlify bot commented Oct 24, 2025 •

edited

Loading

Uh oh!

danehans commented Oct 24, 2025

Uh oh!

shmuelk Oct 28, 2025

Uh oh!

shmuelk Oct 28, 2025

Uh oh!

shmuelk Oct 28, 2025

Uh oh!

shmuelk Oct 28, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Uh oh!

Conformance: Adds Data Parallelism Test #1769

Are you sure you want to change the base?

Conformance: Adds Data Parallelism Test #1769

Conversation

danehans commented Oct 24, 2025

Uh oh!

k8s-ci-robot commented Oct 24, 2025

Uh oh!

k8s-ci-robot commented Oct 24, 2025

Uh oh!

netlify bot commented Oct 24, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

✅ Deploy Preview for gateway-api-inference-extension ready!

Uh oh!

danehans commented Oct 24, 2025

Uh oh!

shmuelk Oct 28, 2025

Choose a reason for hiding this comment

Uh oh!

shmuelk Oct 28, 2025

Choose a reason for hiding this comment

Uh oh!

shmuelk Oct 28, 2025

Choose a reason for hiding this comment

Uh oh!

shmuelk Oct 28, 2025

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

netlify bot commented Oct 24, 2025 •

edited

Loading