feat: Add pipeline run parallelism config #12442

sduvvuri1603 · 2025-11-12T21:20:56Z

Summary

Replace the previous semaphore/mutex knobs with a single pipeline_run_parallelism option on dsl.PipelineConfig. This lets the API server own the Argo semaphore lifecycles instead of expecting users to edit shared ConfigMaps—eliminating a Kubernetes-heavy workflow and ensuring keys align to <pipeline>/<version>.
Thread the new field through SDK, compiler, and backend so the requested concurrency cap lands in Argo’s spec.parallelism.
Add the pipeline_with_run_parallelism sample (three-item ParallelFor) to exercise the setting while leaving the workspace fixture focused on workspace behaviour.
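
As a rough illustration of the last step in that chain, the sketch below copies an optional limit into the `spec.parallelism` field of an Argo Workflow manifest represented as a plain dict. The backend code itself is not shown in this thread; the function name `apply_run_parallelism` and the manifest contents are hypothetical.

```python
import copy
from typing import Any, Dict, Optional


def apply_run_parallelism(workflow: Dict[str, Any],
                          limit: Optional[int]) -> Dict[str, Any]:
    """Return a copy of the workflow manifest with spec.parallelism set.

    When limit is None the copy is returned unchanged (no cap requested).
    """
    result = copy.deepcopy(workflow)
    if limit is not None:
        result.setdefault('spec', {})['parallelism'] = limit
    return result


# Hypothetical minimal manifest; real compiled workflows carry many more fields.
workflow = {
    'apiVersion': 'argoproj.io/v1alpha1',
    'kind': 'Workflow',
    'spec': {'entrypoint': 'main'},
}
capped = apply_run_parallelism(workflow, 2)
```

Returning a copy keeps the input manifest untouched, which makes the "no limit" and "limit set" cases easy to compare side by side.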

Validation

SDK and backend goldens now include the updated sample, showing consistent IR and Argo outputs with the parallelism limit.
Built custom API server and driver images from this branch, loaded them into a kind cluster, ran the sample, and confirmed that the number of simultaneously running component pods never exceeded the configured limit.
Added the parallelism validation helper to the e2e suite (e2e_utils.go + invocation in pipeline_e2e_test.go), rebuilt the test cluster with the fresh backend images, exercised the focused pipeline_run_parallelism scenario, and then ran the end-to-end suite to confirm the new check passes with the concurrency cap enforced.
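
The check described above (that the number of simultaneously running component pods never exceeds the configured limit) can be sketched offline as a sweep over pod start and finish times. This is a minimal illustration, not the helper added in e2e_utils.go; the function name `max_concurrent` and the sample timestamps are hypothetical.

```python
from typing import Iterable, Tuple


def max_concurrent(intervals: Iterable[Tuple[float, float]]) -> int:
    """Return the peak number of overlapping [start, end) intervals."""
    events = []
    for start, end in intervals:
        if end < start:
            raise ValueError('interval end must not precede start')
        events.append((start, 1))   # pod starts running
        events.append((end, -1))    # pod finishes
    # Sort by time; at equal timestamps finishes (-1) sort before starts (+1),
    # so a pod that ends exactly when another begins is not counted as overlap.
    events.sort(key=lambda e: (e[0], e[1]))
    running = peak = 0
    for _, delta in events:
        running += delta
        peak = max(peak, running)
    return peak


# Hypothetical run: three tasks observed under a cap of 2.
pod_intervals = [(0.0, 10.0), (1.0, 9.0), (9.0, 15.0)]
limit = 2
observed_peak = max_concurrent(pod_intervals)
assert observed_peak <= limit
```

A check of this shape only says something useful when the pipeline has more runnable tasks than the limit; otherwise the cap is never reached.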

Follow up to PR - remove unused semaphore_key and mutex_name fields

google-oss-prow · 2025-11-12T21:21:02Z

[APPROVALNOTIFIER] This PR is NOT APPROVED

This pull-request has been approved by:
Once this PR has been reviewed and has the lgtm label, please assign chensun for approval. For more information see the Kubernetes Code Review Process.

The full list of commands accepted by this bot can be found here.

Needs approval from an approver in each of these files:

OWNERS

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

google-oss-prow · 2025-11-12T21:21:06Z

Hi @sduvvuri1603. Thanks for your PR.

I'm waiting for a kubeflow member to verify that this patch is reasonable to test. If it is, they should reply with /ok-to-test on its own line. Until that is done, I will not automatically test new commits in this PR, but the usual testing commands by org members will still work. Regular contributors should join the org to skip this step.

Once the patch is verified, the new status will be reflected by the ok-to-test label.

I understand the commands that are listed here.

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository.

alyssacgoins · 2025-11-13T15:39:46Z

/retest

google-oss-prow · 2025-11-17T14:35:21Z

@sduvvuri1603: Cannot trigger testing until a trusted user reviews the PR and leaves an /ok-to-test message.

In response to this:

/retest

hbelmiro · 2025-11-17T14:37:52Z

/ok-to-test

hbelmiro · 2025-11-17T14:37:57Z

/retest

Signed-off-by: sduvvuri1603 <sduvvuri@redhat.com>

VaniHaripriya · 2025-11-19T23:50:48Z

README.md


 Consult the [Python SDK reference docs](https://kubeflow-pipelines.readthedocs.io/en/stable/) when writing pipelines using the Python SDK.

+> New in master: `dsl.PipelineConfig` now accepts an optional `pipeline_run_parallelism` integer to cap concurrent task execution for a run. The backend stores the requested limit in a shared ConfigMap and surfaces it to Argo Workflows via `spec.parallelism`.


It may be more appropriate to add this entry to the CHANGELOG.

Sure, but would this go in a new section called "Unreleased Features"? I only see release version details in the file.

I believe the PR will be included here as part of the release process. @mprahl, could you confirm if that’s correct?

proposals/11875-pipeline-workspace/README.md

Signed-off-by: Sruthi Duvvuri <sduvvuri@redhat.com>

Signed-off-by: sduvvuri1603 <sduvvuri@redhat.com>

nsingla · 2025-11-21T18:50:21Z

sdk/python/kfp/dsl/pipeline_config.py

+
+    @pipeline_run_parallelism.setter
+    def pipeline_run_parallelism(self, value: Optional[int]) -> None:  # pylint: disable=attribute-defined-outside-init
+        if value is None:


Is this required, given that https://github.com/kubeflow/pipelines/pull/12442/files#diff-631f096829954a5226e87a37347ab04d3e551465fdf0371b37fd9f9363c0c156R120 will set it to None if value is None?

Yes, that's right about the serialization part, but we need this guard to prevent a crash during initialization. Since __init__ passes None by default, removing this check would cause it to hit the isinstance line and fail immediately. The guard just ensures we can safely create the object with no value set.

I guess then you can just add one if statement:

    if value:
        if not isinstance(value, int):
            raise ValueError(
                'pipeline_run_parallelism must be an integer if specified.')
        if value <= 0:
            raise ValueError(
                'pipeline_run_parallelism must be a positive integer.')
    self._pipeline_run_parallelism = value

Updated code with suggested change
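
For reference, the validation discussed in this thread can be sketched as a standalone class. This is an illustrative stand-in, not the code in pipeline_config.py: the class name `ParallelismConfig` is hypothetical, and it uses an explicit `is not None` check rather than a truthiness check, so that 0 is rejected instead of skipping validation. Rejecting `bool` values is an additional assumption.

```python
from typing import Optional


class ParallelismConfig:
    """Hypothetical stand-in for the relevant part of dsl.PipelineConfig."""

    def __init__(self, pipeline_run_parallelism: Optional[int] = None):
        # Route through the setter so the default None is handled in one place.
        self.pipeline_run_parallelism = pipeline_run_parallelism

    @property
    def pipeline_run_parallelism(self) -> Optional[int]:
        return self._pipeline_run_parallelism

    @pipeline_run_parallelism.setter
    def pipeline_run_parallelism(self, value: Optional[int]) -> None:
        if value is not None:
            # bool is a subclass of int, so exclude it explicitly (assumption).
            if isinstance(value, bool) or not isinstance(value, int):
                raise ValueError(
                    'pipeline_run_parallelism must be an integer if specified.')
            if value <= 0:
                raise ValueError(
                    'pipeline_run_parallelism must be a positive integer.')
        self._pipeline_run_parallelism = value
```

With this shape, `ParallelismConfig()` stores None, `ParallelismConfig(2)` stores 2, and `ParallelismConfig(0)` raises ValueError.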

nsingla · 2025-11-21T18:51:42Z

test_data/sdk_compiled_pipelines/valid/critical/pipeline_with_workspace.py

    name="pipeline-with-workspace",
    description="A pipeline that demonstrates workspace functionality",
    pipeline_config=dsl.PipelineConfig(
+        pipeline_run_parallelism=3,


Can we set this to None here, given that we have an explicit test for positive values?

Yep, that makes sense. Will set this to None.

nsingla · 2025-11-21T18:54:26Z

@sduvvuri1603 can you please add to the PR description what this config is supposed to do, along with a section about how you've validated the functionality?

nsingla · 2025-11-21T18:57:10Z

test_data/sdk_compiled_pipelines/valid/essential/pipeline_with_run_parallelism.py

+
+@dsl.pipeline(
+    name='pipeline-with-run-parallelism',
+    pipeline_config=dsl.PipelineConfig(pipeline_run_parallelism=7),


Isn't 7 too high when this pipeline has just 1 task? Maybe you should add more components to it, or add a ParallelFor loop that iterates over more than pipeline_run_parallelism constant values, so that we can validate that the config actually works.
Also, what validation logic did you add to confirm the number of tasks created for a pipeline with this config?

This specific test case is part of the SDK compilation suite; it verifies that the pipeline_run_parallelism field is correctly serialized from the Python SDK into the compiled YAML's PlatformSpec. It relies on the golden-file comparison for validation (ensuring the YAML contains a correctly populated pipelineRunParallelism: 7).

My understanding is that it is not related to the actual runtime limit, which is covered by the backend integration tests where we submit these workflows to Argo. Please let me know if this is correct!

Any pipeline YAML file in this directory will be part of the end-to-end tests, so yes, the workflow will get submitted to Argo.

Updated the code with a ParallelFor loop and reduced pipeline_run_parallelism to 2.
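
The reasoning in this thread (a cap is only observable when there are more runnable items than the limit) can be illustrated with a local analogy. The sketch below is not KFP or Argo code: it uses a thread pool whose size stands in for pipeline_run_parallelism, and the names `PeakTracker` and `run_with_cap` are hypothetical.

```python
import threading
import time
from concurrent.futures import ThreadPoolExecutor
from typing import List, Sequence, Tuple


class PeakTracker:
    """Context manager that records the peak number of concurrent entries."""

    def __init__(self) -> None:
        self._lock = threading.Lock()
        self.running = 0
        self.peak = 0

    def __enter__(self) -> 'PeakTracker':
        with self._lock:
            self.running += 1
            self.peak = max(self.peak, self.running)
        return self

    def __exit__(self, *exc_info) -> None:
        with self._lock:
            self.running -= 1


def run_with_cap(items: Sequence[str], limit: int) -> Tuple[List[str], int]:
    """Run one simulated task per item with at most `limit` at a time."""
    tracker = PeakTracker()

    def task(item: str) -> str:
        with tracker:
            time.sleep(0.05)  # pretend to do work
            return item.upper()

    # The pool size plays the role of the concurrency cap.
    with ThreadPoolExecutor(max_workers=limit) as pool:
        results = list(pool.map(task, items))
    return results, tracker.peak


# Three items with a cap of 2, mirroring the shape of the updated sample.
results, peak = run_with_cap(['a', 'b', 'c'], limit=2)
```

With a single item the recorded peak could never exceed 1, so a limit of 7 would pass trivially; three items against a limit of 2 give the cap something to enforce.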

Signed-off-by: sduvvuri1603 <sduvvuri@redhat.com>

nsingla

/lgtm

google-oss-prow bot added the do-not-merge/work-in-progress label Nov 12, 2025

google-oss-prow bot requested review from HumairAK, droctothorpe, mprahl and zazulam November 12, 2025 21:21

google-oss-prow bot added needs-ok-to-test size/L labels Nov 12, 2025

sduvvuri1603 force-pushed the feature/pipeline-run-parallelism branch 2 times, most recently from 99f2fc8 to d34a1b2 Compare November 12, 2025 21:22

sduvvuri1603 force-pushed the feature/pipeline-run-parallelism branch 7 times, most recently from 82756e1 to 60a35d8 Compare November 14, 2025 21:27

google-oss-prow bot added ok-to-test and removed needs-ok-to-test labels Nov 17, 2025

sduvvuri1603 marked this pull request as ready for review November 17, 2025 17:06

google-oss-prow bot removed the do-not-merge/work-in-progress label Nov 17, 2025

google-oss-prow bot requested review from DharmitD, alyssacgoins and gmfrasca November 17, 2025 17:06

sduvvuri1603 marked this pull request as draft November 17, 2025 17:06

google-oss-prow bot assigned nsingla Nov 19, 2025

sduvvuri1603 added 2 commits November 19, 2025 16:15

feat: add pipeline run parallelism config

73307bc

Signed-off-by: sduvvuri1603 <sduvvuri@redhat.com>

Add cluster RBAC for pipeline parallelism configmap

f7e180e

Signed-off-by: sduvvuri1603 <sduvvuri@redhat.com>

sduvvuri1603 force-pushed the feature/pipeline-run-parallelism branch from 60a35d8 to 39fb3dd Compare November 19, 2025 21:24

Allow configmap manager RBAC to follow custom namespace

c587b03

Signed-off-by: sduvvuri1603 <sduvvuri@redhat.com>

sduvvuri1603 force-pushed the feature/pipeline-run-parallelism branch from 39fb3dd to c587b03 Compare November 19, 2025 21:32

sduvvuri1603 added 2 commits November 19, 2025 16:43

Add essential pipeline for run parallelism config

41df1d3

Signed-off-by: sduvvuri1603 <sduvvuri@redhat.com>

Update compiler goldens for run parallelism

fe21d6d

Signed-off-by: sduvvuri1603 <sduvvuri@redhat.com>

google-oss-prow bot added size/XL and removed size/L labels Nov 19, 2025

sduvvuri1603 requested a review from nsingla November 19, 2025 22:32

VaniHaripriya reviewed Nov 20, 2025

sduvvuri1603 added 4 commits November 20, 2025 11:08

Merge branch 'master' into feature/pipeline-run-parallelism

b56c3bd

Signed-off-by: Sruthi Duvvuri <sduvvuri@redhat.com>

Docs: keep workspace proposal focused

28aed8e

Signed-off-by: sduvvuri1603 <sduvvuri@redhat.com>

Refresh compiler goldens

90b4b75

Signed-off-by: sduvvuri1603 <sduvvuri@redhat.com>

Merge branch 'master' into feature/pipeline-run-parallelism

9fb894f

nsingla reviewed Nov 21, 2025

sduvvuri1603 added 3 commits November 21, 2025 20:20

Reset workspace pipeline parallelism

42c13d3

Signed-off-by: sduvvuri1603 <sduvvuri@redhat.com>

Merge branch 'master' into feature/pipeline-run-parallelism

665c9ee

Refine pipeline_run_parallelism tests

8ccafa9

Signed-off-by: sduvvuri1603 <sduvvuri@redhat.com>

google-oss-prow bot added size/XXL and removed size/XL labels Nov 26, 2025

test: validate pipeline run parallelism e2e

dcbc311

Signed-off-by: sduvvuri1603 <sduvvuri@redhat.com>

sduvvuri1603 force-pushed the feature/pipeline-run-parallelism branch from 5edaa23 to dcbc311 Compare November 26, 2025 14:49

Merge branch 'master' into feature/pipeline-run-parallelism

df17e24

nsingla approved these changes Dec 2, 2025

google-oss-prow bot added the lgtm label Dec 2, 2025

