Skip to content

Conversation

@warrenzhu25
Copy link
Contributor

@warrenzhu25 warrenzhu25 commented Jan 6, 2026

What changes were proposed in this pull request?

This PR adds a new parentIds field to the StageData model and its corresponding protobuf representation.

Specific changes include:

  • Updated store_types.proto to include repeated int64 parent_ids.
  • Added parentIds to v1.StageData in api.scala as a defaulted parameter at the end of the constructor to maintain source compatibility.
  • Updated LiveStage, AppStatusStore, and StageDataWrapperSerializer to handle parentIds.
  • Added tests for parentIds in KVStoreProtobufSerializerSuite.

Why are the changes needed?

The parentIds field is necessary for tracking stage dependencies in the Spark UI and other status-related tools.

Does this PR introduce any user-facing change?

No, this change updates internal models and the REST API but does not alter the primary user-facing UI or behavior.

How was this patch tested?

The changes were tested by running the KVStoreProtobufSerializerSuite:

build/sbt "core/testOnly org.apache.spark.status.protobuf.KVStoreProtobufSerializerSuite"

All 20 tests passed.

Was this patch authored or co-authored using generative AI tooling?

Yes, assisted by Gemini.

@github-actions
Copy link

github-actions bot commented Jan 6, 2026

JIRA Issue Information

=== Improvement SPARK-54921 ===
Summary: Add parentIds in StageData
Assignee: None
Status: Open
Affected: ["4.1.0"]


This comment was automatically generated by GitHub Actions

@warrenzhu25 warrenzhu25 changed the title [SPARK-XXXXX][CORE] Add parentIds to StageData and fix compilation in JobPage [SPARK-54921][CORE] Add parentIds to StageData Jan 6, 2026
### What changes were proposed in this pull request?

This PR adds a new `parentIds` field to the `StageData` model and its corresponding protobuf representation.

Specific changes include:
- Updated `store_types.proto` to include `repeated int64 parent_ids`.
- Added `parentIds` to `v1.StageData` in `api.scala` as a defaulted parameter at the end of the constructor to maintain source compatibility.
- Updated `LiveStage`, `AppStatusStore`, and `StageDataWrapperSerializer` to handle `parentIds`.
- Added tests for `parentIds` in `KVStoreProtobufSerializerSuite`.

### Why are the changes needed?

The `parentIds` field is necessary for tracking stage dependencies in the Spark UI and other status-related tools.

### Does this PR introduce _any_ user-facing change?

No, this change updates internal models and the REST API but does not alter the primary user-facing UI or behavior.

### How was this patch tested?

The changes were tested by running the `KVStoreProtobufSerializerSuite`:
```
build/sbt "core/testOnly org.apache.spark.status.protobuf.KVStoreProtobufSerializerSuite"
```
All 20 tests passed.

### Was this patch authored or co-authored using generative AI tooling?

Yes, assisted by Gemini.
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant