Jet Stateful Stage Documentation [CTT-811] #2037

Open

shultseva wants to merge 10 commits into hazelcast:main from shultseva:stateful_force_eviction

Contributor

shultseva commented Dec 15, 2025

https://github.com/hazelcast/hazelcast-mono/pull/5716


          stateful docs

e8d837a

shultseva requested a review from a team as a code owner

December 15, 2025 12:20

netlify bot commented Dec 15, 2025

✅ Deploy Preview for hardcore-allen-f5257d ready!

Name	Link
🔨 Latest commit	`97fc680`
🔍 Latest deploy log	https://app.netlify.com/projects/hardcore-allen-f5257d/deploys/6945820b5bef2a0008e3ade3
😎 Deploy Preview	https://deploy-preview-2037--hardcore-allen-f5257d.netlify.app

Rob-Hazelcast self-assigned this

shultseva requested review from k-jamroz and rajbarua

December 15, 2025 14:14


          Merge branch 'main' into stateful_force_eviction

bd6c8c7

k-jamroz reviewed

docs/modules/pipelines/pages/transforms.adoc Outdated

Comment on lines 768 to 775

    
              The major use case of stateful mapping is recognizing a pattern in the

              event stream, such as matching start-transaction with end-transaction

              events based on an event correlation ID. More generally, you can

              implement any kind of state machine and detect patterns in an input of

              any complexity.

              As with other stateful operations, you can also use a `groupingKey` to

              have a unique state per key.

Contributor

k-jamroz Dec 17, 2025

these paragraphs ("The major use case...", "As with other stateful operations...") should be first - they are high-level description

Contributor Author

shultseva Dec 18, 2025

I disagree. I think it would be better to first provide a general description of the parameter, explain the difference between keyed and non-keyed cases, and then describe the general pattern and how it can be solved using this method. Let’s leave it to @Rob-Hazelcast to decide.

Contributor

Rob-Hazelcast Dec 19, 2025

Either works. I think the current structure is good because the high level use case description leads into the example code. The alternative would work better if it was broken up into child sections - intro, detailed description, example.

Contributor

k-jamroz Dec 19, 2025

I prefer a top-to-bottom approach in documentation (first what it is for, then how it works), but that is not a hill I am willing to die on

k-jamroz reviewed

docs/modules/pipelines/pages/transforms.adoc

    
              The state object is created using `createFn` and is passed to `mapFn` together with each input item.

              The function can update the state and emit a single output item (or null if no output is needed).

              For each grouping key, Jet maintains a dedicated state object created by the supplied `createFn`.

Contributor

k-jamroz Dec 17, 2025

nit: I'd consider merging some of the paragraphs - currently there are a lot of short paragraphs, which makes it harder to scan the text

Contributor

k-jamroz Dec 17, 2025

or possibly group the keyed and non-keyed descriptions into bullet points or subsections

Contributor

Rob-Hazelcast Dec 19, 2025

I think the short paragraphs are fine. Bullets are a good idea and it might be nice to resolve the repetition ("the function can update the state and..."), but I'm not too worried about it.

NB: Sentences on adjacent lines are combined into paragraphs in the output, so they're not as short as they appear here - not sure if that was your concern or if you think they're too short in the output too.

Contributor

k-jamroz Dec 19, 2025

The "The non-keyed mapStateful" paragraph applies to the non-keyed variant, while the next 5 paragraphs apply ONLY to the keyed variant (TTL, eviction, forced eviction). This is not visible in the text, which makes it a bit confusing and harder to understand
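
For readers of this thread, the keyed/non-keyed distinction under discussion can be sketched in plain Java. This is only an illustrative model, not the Jet API: the class `StatefulMapModel` and its nested types are hypothetical, and the keyed `mapFn` here takes two arguments, whereas the keyed `mapFn` quoted in this PR also receives the key (`(state, id, event)`). TTL and eviction are deliberately left out.

```java
import java.util.HashMap;
import java.util.Map;
import java.util.function.BiFunction;
import java.util.function.Function;
import java.util.function.Supplier;

// Hypothetical model of the state handling described in the docs text.
public class StatefulMapModel {

    /** Non-keyed variant: createFn runs once and every item sees the same state. */
    public static final class NonKeyed<S, T, R> {
        private final S state;
        private final BiFunction<S, T, R> mapFn;

        public NonKeyed(Supplier<S> createFn, BiFunction<S, T, R> mapFn) {
            this.state = createFn.get();
            this.mapFn = mapFn;
        }

        /** Returns the mapped item, or null when mapFn emits nothing. */
        public R accept(T item) {
            return mapFn.apply(state, item);
        }
    }

    /** Keyed variant: a dedicated state object is created lazily per grouping key. */
    public static final class Keyed<K, S, T, R> {
        private final Map<K, S> states = new HashMap<>();
        private final Supplier<S> createFn;
        private final Function<T, K> keyFn;
        private final BiFunction<S, T, R> mapFn;

        public Keyed(Supplier<S> createFn, Function<T, K> keyFn, BiFunction<S, T, R> mapFn) {
            this.createFn = createFn;
            this.keyFn = keyFn;
            this.mapFn = mapFn;
        }

        /** Looks up (or creates) the state for the item's key, then applies mapFn. */
        public R accept(T item) {
            S state = states.computeIfAbsent(keyFn.apply(item), key -> createFn.get());
            return mapFn.apply(state, item);
        }

        /** Number of state objects currently held; grows with the number of distinct keys. */
        public int stateCount() {
            return states.size();
        }
    }
}
```

With a counter as state, the non-keyed model counts all items together, while the keyed model keeps one counter per key. Because the keyed model's state map grows with every new key, TTL and eviction only matter for the keyed variant.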

Rob-Hazelcast and others added 6 commits

December 17, 2025 16:53


          Merge branch 'main' into stateful_force_eviction

8a37035


          Update docs/modules/pipelines/pages/transforms.adoc

1b97a2f

Co-authored-by: Krzysztof Jamróz <79092062+k-jamroz@users.noreply.github.com>


          Update docs/modules/pipelines/pages/transforms.adoc

20b7a2f

Co-authored-by: Krzysztof Jamróz <79092062+k-jamroz@users.noreply.github.com>


          Update docs/modules/pipelines/pages/transforms.adoc

cf3aec5

Co-authored-by: Krzysztof Jamróz <79092062+k-jamroz@users.noreply.github.com>


          Update docs/modules/pipelines/pages/transforms.adoc

a831a71

Co-authored-by: Krzysztof Jamróz <79092062+k-jamroz@users.noreply.github.com>


          stateful docs

c91b126

shultseva requested a review from k-jamroz

December 18, 2025 14:36

Rob-Hazelcast added 2 commits

December 19, 2025 16:19


          Merge branch 'main' into stateful_force_eviction

f704016


          copy edit

97fc680

Rob-Hazelcast approved these changes

Contributor

Rob-Hazelcast left a comment

Looks great, thanks! Good level of explanation / examples, very clear.

k-jamroz reviewed

docs/modules/pipelines/pages/transforms.adoc

    
              The state object is created using `createFn` and is passed to `mapFn` together with each input item.

              The `mapFn` function can update the state and emit a single output item or return null if no output is needed.

              For each grouping key, Jet maintains a dedicated state object created by the supplied `createFn`.

Contributor

k-jamroz Dec 19, 2025

Suggested change

    - For each grouping key, Jet maintains a dedicated state object created by the supplied `createFn`.
    + In the keyed `mapStateful` Jet maintains a dedicated state object created by the supplied `createFn` for each grouping key.

so it differentiates from previous paragraph

k-jamroz reviewed

docs/modules/pipelines/pages/transforms.adoc

    
              implement any kind of state machine and detect patterns in an input of

              any complexity.

              As with other stateful operations, you can also use a `groupingKey` to

Contributor

k-jamroz Dec 19, 2025

"As with other stateful operations, you can also use a groupingKey to" - is this part of the use case or rather a general description of the operator? We are introducing keyed variant again, which we did technically and not very explicitly ~7 paragraphs before. In the original version it was very close to the beginning so was much more natural, now it is after a long technical description.

k-jamroz reviewed

docs/modules/pipelines/pages/transforms.adoc

Comment on lines +869 to +870

    
                      () -> new TransactionEvent[2],

                 (state, id, event) -> {

Contributor

k-jamroz Dec 19, 2025

Suggested change

    -      () -> new TransactionEvent[2],
    - (state, id, event) -> {
    + () -> new TransactionEvent[2],
    + (state, id, event) -> {

k-jamroz reviewed

docs/modules/pipelines/pages/transforms.adoc

Comment on lines +885 to +886

    
                      // End of transaction received; this state is no longer needed

                      return state[1] != null;

Contributor

k-jamroz Dec 19, 2025

We cannot evict after receiving only TRANSACTION_END because the events can come out of order:

Suggested change

    - // End of transaction received; this state is no longer needed
    - return state[1] != null;
    + // we have both start and end events
    + return state[0] != null && state[1] != null;

Contributor

k-jamroz Dec 19, 2025

BTW, this seems to be a general pattern: deleteStatePredicate will often have the same condition as condition for generating final output from mapFn
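
A minimal plain-Java sketch of this pattern, and of the out-of-order problem raised above. It does not use the Jet API: `TxMatcherModel`, `Event`, `isComplete`, and the timestamps are hypothetical, and the model assumes the delete predicate is evaluated right after the mapping logic for the same event. The same `isComplete` condition drives both the final output and the state deletion.

```java
import java.util.ArrayList;
import java.util.HashMap;
import java.util.List;
import java.util.Map;
import java.util.function.Predicate;

// Hypothetical model of keyed start/end matching with a pluggable delete predicate.
public class TxMatcherModel {

    public enum EventType { TRANSACTION_START, TRANSACTION_END }

    public static final class Event {
        public final String txId;
        public final EventType type;
        public final long timestamp;

        public Event(String txId, EventType type, long timestamp) {
            this.txId = txId;
            this.type = type;
            this.timestamp = timestamp;
        }
    }

    /** Shared condition: slot 0 holds the start event, slot 1 the end event. */
    public static boolean isComplete(Event[] state) {
        return state[0] != null && state[1] != null;
    }

    private final Map<String, Event[]> states = new HashMap<>();
    private final Predicate<Event[]> deleteStatePredicate;
    /** Durations emitted for transactions whose start and end were both seen. */
    public final List<Long> durations = new ArrayList<>();

    public TxMatcherModel(Predicate<Event[]> deleteStatePredicate) {
        this.deleteStatePredicate = deleteStatePredicate;
    }

    public void accept(Event event) {
        // Per-key state, created on first use (the role of createFn).
        Event[] state = states.computeIfAbsent(event.txId, id -> new Event[2]);
        state[event.type == EventType.TRANSACTION_START ? 0 : 1] = event;
        // Mapping logic: emit only once both halves are present.
        if (isComplete(state)) {
            durations.add(state[1].timestamp - state[0].timestamp);
        }
        // Deletion logic: assumed to run after the mapping logic in this model.
        if (deleteStatePredicate.test(state)) {
            states.remove(event.txId);
        }
    }

    public int liveStates() {
        return states.size();
    }
}
```

If the end event arrives before the start event, a model built with `TxMatcherModel::isComplete` still emits the duration and then drops the state. A model built with `state -> state[1] != null` discards the end event's state immediately, so the late start event never finds its match and its state lingers.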

k-jamroz reviewed

docs/modules/pipelines/pages/transforms.adoc

    
                              },

                              // delete state immediately after transaction end

                              (state, txId, event) ->

                                      event.getType() == EventType.TRANSACTION_END,

Contributor

k-jamroz Dec 19, 2025

this example is less than ideal if events are allowed out of order and some may come after TRANSACTION_END

k-jamroz approved these changes

Contributor

k-jamroz left a comment

Approving in advance, the content as a whole is good. Need to fix example. As to the form - I expressed my preferences, but we can also keep it as is.
