[Rllib] Modify `StatelessCartPole` APPO example to use new APPO attributes #59894

simonsays1980 · 2026-01-06T10:45:12Z

Description

The StatelessCartPole example form APPO is timing out. This could be due to the latest changes in the APPO data pipeline. This PR modifies the setup of the example by using the new APPO attributes.

Related issues

Fixes https://buildkite.com/ray-project/postmerge/builds/15188#019b8f6e-2850-465e-a98c-63c29fbf98f7/L4702

Additional information

Optional: Add implementation details, API changes, usage examples, screenshots, etc.

Signed-off-by: simonsays1980 <simon.zehnder@gmail.com>

gemini-code-assist

Code Review

This pull request updates the StatelessCartPole APPO example to address a timeout issue, likely by using new APPO configuration attributes. The changes involve enabling MeanStdFilter for observation normalization and adjusting training parameters by setting use_circular_buffer to False and increasing broadcast_interval. My main feedback is regarding a leftover TODO comment, which could cause confusion about the stability of MeanStdFilter.

gemini-code-assist · 2026-01-06T10:46:42Z

rllib/examples/algorithms/appo/stateless_cartpole_appo.py

+    .env_runners(
+        env_to_module_connector=lambda env, spaces, device: MeanStdFilter(),
+    )


You've enabled MeanStdFilter here, but the TODO comment on lines 25-26 mentions that it might cause NaNs during training. If this issue has been resolved, it would be great to remove the TODO comment to avoid confusion for future readers. If the issue is still present, perhaps a note explaining why it's being enabled despite the potential problem would be helpful.

Signed-off-by: simonsays1980 <simon.zehnder@gmail.com>

pseudo-rnd-thoughts

LSTM

Modified APPO example to use new attributes in config.

c3ab6c6

Signed-off-by: simonsays1980 <simon.zehnder@gmail.com>

simonsays1980 requested a review from a team as a code owner January 6, 2026 10:45

gemini-code-assist bot reviewed Jan 6, 2026

View reviewed changes

ray-gardener bot added the rllib RLlib related issues label Jan 6, 2026

Removed redundant comment.

b7b8145

Signed-off-by: simonsays1980 <simon.zehnder@gmail.com>

simonsays1980 requested a review from pseudo-rnd-thoughts January 6, 2026 16:31

simonsays1980 added rllib-algorithms An RLlib algorithm/Trainer is not learning. rllib-system system issues, runtime env, oom, etc go add ONLY when ready to merge, run all tests labels Jan 6, 2026

simonsays1980 self-assigned this Jan 6, 2026

pseudo-rnd-thoughts approved these changes Jan 6, 2026

View reviewed changes

simonsays1980 enabled auto-merge (squash) January 6, 2026 16:51

simonsays1980 merged commit b22f6a6 into ray-project:master Jan 6, 2026
7 of 8 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[Rllib] Modify `StatelessCartPole` APPO example to use new APPO attributes #59894

[Rllib] Modify `StatelessCartPole` APPO example to use new APPO attributes #59894

simonsays1980 commented Jan 6, 2026

Uh oh!

gemini-code-assist bot left a comment

Uh oh!

gemini-code-assist bot Jan 6, 2026

Uh oh!

pseudo-rnd-thoughts left a comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

[Rllib] Modify StatelessCartPole APPO example to use new APPO attributes #59894

[Rllib] Modify StatelessCartPole APPO example to use new APPO attributes #59894

Conversation

simonsays1980 commented Jan 6, 2026

Description

Related issues

Additional information

Uh oh!

gemini-code-assist bot left a comment

Choose a reason for hiding this comment

Code Review

Uh oh!

gemini-code-assist bot Jan 6, 2026

Choose a reason for hiding this comment

Uh oh!

pseudo-rnd-thoughts left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

[Rllib] Modify `StatelessCartPole` APPO example to use new APPO attributes #59894

[Rllib] Modify `StatelessCartPole` APPO example to use new APPO attributes #59894