Unified Interface for World Models in Reinforcement Learning
One API. Multiple Architectures. Clear Contracts.
Alpha (v0.1.1) — Under active development. API may change between minor versions.
WorldFlux provides a unified Python interface for world models used in reinforcement learning.
World models let RL agents imagine before acting by predicting future states, rewards, and outcomes without touching the real environment. Upstream literature reports strong sample-efficiency gains for world-model methods in many settings (Hafner et al., 2023; Hansen et al., 2024).
The problem: every research team reimplements the same core components from scratch. DreamerV3, TD-MPC2, JEPA — different codebases, different APIs, incompatible training loops. Want to swap an encoder while keeping DreamerV3's dynamics? Rewrite everything.
WorldFlux solves this with a unified interface:
```python
# One API for any world model architecture
model = create_world_model("dreamerv3:size12m")
state = model.encode(obs)
trajectory = model.rollout(state, actions)  # imagine 15 steps ahead
```

- Swap components independently with the 5-layer pluggable architecture
- Reference-family implementations, with an evidence-backed MVP focused on DreamerV3 and TD-MPC2 local training; proof and public-evidence work remains an advanced workflow, and public proof claims require published evidence bundles
- Training infrastructure with replay buffers, checkpointing, and callbacks
- One API: `encode()`, `transition()`, `decode()`, and `rollout()` work across all model families
- Unified API: common interface across model families
- API Stability Tiers: public surfaces can be classified as stable or experimental via a generated manifest
- v3-first API: `create_world_model()` defaults to `api_version="v3"` (strict contracts enabled)
- Universal Payload Layer: `ActionPayload`/`ConditionPayload` for polymorphic conditioning
- Planner Contract: planners return `ActionPayload` with `extras["wf.planner.horizon"]`
- Simple Usage: one-liner model creation with `create_world_model()`
- Pluggable 5-layer core: optional `component_overrides` for encoder/dynamics/conditioner/decoder/rollout
- Training Infrastructure: complete training loop with callbacks, checkpointing, and logging
- Type Safe: full type annotations and mypy compatibility
- Reference Tiers: DreamerV3 profiles now distinguish `compatibility`, `reference`, and `proof` roles for docs/tooling alignment
Install uv first if you do not have it yet: uv installation guide.
```bash
uv tool install worldflux
worldflux init my-world-model
```

Optional: enable the InquirerPy-powered prompt UI.

```bash
uv tool install --with inquirerpy worldflux
```

`worldflux init` now performs cross-platform pre-init dependency assurance. It provisions a user-scoped bootstrap virtual environment and installs the selected environment dependencies before scaffolding:

- Linux/macOS default: `~/.worldflux/bootstrap/py<major><minor>`
- Windows default: `%LOCALAPPDATA%/WorldFlux/bootstrap/py<major><minor>`

Environment variables:

- `WORLDFLUX_BOOTSTRAP_HOME`: override the bootstrap root path
- `WORLDFLUX_INIT_ENSURE_DEPS=0`: disable auto-bootstrap (emergency bypass)
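As a concrete reading of those rules, the bootstrap directory resolves roughly as sketched below. This is an illustration, not WorldFlux's actual implementation; `resolve_bootstrap_dir` is a hypothetical helper name.

```python
import os
import sys
from pathlib import Path

def resolve_bootstrap_dir() -> Path:
    """Mirror the documented bootstrap-path rules (illustrative sketch only)."""
    override = os.environ.get("WORLDFLUX_BOOTSTRAP_HOME")
    if override:
        root = Path(override)
    elif sys.platform == "win32":
        root = Path(os.environ["LOCALAPPDATA"]) / "WorldFlux" / "bootstrap"
    else:
        root = Path.home() / ".worldflux" / "bootstrap"
    # py<major><minor> convention, e.g. py311 for Python 3.11
    return root / f"py{sys.version_info.major}{sys.version_info.minor}"
```

Setting `WORLDFLUX_BOOTSTRAP_HOME` therefore replaces only the root; the per-Python-version suffix still applies.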
```bash
git clone https://github.com/worldflux/WorldFlux.git
cd worldflux
uv sync
source .venv/bin/activate
worldflux init my-world-model

# With training dependencies
uv sync --extra training

# With all optional dependencies
uv sync --extra all

# For development
uv sync --extra dev
```

Alternatively, install the published wheel:

```bash
uv pip install worldflux
worldflux init my-world-model
worldflux doctor
```

`quick_verify` supports lightweight verification tiers for checkpoint-centric workflows:

- `synthetic`: default compatibility path
- `offline`: baseline-backed quick verification without parity tooling
- `real_env_smoke`: reserved short-horizon smoke tier for real-environment checks
```bash
cd website
npm ci
npm run build

# Optional: local docs dev server
npm start
```

```bash
uv sync --extra dev
uv run python examples/quickstart_cpu_success.py --quick
```

This official smoke path uses a random replay buffer and a CI-sized model to validate installation and core contracts on CPU. It is not a benchmark or a real-environment reproduction path.
```bash
uv sync --extra dev --extra training
uv run python examples/compare_unified_training.py --quick
```

This repository-level public demo shows DreamerV3 and TD-MPC2 running through the same unified API, the same training contract, and the same quick verification flow. It emits a shared `summary.json`, per-family imagination artifacts, and per-family `quick_verify.json` outputs.
Treat this as a contract demonstration only. It is not a benchmark, paper reproduction, or public proof claim. In this demo, quick verify separates workflow completion from statistical quality. A per-family quick verify warning means the artifacts are structurally valid but the run is not yet a quality gate pass.
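When scripting around the demo, the completion-vs-quality distinction can be handled mechanically. The snippet below is a sketch only: the exact schema of `quick_verify.json` is not documented here, so the `status` field and its values are assumptions.

```python
import json
from pathlib import Path

def classify_quick_verify(path: Path) -> str:
    """Classify a per-family quick_verify.json as 'pass', 'warning', or 'fail'.

    The 'status' key and its values are assumed for illustration; inspect the
    generated artifacts for the real schema.
    """
    report = json.loads(path.read_text())
    status = report.get("status", "fail")
    if status == "warning":
        # Workflow completed and artifacts are structurally valid,
        # but the run has not cleared the statistical quality gate.
        return "warning"
    return "pass" if status == "pass" else "fail"
```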
WorldFlux now treats the public default path as an evidence-first surface:
- `supported`: DreamerV3 / TD-MPC2 local native training and evaluation
- `advanced`: canonical proof-oriented presets such as `dreamerv3:official_xl` and `tdmpc2:proof_5m`
- `experimental`/`internal`: non-default families kept for research or plugin work
By default, `worldflux models list` shows only the supported surface. Use `--surface public` when you intentionally want advanced proof-oriented presets, and `--surface all` when you intentionally want experimental or skeleton families.
For the first scaffolded end-to-end walkthrough after that smoke test, use
Train Your First Model.
The two official newcomer lanes are:
- contract smoke
- meaningful local training
The supported newcomer path starts with the contract-smoke lane:
```bash
worldflux init
worldflux train
worldflux verify --target ./outputs --mode quick
```
This lane is the current evidence-backed MVP surface for installation and contract validation. Treat it as a local compatibility workflow, not as a benchmark or public proof claim.
In this lane, worldflux verify --mode quick may return a workflow warning
instead of a hard failure when the synthetic threshold is missed. That warning
means the command executed and the generated artifacts are interpretable, but
you have not yet cleared a stronger quality gate.
The meaningful-local-training lane starts after that smoke passes:
- For the guaranteed DreamerV3 lane, install Atari extras with `uv sync --extra training --extra atari`
- Set `data.source = "gym"` in `worldflux.toml`
- Use `ALE/Breakout-v5`
- Rerun `worldflux train`
- Inspect `outputs/run_manifest.json` and confirm `run_classification` is `meaningful_local_training` with no `degraded_modes`
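The final inspection step can be scripted. This sketch relies only on the two fields named above (`run_classification` and `degraded_modes`); treat the rest of the manifest layout as unspecified.

```python
import json
from pathlib import Path

def is_meaningful_local_training(manifest_path: Path) -> bool:
    """Check the documented success criteria for the meaningful-local-training lane."""
    manifest = json.loads(manifest_path.read_text())
    classification_ok = (
        manifest.get("run_classification") == "meaningful_local_training"
    )
    # A missing or empty degraded_modes list counts as "no degraded modes".
    no_degradation = not manifest.get("degraded_modes")
    return classification_ok and no_degradation
```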
The newcomer wheel-install smoke is exercised in CI on Linux and macOS; Windows bootstrap support exists in implementation but is not yet part of that E2E guarantee.
In scaffolded projects, `worldflux train` reuses the generated onboarding helpers such as `dataset.py`, `local_dashboard.py`, and `dashboard/index.html`.
```python
from worldflux import create_world_model

model = create_world_model("dreamerv3:size12m")
```

```python
from worldflux import ActionPayload, ConditionPayload

state = model.encode(obs)
next_state = model.transition(
    state,
    ActionPayload(kind="continuous", tensor=action),
    conditions=ConditionPayload(goal=goal_tensor),
)
```

```python
from worldflux import create_world_model

model = create_world_model(
    "tdmpc2:ci",
    obs_shape=(4,),
    action_dim=2,
    component_overrides={
        # values can be registered component ids, classes, or instances
        "action_conditioner": "my_plugin.zero_action_conditioner",
    },
)
```

External packages can register plugins through entry-point groups:

- `worldflux.models`
- `worldflux.components`
```python
import torch

obs = torch.randn(1, 3, 64, 64)
state = model.encode(obs)
actions = torch.randn(15, 1, 6)  # [horizon, batch, action_dim]
trajectory = model.rollout(state, actions)
print(f"Predicted rewards: {trajectory.rewards.shape}")
print(f"Continue probs: {trajectory.continues.shape}")
```

```python
from worldflux import create_world_model
from worldflux.training import train, ReplayBuffer

model = create_world_model("dreamerv3:size12m", obs_shape=(3, 64, 64), action_dim=6)
buffer = ReplayBuffer.load("trajectories.npz")
trained_model = train(model, buffer, total_steps=50_000)
trained_model.save_pretrained("./my_model")
```

| Family | Presets | Status |
|---|---|---|
| DreamerV3 | `size12m`, `size25m`, `size50m`, `size100m`, `size200m` | Reference-family |
| TD-MPC2 | `5m`, `19m`, `48m`, `317m` | Reference-family |
Reference-family models map to maintained upstream families and internal proof-mode parity workflows. Public proof claims require published evidence bundles; local fixtures and internal runs are not enough on their own. Experimental and skeleton families remain available behind explicit surface opt-in and are not part of the default MVP promise.
Reference-family Dreamer profiles additionally expose alignment metadata for docs/tooling:
- `dreamer:ci` -> `compatibility`
- `dreamerv3:size12m` through `dreamerv3:size200m` -> `reference`
- `dreamerv3:official_xl` -> `proof`
Reference-family TD-MPC2 profiles expose the same tier vocabulary:
- `tdmpc2:ci` -> `compatibility`
- `tdmpc2:5m`, `tdmpc2:19m`, `tdmpc2:48m`, `tdmpc2:317m` -> `reference`
- `tdmpc2:proof_5m` -> `proof`
- `tdmpc2:5m_legacy` -> `compatibility`
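Collected into a single lookup, the two mappings above read as follows. This table is purely illustrative; the authoritative source is the alignment metadata the profiles themselves expose.

```python
# Preset -> tier vocabulary, transcribed from the lists above.
PRESET_TIERS: dict[str, str] = {
    "dreamer:ci": "compatibility",
    "dreamerv3:size12m": "reference",
    "dreamerv3:size25m": "reference",
    "dreamerv3:size50m": "reference",
    "dreamerv3:size100m": "reference",
    "dreamerv3:size200m": "reference",
    "dreamerv3:official_xl": "proof",
    "tdmpc2:ci": "compatibility",
    "tdmpc2:5m": "reference",
    "tdmpc2:19m": "reference",
    "tdmpc2:48m": "reference",
    "tdmpc2:317m": "reference",
    "tdmpc2:proof_5m": "proof",
    "tdmpc2:5m_legacy": "compatibility",
}
```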
This table lists the supported MVP presets. For the public default catalog, run:
```bash
worldflux models list --verbose
```

To inspect advanced proof-oriented presets explicitly:

```bash
worldflux models list --surface public --format json
```

To inspect experimental families explicitly:

```bash
worldflux models list --surface all --maturity experimental --format json
worldflux models list --surface all --maturity skeleton --format json
```

All world models implement the `WorldModel` base class:
```python
state = model.encode(obs)
next_state = model.transition(state, action)
next_state = model.update(state, action, obs)
output = model.decode(state)
preds = output.preds  # e.g. {"obs", "reward", "continue"}
trajectory = model.rollout(initial_state, actions)
loss_out = model.loss(batch)  # LossOutput (loss_out.loss, loss_out.components)
```

```python
from worldflux.training import (
    Trainer,
    TrainingConfig,
    ReplayBuffer,
    train,
)
from worldflux.training.callbacks import (
    LoggingCallback,
    CheckpointCallback,
    EarlyStoppingCallback,
    ProgressCallback,
)
```

See the `examples/` directory:
- `quickstart_cpu_success.py` - Official CPU-first smoke path using a random replay buffer
- `compare_unified_training.py` - Official unified comparison demo with the same quick verification flow for DreamerV3 and TD-MPC2
- `benchmarks/evidence_dreamerv3_breakout.py` - Evidence-oriented DreamerV3 Breakout bundle with returns, checkpoints, manifests, and report artifacts
- `collect_mujoco.py` - MuJoCo dataset collection with dataset manifest support and a policy-checkpoint collector path
- `benchmarks/evidence_tdmpc2_halfcheetah.py` - Evidence-oriented TD-MPC2 benchmark that emits curves, returns, checkpoints, and report artifacts
- `worldflux_quickstart.ipynb` - Interactive Colab notebook
- `train_dreamer.py` - Family-specific manual Dreamer training example
- `train_tdmpc2.py` - Family-specific manual TD-MPC2 training example
- `visualize_imagination.py` - Imagination rollout visualization
```bash
uv run python examples/quickstart_cpu_success.py --quick
uv run python examples/compare_unified_training.py --quick
uv run python examples/train_dreamer.py --test
uv run python examples/train_dreamer.py --data trajectories.npz --steps 100000
```

- Full Documentation - Guides and API reference
- API Reference - Contract and symbol-level docs
- Reference - Operational and quality docs
- Release Checklist - Canonical local release validation gates
- Release Runbook - Operator flow for publishing a release
See docs/roadmap.md for the current technical priority list.
Join our Discord to discuss world models, get help, and connect with other researchers and developers.
- Support channels and response paths: SUPPORT.md
- Community expectations and reporting: CODE_OF_CONDUCT.md
See SECURITY.md for security considerations, especially regarding loading model checkpoints from untrusted sources.
Apache License 2.0 - see LICENSE and NOTICE for details.
Contributions are welcome. Please read our Contributing Guide before submitting pull requests.
If you use this library in your research, please cite:
```bibtex
@software{worldflux,
  title = {WorldFlux: Unified Interface for World Models},
  year = {2026},
  url = {https://github.com/worldflux/WorldFlux}
}
```