Decision Intelligence Runtime (DIR)

An architectural framework for building reliable, accountable, and stateful AI decision systems.

Project Goal

Current "agent frameworks" treat Large Language Models (LLMs) as autonomous executors, resulting in non-deterministic behaviors, hallucinations in critical control paths, and insufficient accountability mechanisms.

Decision Intelligence Runtime addresses this by enforcing a strict separation between Reasoning (LLM-based, semantic, probabilistic) and Execution (code-based, deterministic, verifiable). This separation is implemented through a Kernel Space / User Space architectural boundary, inspired by operating system design principles.

Unlike purely theoretical frameworks, DIR provides a concrete, executable implementation of safe delegation patterns.

Origins

DIR emerged from production constraints in the AIvestor automated trading system, where the cost of reasoning failures is measured in capital loss. The patterns documented here are not theoretical-they represent battle-tested solutions to "Day Two" failure modes: state drift, non-idempotent operations, TOCTOU vulnerabilities, and the collapse of accountability in multi-agent systems.

This repository contains the architectural concepts, formal specifications, and reference implementations of the DIR framework.

Architectural Convergence & Validation

The publication of Intelligent AI Delegation by Google DeepMind (February 2026, arXiv:2602.11865) confirms the architectural direction we have been developing since early 2025. Their formalization of "Responsibility Transfer," "Auditability," and "Permission Handling" as fundamental requirements for agentic systems aligns with the patterns we have been validating in production environments.

The independent convergence is notable:

Google's "Responsibility Transfer" ≈ ROA Responsibility Contracts
Google's "Auditability" ≈ DecisionFlow ID (DFID)
Google's "Permission Handling" ≈ Decision Integrity Module (DIM)

This alignment reinforces that these patterns are not vendor-specific abstractions-they are architectural necessities for production-grade agentic systems.

Industry Validation: Google DeepMind vs. DIR Implementation

While Google's Intelligent AI Delegation (2026) outlines the theoretical requirements for safe agentic systems, DIR provides the open-source architectural implementation available today.

Requirement (Google DeepMind)	DIR Implementation (This Repo)	Status
Verifiable Task Completion	Topology C (DL+PCI) Cryptographic Proof-Carrying Intents	✅ Implemented
Transfer of Authority	Responsibility Contracts Machine-readable scope definitions	✅ Implemented
Permission Handling	Kernel Space (DIM) Deterministic policy enforcement	✅ Implemented
Structural Transparency	DecisionFlow ID (DFID) End-to-end distributed tracing	✅ Implemented

Read the full framework comparison: Google DeepMind vs. DIR/ROA

Quick Start

The fastest way to see the DIR architecture in action:

pip install -e .
python samples/00_quick_start/run.py

This sample demonstrates protection against catastrophic actions (e.g. parsing error turning 15.5 ETH into 15,500 ETH) and prompt injection in external data. High-level overview of the full architecture. See samples/00_quick_start.

Start Here

If you are new to DIR/ROA, begin with the introduction article. It explains the core motivation ("Day Two" failures in production systems), the Kernel Space vs. User Space architectural boundary, and why traditional agentic loops are insufficient for high-stakes environments.

Read: Beyond Prompt Engineering - Building a Deterministic Runtime for Responsible AI Agents

Why use DIR today?

Big Tech providers are beginning to theorize about "Agent Runtimes" as proprietary cloud services. DIR is the first open-source implementation of the Zero Trust Agents philosophy - agents propose, the runtime verifies; no implicit trust in LLM output.

No Vendor Lock-in: Run your agents on AWS, Azure, GCP, or on-premise. The runtime logic is yours.
Ready for Production: Solves "Day Two" problems (loops, drifts, audit) that simple orchestration libraries ignore.
Works with Existing Frameworks: DIR is not a competitor to LangChain, AutoGen, or LangGraph - it's the execution shell. Wrap task-oriented agents in ROA contracts, inject mission boundaries, and enforce deterministic validation. (See: 34_langchain_roa_wrapper, 35_crewai_roa_wrapper)
Language Agnostic Architecture: While the reference implementation is Python, the patterns (Kernel/User separation, DFID) are universal.
Context as Code: Documentation is the new compiler - Markdown files act as system prompts for AI agents. (Context as Code)

Core Concepts

1. Responsibility-Oriented Agents (ROA)

Current Status: Published

ROA is the architectural pattern for agents themselves. Instead of open-ended control loops, ROA constrains agents through explicit contracts:

Responsibility Contracts: Formal scope boundaries and authority limits.
Missions: Explicit optimization objectives defining agent purpose.
Stateful Existence: Long-lived memory and persistent identity.
Decision Lifecycle: Explain → Policy → Proposal (strict separation of reasoning from execution).

Read the ROA Manifesto

2. The Runtime Architecture

Current Status: Published

The execution environment that enforces deterministic guarantees:

Decision Integrity Module (DIM): Kernel-space validation layer (schema enforcement, RBAC, state consistency checks).
Context Compilation: Immutable state snapshots preventing TOCTOU vulnerabilities.
DecisionFlow ID (DFID): Distributed tracing for reasoning chain reconstruction.
Safety Invariants: Idempotency guarantees, TTL enforcement, and escalation protocols.

Read the DIR Architectural Pattern

3. Decision Intelligence Topologies

Current Status: Work in Progress

A pluralistic approach to agent orchestration. No single execution model satisfies all operational requirements. DIR defines three distinct topologies optimized for different constraint profiles:

Topology A (EOAM): Event-Oriented Agent Mesh for decentralized strategy coordination.
Topology B (SDS): Structural Decision Streams for high-velocity, grammar-constrained execution.
Topology C (DL+PCI): Decision Ledger with Proof-Carrying Intents for cryptographically verifiable audit trails.

Read Decision Intelligence Topologies

4. Post-Execution Governance & Drift

Current Status: Work in Progress

An architectural defense against "Day Three" problems, where an agent's individual decisions are technically valid (Kernel Compliance), but its aggregate behavior over time erodes business intent (Business Health).

Agent Drift Taxonomy: Formal classification of Optimization (Reward Hacking), Semantic, and Environmental drift.
Rolling Window Monitors: Asynchronous analysis of execution logs and context snapshots linked via DFID.
Circuit Breaking: Automated mechanism to transition an agent's Registry status to SUSPENDED upon detecting statistically significant drift.

Read Governance & Agent Drift

Getting Started

Prerequisites

Python 3.12+
SQLite3 (optional - only for samples that use persistent storage)
Ollama (optional - for local LLM inference in samples such as 31_finance_trading; use USE_MOCK_LLM=1 to run without it)

Installation

Clone the repository.
Install the package in editable mode:

pip install -e .

This installs the dir package (source in src/dir), making it available to all sample implementations.

Repository Structure

decision-intelligence-runtime/
├── README.md
├── FAQ.md                    # Frequently asked questions (architecture, adoption, compliance)
├── pyproject.toml            # pip install -e . installs dir + utils
├── requirements.txt          # Shared dependencies
├── src/
│   ├── dir/                  # Core DIR/ROA components (per docs spec)
│   │   # DFID, EventBus, DIM, Context Store, models, arbitration, PCI, etc.
│   └── utils/                # Supporting utilities for samples
│       # QuoteGenerator, NewsGenerator, market_events, logging_utils
├── samples/                  # Reference implementations (01–11 mechanics, 31+ use cases)
│   ├── README.md             # Sample catalog and run instructions
│   ├── 00_quick_start/ … 11_topology_c_dl_pci/
│   ├── 31_finance_trading/ … 35_crewai_roa_wrapper/ … 36_drift_optimization_discount/ … 37_drift_semantic_refund/ … 38_drift_environmental_bidding/
│   └── 88_meta_context_engineering/   # Meta-sample: System Prompt Toolkit
├── docs/                     # Architectural documentation
│   ├── 00-introduction/      # DIR intro, framework mapping
│   ├── 01-roa-manifesto/
│   ├── 02-decision-runtime/
│   ├── 03-topologies/
│   ├── 07-dir-minified/      # Machine-optimized single-file spec (DIR-minified.md)
│   └── 08-conclusion/        # Context as Code concept
└── assets/                   # Images, diagrams

Samples & Reference Implementations

Execute any sample from the repository root: python samples/<folder>/run.py

Full list and details: samples/README.md

Mechanics & Topologies (00–11)

#	Sample	Focus	Description
00	`00_quick_start`	Quick Start	High-level overview: comma catastrophe, prompt injection
01	`01_roa_agent`	ROA Manifesto	Contract, Explain → Policy → Proposal
02	`02_dfid_propagation`	DIR Pattern	DecisionFlow ID: generation, propagation, logging
03	`03_idempotency_guard`	DIR Pattern	Idempotency: preventing duplicate side effects
04	`04_context_store`	DIR Pattern	4 Layers of Context: Session, State, Memory, Artifacts
05	`05_dim_validation`	DIR Pattern	Decision Integrity Module: deterministic validation gate
06	`06_agent_registry`	DIR Pattern	Agent Registry: manifests and capability handshake
07	`07_event_bus_swappable`	Infrastructure	In-memory Event Bus; note on swapping for Kafka/PubSub
08	`08_bootstrap_sqlite`	Infrastructure	Bootstrap: ensure DB and tables exist before run
09	`09_topology_a_eoam`	Topology A	Event-Oriented Agent Mesh
10	`10_topology_b_sds`	Topology B	Sovereign Decision Stream
11	`11_topology_c_dl_pci`	Topology C	Decision Ledger & Proof-Carrying Intents
88	`88_meta_context_engineering`	Meta-Sample	Travel/insurance: System Prompt Toolkit. Domain: flight delay refunds. Generates Flight Delay Refund System (Topology C).

Business Use Cases (31+)

#	Sample	Domain	What it checks
31	`31_finance_trading`	Finance/trading	Market quotes, news, parallel agents; position spawning and execution. (Topology: EOAM)
32	`32_fraud_gate`	Fraud detection	Real-time payment fraud gate; constrained decoding, JIT state drift, drift-attack demo. (Topology: SDS)
33	`33_insurance_underwriting`	Insurance underwriting	Risk evaluation with cryptographic Proof-Carrying Intents (PCI). (Topology: DL+PCI)
34	`34_langchain_roa_wrapper`	FinOps	LangChain ReAct → ROA. Cloud cost management. Verifies mission injection blocks PROD termination.
35	`35_crewai_roa_wrapper`	Customer claims/refunds	CrewAI Crew → ROA. E-commerce refunds (EUR). Verifies ACCEPT/ESCALATE/REJECT by category, return window, amount; NL intake.
36	`36_drift_optimization_discount`	Retention discounts	Optimization drift (reward hacking): agent offers discounts within DIM hard cap but profitability decays in aggregate. PerformanceMonitor detects rolling-average breach and suspends the agent. Shows that kernel compliance ≠ business health. (See Governance & Drift)
37	`37_drift_semantic_refund`	Support refunds	Semantic drift (emotional manipulation): refunds stay under a EUR cap so DIM accepts, but the delay policy is violated. ComplianceMonitor joins `execution_log` to `context_snapshots` and suspends when rolling semantic violation rate exceeds the threshold. (See Governance & Drift)
38	`38_drift_environmental_bidding`	AdTech / bidding	Environmental drift: market CPC escalates; bids stay under a contract cap so DIM accepts, but rolling average CPC exceeds LTV. BusinessROIMonitor joins `execution_log` to `market_snapshots` and suspends after consecutive negative ROI cycles. (See Governance & Drift)

Documentation

DIR Introduction - Architectural motivation and Kernel Space / User Space boundary.
ROA Manifesto - Responsibility-oriented agent design.
DIR Architecture - Runtime components and invariants.
Topologies - Operational modes (EOAM, SDS, DL+PCI).
Governance & Drift - Managing aggregate safety and business health over time.
Context as Code - Treating documentation as a system prompt.
FAQ - Answers to common engineering questions about DIR/ROA: "Day Two" failure modes, Kernel vs User Space, comparison with orchestration frameworks, JIT state verification, idempotency, compliance (e.g. EU AI Act and Proof-Carrying Intents), and incremental adoption.

Machine-optimized specification: DIR-minified.md

DIR-minified is a single-file, machine-optimized version of the framework specification. It is intended for use as context by LLMs and code-generation agents (e.g. Cursor, Claude, Devin), not as primary reading for humans.

The document is intentionally radical (dense, exhaustive) and redundant (repeats key constraints and examples in multiple places) so that a model loading it as context has a complete, self-contained picture of ROA, DIR, and the three topologies without chasing links. If you are feeding this repo to an AI to implement or extend DIR, attach DIR-minified.md as the main spec; the human-oriented docs in docs/ remain the narrative and tutorial layer.

License

This project is licensed under the Apache License, Version 2.0 – see the LICENSE file for details.

Author

Artur Huk - LinkedIn

This repository represents an evolving architectural framework derived from production constraints in high-stakes AI decision systems.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Decision Intelligence Runtime (DIR)

Project Goal

Origins

Architectural Convergence & Validation

Industry Validation: Google DeepMind vs. DIR Implementation

Quick Start

Start Here

Why use DIR today?

Core Concepts

1. Responsibility-Oriented Agents (ROA)

2. The Runtime Architecture

3. Decision Intelligence Topologies

4. Post-Execution Governance & Drift

Getting Started

Prerequisites

Installation

Repository Structure

Samples & Reference Implementations

Mechanics & Topologies (00–11)

Business Use Cases (31+)

Documentation

Machine-optimized specification: DIR-minified.md

License

Author

About

Uh oh!

Releases

Packages

Uh oh!

Contributors 1

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 75 Commits
assets		assets
docs		docs
samples		samples
src		src
tests		tests
.gitignore		.gitignore
CONTRIBUTING.md		CONTRIBUTING.md
FAQ.md		FAQ.md
HISTORY.md		HISTORY.md
LICENSE		LICENSE
README.md		README.md
pyproject.toml		pyproject.toml
requirements.txt		requirements.txt

Folders and files

Latest commit

History

Repository files navigation

Decision Intelligence Runtime (DIR)

Project Goal

Origins

Architectural Convergence & Validation

Industry Validation: Google DeepMind vs. DIR Implementation

Quick Start

Start Here

Why use DIR today?

Core Concepts

1. Responsibility-Oriented Agents (ROA)

2. The Runtime Architecture

3. Decision Intelligence Topologies

4. Post-Execution Governance & Drift

Getting Started

Prerequisites

Installation

Repository Structure

Samples & Reference Implementations

Mechanics & Topologies (00–11)

Business Use Cases (31+)

Documentation

Machine-optimized specification: DIR-minified.md

License

Author

About

Topics

Resources

License

Contributing

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors 1

Languages

Packages