Main Flow

1. Purpose

This document defines the main runtime flow of the ANDB v1 prototype. Its goal is to keep all contributors attached to one shared end-to-end path instead of building isolated pieces that cannot integrate.

The flow described here is both:

the architectural target for v1
the integration contract that current code should evolve toward

2. End-to-End Loop

The core ANDB loop is:

event input -> event ingest -> canonical object materialization -> retrieval projection -> query planning -> multi-path retrieval -> graph expansion -> evidence assembly -> proof trace -> structured response

This loop is the most important contract in the repository.

3. Why the Main Flow Must Be Frozen Early

If the flow is not defined early, the repository will drift in predictable ways:

event payloads will stop matching object materialization needs
retrieval will optimize for chunks rather than objects
graph expansion will not know its seed contract
response packaging will become inconsistent across modules
experiments will benchmark the wrong interface

For ANDB, the main flow is not documentation after the fact. It is a design artifact.

4. Flow A: Ingest

4.1 Goal

Receive raw event input and convert it into a validated event envelope that becomes the source of downstream state change.

4.2 Current Entry Point

HTTP route: /v1/ingest/events
Gateway implementation: src/internal/access/gateway.go
Runtime entry: src/internal/worker/runtime.go

4.3 Input Shape

The current runtime ingests schemas.Event, defined in src/internal/schemas/canonical.go.

Typical event types include:

user_message
assistant_message
tool_call_issued
tool_result_returned
plan_updated
critique_generated

4.4 Steps

request reaches the access layer
request is decoded into an Event
event is appended to the WAL
the append returns an LSN (logical sequence number)
downstream consumers are notified through the in-memory bus
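The steps above can be sketched as follows. This is a minimal illustration, not the code in src/internal/worker/runtime.go: the Event fields, the WAL and Bus types, and the Ingest function are all hypothetical names, and request decoding is left out.

```go
package main

import "sync"

// Event is a simplified stand-in for schemas.Event; the fields are assumptions.
type Event struct {
	EventID   string
	EventType string
	Payload   string
}

// WAL is a hypothetical in-memory write-ahead log.
type WAL struct {
	mu      sync.Mutex
	records []Event
}

// Append stores the event and returns its LSN (1-based position in the log).
func (w *WAL) Append(ev Event) uint64 {
	w.mu.Lock()
	defer w.mu.Unlock()
	w.records = append(w.records, ev)
	return uint64(len(w.records))
}

// Bus is a hypothetical in-memory notifier with synchronous subscribers.
type Bus struct {
	mu   sync.Mutex
	subs []func(lsn uint64, ev Event)
}

// Subscribe registers a callback that runs for every ingested event.
func (b *Bus) Subscribe(fn func(lsn uint64, ev Event)) {
	b.mu.Lock()
	defer b.mu.Unlock()
	b.subs = append(b.subs, fn)
}

// Publish calls every subscriber in registration order.
func (b *Bus) Publish(lsn uint64, ev Event) {
	b.mu.Lock()
	subs := append([]func(uint64, Event){}, b.subs...)
	b.mu.Unlock()
	for _, fn := range subs {
		fn(lsn, ev)
	}
}

// Ingest applies the write-first rule: append to the WAL, then notify.
func Ingest(w *WAL, b *Bus, ev Event) uint64 {
	lsn := w.Append(ev)
	b.Publish(lsn, ev)
	return lsn
}
```

In this sketch the subscribers run only after the append has returned, so a consumer never sees an event that is missing from the log.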

4.5 Current Runtime Reality

Today the runtime appends to the WAL and immediately feeds the data plane. Full event validation and dedicated materialization workers are still shallow, but the rule that every write goes to the WAL first is already part of the design.

4.6 Output

persisted event record in the in-memory WAL
ingest acknowledgment
trigger point for later materialization/indexing flow

5. Flow B: Materialization

5.1 Goal

Transform events into canonical objects and version-aware updates.

5.2 Why It Exists

Events are the source of truth for state change, but query execution should operate over object-centric forms rather than raw event streams alone.

5.3 Target Steps

load event envelope
determine which object types are affected
construct canonical objects
create or update ObjectVersion
generate typed edges where needed
persist canonical objects and relation records

5.4 Examples

user_message / assistant_message → Memory (episodic) + ObjectVersion + belongs_to_session + owned_by_agent edges
tool_result_returned → Memory (factual) + ObjectVersion + causal edges
plan_updated → Memory (procedural) + ObjectVersion
critique_generated → Memory (reflective) + ObjectVersion
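The mapping in these examples can be written as a small lookup. The sketch below is an assumption about shape only: MemoryType and memoryTypeFor are hypothetical names, and event types that the examples do not mention (such as tool_call_issued) are reported as unmapped rather than guessed.

```go
package main

// MemoryType names the memory categories used in the examples above.
type MemoryType string

const (
	Episodic   MemoryType = "episodic"
	Factual    MemoryType = "factual"
	Procedural MemoryType = "procedural"
	Reflective MemoryType = "reflective"
)

// memoryTypeFor maps an event type to the memory type listed in 5.4.
// The boolean is false for event types the examples do not cover.
func memoryTypeFor(eventType string) (MemoryType, bool) {
	switch eventType {
	case "user_message", "assistant_message":
		return Episodic, true
	case "tool_result_returned":
		return Factual, true
	case "plan_updated":
		return Procedural, true
	case "critique_generated":
		return Reflective, true
	default:
		return "", false
	}
}
```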

5.5 Current Runtime Reality

materialization.Service.MaterializeEvent(ev) returns a MaterializationResult containing:

Record — the IngestRecord for the retrieval plane
Memory — a canonical schemas.Memory object
Version — a schemas.ObjectVersion record
Edges — typed edges inferred from the event (belongs_to_session, owned_by_agent, derived_from)

Runtime.SubmitIngest writes the three canonical records (Memory, Version, and Edges) to their stores before feeding Record to the retrieval plane. PreComputeService.Compute then builds an EvidenceFragment and stores it in EvidenceCache.
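A sketch of that write ordering, under loud assumptions: every struct field below is invented for illustration, and the Stores type is a hypothetical in-memory stand-in for ObjectStore, SnapshotVersionStore, GraphEdgeStore, and the data plane. Only the four-part result and the "canonical records first, retrieval plane last" ordering come from the description above.

```go
package main

// All types below are simplified stand-ins; field names are assumptions.
type IngestRecord struct{ ObjectID, Text string }

type Memory struct{ ID, Content string }

type ObjectVersion struct {
	ObjectID string
	Version  int
}

type Edge struct{ SrcObjectID, DstObjectID, EdgeType string }

// MaterializationResult mirrors the four parts listed above.
type MaterializationResult struct {
	Record  IngestRecord
	Memory  Memory
	Version ObjectVersion
	Edges   []Edge
}

// Stores groups hypothetical in-memory stand-ins for the real stores.
type Stores struct {
	Objects  map[string]Memory
	Versions map[string][]ObjectVersion
	Edges    []Edge
	Plane    []IngestRecord // stand-in for the retrieval plane
}

// NewStores returns empty stores with their maps initialised.
func NewStores() *Stores {
	return &Stores{
		Objects:  make(map[string]Memory),
		Versions: make(map[string][]ObjectVersion),
	}
}

// Persist writes the canonical records first, then feeds the retrieval plane.
func (s *Stores) Persist(res MaterializationResult) {
	s.Objects[res.Memory.ID] = res.Memory
	s.Versions[res.Version.ObjectID] = append(s.Versions[res.Version.ObjectID], res.Version)
	s.Edges = append(s.Edges, res.Edges...)
	s.Plane = append(s.Plane, res.Record)
}
```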

Current anchor:

5.6 Output

Memory persisted to ObjectStore
ObjectVersion persisted to SnapshotVersionStore
typed Edge records persisted to GraphEdgeStore
EvidenceFragment stored in EvidenceCache
IngestRecord fed to TieredDataPlane

6. Flow C: Retrieval Projection

6.1 Goal

Prepare retrievable forms from canonical objects.

6.2 Why Projection Is Separate

Canonical objects represent semantic truth. Retrieval needs dense, sparse, and filterable projections derived from those objects.

6.3 Target Steps

choose retrievable objects
derive dense representation
derive sparse/lexical representation
extract filter attributes
store retrieval entries in the data plane

6.4 Current Runtime Reality

MaterializationResult.Record (an IngestRecord) is fed to TieredDataPlane.Ingest(), which writes to both the hot segment index (for immediate retrieval) and the warm plane. The object ID follows the pattern mem_<event_id>, and the record carries these filter attributes:

tenant_id, workspace_id, agent_id, session_id, event_type, visibility

In v1, retrieval is lexical (term-overlap scoring). Dense/vector retrieval is a planned extension.
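To make "term-overlap scoring" concrete, here is one possible reading: count the distinct query terms that also occur in the document. The exact scoring used by the data plane is not stated here, so the tokenizer, the scoring rule, and all function names below are assumptions.

```go
package main

import "strings"

// tokenize lowercases the text and splits it on whitespace.
func tokenize(text string) []string {
	return strings.Fields(strings.ToLower(text))
}

// overlapScore counts distinct query terms that also occur in the document.
func overlapScore(query, doc string) int {
	docTerms := make(map[string]bool)
	for _, t := range tokenize(doc) {
		docTerms[t] = true
	}
	seen := make(map[string]bool)
	score := 0
	for _, t := range tokenize(query) {
		if docTerms[t] && !seen[t] {
			seen[t] = true
			score++
		}
	}
	return score
}

// objectID builds the mem_<event_id> identifier described above.
func objectID(eventID string) string {
	return "mem_" + eventID
}

// matchesFilters reports whether every requested attribute equals the
// record's value, for example tenant_id or session_id.
func matchesFilters(attrs, want map[string]string) bool {
	for k, v := range want {
		if attrs[k] != v {
			return false
		}
	}
	return true
}
```

A record would be a candidate only when matchesFilters is true and overlapScore is above zero; how the real data plane combines the two is not specified in this document.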

Current anchor:

6.5 Output

retrieval-ready object IDs
searchable content representation
metadata for filtering and namespace partitioning

7. Flow D: Query

7.1 Goal

Accept a structured query request and retrieve candidate evidence seeds.

7.2 Current Entry Point

HTTP route: /v1/query
Request type: schemas.QueryRequest
Response type: schemas.QueryResponse

Current implementation:

7.3 Target Request Semantics

The v1 contract is intended to carry:

query text
agent/session context
scope restrictions
temporal filters
object and memory-type filters
relation expansion constraints
response mode
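One way to picture a request carrying these fields is the struct below. It is a hypothetical shape only: schemas.QueryRequest may use different field names, types, and JSON keys, and the decodeQuery helper is invented for the example.

```go
package main

import (
	"encoding/json"
	"errors"
	"time"
)

// QueryRequest is a hypothetical shape; schemas.QueryRequest may differ.
type QueryRequest struct {
	QueryText    string     `json:"query_text"`
	AgentID      string     `json:"agent_id,omitempty"`
	SessionID    string     `json:"session_id,omitempty"`
	Scope        []string   `json:"scope,omitempty"`
	Since        *time.Time `json:"since,omitempty"` // temporal filter, RFC 3339
	Until        *time.Time `json:"until,omitempty"`
	ObjectTypes  []string   `json:"object_types,omitempty"`
	MemoryTypes  []string   `json:"memory_types,omitempty"`
	MaxHops      int        `json:"max_hops,omitempty"`   // relation expansion limit
	EdgeTypes    []string   `json:"edge_types,omitempty"` // relation expansion filter
	ResponseMode string     `json:"response_mode,omitempty"`
}

// decodeQuery parses a JSON body and rejects requests without query text.
func decodeQuery(body []byte) (QueryRequest, error) {
	var req QueryRequest
	if err := json.Unmarshal(body, &req); err != nil {
		return QueryRequest{}, err
	}
	if req.QueryText == "" {
		return QueryRequest{}, errors.New("query_text is required")
	}
	return req, nil
}
```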

7.4 Current Steps

request reaches the query API
request is decoded into QueryRequest
runtime calls the embedded data plane
data plane performs search over segments
candidate object IDs are returned to response assembly
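The search step can be sketched as a scan over segments that returns both the top candidates and the list of segments it touched. Segment, Candidate, ScoreFunc, and Search are hypothetical names, and the scoring function is passed in so the sketch does not assume a particular ranking rule.

```go
package main

import "sort"

// Segment is a hypothetical in-memory segment: object ID -> searchable text.
type Segment struct {
	ID   string
	Docs map[string]string
}

// Candidate is a scored seed object.
type Candidate struct {
	ObjectID string
	Score    int
}

// ScoreFunc rates how well a text matches a query; zero means no match.
type ScoreFunc func(query, text string) int

// Search scans every segment and returns the top-k candidates together
// with the IDs of the segments that were scanned.
func Search(segments []Segment, query string, k int, score ScoreFunc) ([]Candidate, []string) {
	var found []Candidate
	var scanned []string
	for _, seg := range segments {
		scanned = append(scanned, seg.ID)
		for id, text := range seg.Docs {
			if s := score(query, text); s > 0 {
				found = append(found, Candidate{ObjectID: id, Score: s})
			}
		}
	}
	// Sort by score descending, then by ID so results are deterministic.
	sort.Slice(found, func(i, j int) bool {
		if found[i].Score != found[j].Score {
			return found[i].Score > found[j].Score
		}
		return found[i].ObjectID < found[j].ObjectID
	})
	if k < 0 {
		k = 0
	}
	if len(found) > k {
		found = found[:k]
	}
	return found, scanned
}
```

Returning the scanned segment IDs alongside the candidates keeps the information that response assembly needs for its proof notes.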

7.5 Current Runtime Reality

The current implementation is still lighter than the target contract:

dense/sparse separation is not explicit yet
filters are mostly recorded in response notes rather than enforced deep in the execution path
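graph expansion is not yet part of the query stage itself (the 1-hop expansion described in Flow E is performed by Assembler.expandEdges)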

But the contract shape already reserves space for those stages.

7.6 Output

seed object IDs
scanned segment information
retrieval path/proof notes for response packaging

8. Flow E: Graph Expansion

8.1 Goal

Transform retrieved seed objects into a local evidence subgraph through typed relations.

8.2 Why It Matters

This is where ANDB diverges from ordinary chunk retrieval. Instead of returning only ranked fragments, the system should assemble related objects and edges that explain why the answer is supported.

8.3 Target Steps

accept seed objects from retrieval
load incoming and outgoing edges
apply hop, edge-type, scope, and confidence constraints
assemble a local evidence graph
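The constraint step could look like the filter below, applied to edges that have already been loaded for the seeds. It covers only edge-type and confidence limits; hop and scope limits are omitted. The Confidence field and the Constraints type are assumptions, not part of the documented Edge schema.

```go
package main

// Edge is a simplified typed relation. Confidence is an assumed field.
type Edge struct {
	SrcObjectID string
	DstObjectID string
	EdgeType    string
	Confidence  float64
}

// Constraints is a hypothetical set of expansion limits.
type Constraints struct {
	EdgeTypes     []string // allowed edge types; empty allows every type
	MinConfidence float64  // edges below this confidence are dropped
}

// Allows reports whether a single edge satisfies the constraints.
func (c Constraints) Allows(e Edge) bool {
	if e.Confidence < c.MinConfidence {
		return false
	}
	if len(c.EdgeTypes) == 0 {
		return true
	}
	for _, t := range c.EdgeTypes {
		if t == e.EdgeType {
			return true
		}
	}
	return false
}

// FilterEdges keeps the edges that satisfy the constraints, preserving order.
func FilterEdges(edges []Edge, c Constraints) []Edge {
	var kept []Edge
	for _, e := range edges {
		if c.Allows(e) {
			kept = append(kept, e)
		}
	}
	return kept
}
```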

8.4 v1 Constraint

In v1, expansion is constrained to 1-hop over the GraphEdgeStore.

8.5 Current Runtime Reality

Assembler.expandEdges(objectIDs) calls GraphEdgeStore.BulkEdges(objectIDs) to load all edges where SrcObjectID or DstObjectID is one of the retrieved object IDs. The result is returned in QueryResponse.Edges and the expansion count is appended to the proof trace as graph_expansion:edges=N.

Edges are populated at ingest time by materialization.Service.MaterializeEvent (belongs_to_session, owned_by_agent, derived_from).
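The behaviour described above can be approximated with a linear scan. This is a sketch, not the code in src/internal/storage/memory.go: the edgeStore type and the function signatures are assumptions, and only the source-or-destination match and the graph_expansion:edges=N trace entry are taken from the text.

```go
package main

import "fmt"

// Edge is a simplified stand-in for schemas.Edge.
type Edge struct{ SrcObjectID, DstObjectID, EdgeType string }

// edgeStore is a hypothetical in-memory stand-in for GraphEdgeStore.
type edgeStore struct{ edges []Edge }

// BulkEdges returns every edge whose source or destination is in objectIDs.
func (s *edgeStore) BulkEdges(objectIDs []string) []Edge {
	want := make(map[string]bool, len(objectIDs))
	for _, id := range objectIDs {
		want[id] = true
	}
	var out []Edge
	for _, e := range s.edges {
		if want[e.SrcObjectID] || want[e.DstObjectID] {
			out = append(out, e)
		}
	}
	return out
}

// expandEdges loads the 1-hop edges and appends the expansion count
// to the proof trace.
func expandEdges(s *edgeStore, objectIDs []string, trace []string) ([]Edge, []string) {
	edges := s.BulkEdges(objectIDs)
	return edges, append(trace, fmt.Sprintf("graph_expansion:edges=%d", len(edges)))
}
```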

Current anchor:

src/internal/evidence/assembler.go
src/internal/storage/memory.go — memoryGraphEdgeStore.BulkEdges

9. Flow F: Response Assembly

9.1 Goal

Build the final structured response returned to the caller.

9.2 Target Response Content

The target v1 response includes:

objects
edges
provenance
versions
applied_filters
proof_trace

9.3 Current Runtime Reality

Assembler.Build() assembles a QueryResponse with:

objects — retrieved object IDs
edges — 1-hop schemas.Edge records from GraphEdgeStore.BulkEdges
provenance — ["event_projection", "retrieval_projection", "fragment_cache", "graph_expansion"]
versions — reserved (shallow in v1)
applied_filters — policy filters applied by PolicyEngine.ApplyQueryFilters
proof_trace — tier label + shard trace + pre-computed fragment steps + scanned shards

Pre-computed EvidenceFragment records (built at ingest by PreComputeService) are merged into the proof trace via EvidenceCache.GetMany(objectIDs), amortising chain derivation cost over the ingest path.
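A rough sketch of this assembly step follows. The six response categories and the provenance values come from the list above; everything else (field types, JSON keys, the Put helper, the Build signature, and the simplified trace ordering of base entries followed by fragment steps) is an assumption.

```go
package main

// Edge is a simplified stand-in for schemas.Edge.
type Edge struct{ SrcObjectID, DstObjectID, EdgeType string }

// EvidenceFragment is a simplified pre-computed fragment; fields are assumptions.
type EvidenceFragment struct {
	ObjectID string
	Steps    []string
}

// EvidenceCache is a hypothetical in-memory cache keyed by object ID.
type EvidenceCache struct {
	byID map[string]EvidenceFragment
}

// NewEvidenceCache returns an empty cache.
func NewEvidenceCache() *EvidenceCache {
	return &EvidenceCache{byID: make(map[string]EvidenceFragment)}
}

// Put stores a fragment, as the ingest path would after pre-computation.
func (c *EvidenceCache) Put(f EvidenceFragment) { c.byID[f.ObjectID] = f }

// GetMany returns the cached fragments for the given IDs, skipping misses.
func (c *EvidenceCache) GetMany(ids []string) []EvidenceFragment {
	var out []EvidenceFragment
	for _, id := range ids {
		if f, ok := c.byID[id]; ok {
			out = append(out, f)
		}
	}
	return out
}

// QueryResponse lists the six response categories named above.
type QueryResponse struct {
	Objects        []string `json:"objects"`
	Edges          []Edge   `json:"edges"`
	Provenance     []string `json:"provenance"`
	Versions       []string `json:"versions"`
	AppliedFilters []string `json:"applied_filters"`
	ProofTrace     []string `json:"proof_trace"`
}

// Build copies the base trace, then appends the cached fragment steps.
func Build(objectIDs []string, edges []Edge, filters, trace []string, cache *EvidenceCache) QueryResponse {
	proof := append([]string{}, trace...)
	for _, f := range cache.GetMany(objectIDs) {
		proof = append(proof, f.Steps...)
	}
	return QueryResponse{
		Objects:        objectIDs,
		Edges:          edges,
		Provenance:     []string{"event_projection", "retrieval_projection", "fragment_cache", "graph_expansion"},
		Versions:       nil, // reserved in v1
		AppliedFilters: filters,
		ProofTrace:     proof,
	}
}
```

Because the fragments are looked up rather than derived, the query path in this sketch pays only for map reads, which is the amortisation the paragraph above describes.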

10. Flow G: Benchmark and Experiment

10.1 Goal

Evaluate whether ANDB improves evidence-oriented retrieval over a simpler baseline.

10.2 Expected Tasks

generate mock events
ingest them through the public API
run representative queries
compare against a top-k-only baseline
collect retrieval and response metrics
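This document does not name the metrics to collect. As one illustrative option, recall at k could be computed for both the ANDB result list and the top-k-only baseline and then compared. The function name and the choice of metric are assumptions.

```go
package main

// recallAtK returns the fraction of relevant IDs that appear in the
// first k results. It returns 0 when there are no relevant IDs.
func recallAtK(results, relevant []string, k int) float64 {
	rel := make(map[string]bool, len(relevant))
	for _, id := range relevant {
		rel[id] = true
	}
	if len(rel) == 0 {
		return 0
	}
	if k < 0 {
		k = 0
	}
	if k > len(results) {
		k = len(results)
	}
	seen := make(map[string]bool)
	hits := 0
	for _, id := range results[:k] {
		if rel[id] && !seen[id] {
			seen[id] = true
			hits++
		}
	}
	return float64(hits) / float64(len(rel))
}
```

Running the same function over the baseline's ranked IDs and over the IDs in the structured response gives two numbers that can be compared per query.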

10.3 Current Assets

11. Module Ownership Along the Flow

11.1 Access / API

Owns:

route registration
request decoding
public contract exposure

11.2 Event Backbone / Runtime

Owns:

WAL append semantics
worker subscription path
ingest/query orchestration

11.3 Materialization / Semantic Layer

Owns:

event-to-object transformation
edge generation
version handling

11.4 Data Plane / Retrieval

Owns:

retrieval projections
search execution
candidate return

11.5 Graph / Response

Owns:

relation expansion
evidence graph assembly
proof trace packaging

11.6 Experiment Layer

Owns:

seed scripts
benchmark loops
baseline comparison

12. What Must Stay Stable in v1

The following contracts should remain stable unless deliberately reviewed:

event envelope shape
canonical object schema
query request shape
query response categories
candidate seed contract between retrieval and graph stages
edge typing conventions needed for evidence assembly

13. What Can Remain Flexible in v1

The following can still vary internally:

exact storage backend
embedding backend
sparse retrieval implementation
graph storage representation
in-process versus separated worker execution

These may change freely as long as the shared contracts stay coherent.

14. Summary

All implementation work should connect back to this path:

ingest -> materialize -> project -> retrieve -> expand -> assemble -> explain -> return

That is the operational skeleton of ANDB v1.
