docs: ARCHITECTURE.md — system overview + data flow diagrams (#116) by GoZumie · Pull Request #135 · BigInformatics/wagl

GoZumie · 2026-03-22T01:45:09Z

ASCII art system diagram, crate structure table, recall data flow, scoring system explanation, embedding strategy, sync architecture, and key design decisions.

Partial close of #116 (diagrams done, cargo doc generation not yet addressed).

Pairs well with the AGENTS.md refresh (#126, already merged).

PR Review by Greptile

Greptile Summary

This PR introduces ARCHITECTURE.md as a top-level reference document for contributors: an ASCII system diagram, crate structure table, recall data-flow walkthrough, scoring system explanation, embedding strategy, sync architecture, and key design decisions. The content is accurate and well-structured overall, but one diagram error needs fixing before the doc is safe to use as a contributor reference.

Key findings:

Diagram structural error (P1): The system overview ASCII diagram incorrectly chains wagl-db → wagl-core → libSQL, implying wagl-core has a storage dependency. The crate description and Key Design Decision Phase 1: Add --dedupe-key and --upsert for idempotent puts #6 both confirm wagl-core is purely types with no IO — libSQL should branch off wagl-db directly alongside wagl-core, not below it.
Stale draft note (P2): migrate.rs — schema versioning (currently v1, v2 in PR) is an unresolved development annotation; the "v2 in PR" suffix should be removed or updated to reflect the actual current state.
Internal service reference (P2): zumie.ai is named twice (Embedding Strategy and Sync Architecture). AGENTS.md instructs against committing internal hostnames/service URLs; if this is a private backend, it should be genericised per that policy.
Heading convention (P2): The H1 renders as ARCHITECTURE.md (with extension) — # Architecture is the conventional form.

Confidence Score: 3/5

Safe to merge after fixing the diagram's incorrect wagl-core → libSQL arrow, which would actively mislead contributors about the crate dependency structure.
The P1 diagram error is not a runtime issue but it is the primary purpose of this PR — a contributor architecture reference with a wrong dependency arrow defeats that goal. The P2 items (stale note, internal URL, heading) are easy fixes. Score reflects one concrete correctness issue remaining before the doc is trustworthy.
ARCHITECTURE.md — specifically the System Overview ASCII diagram (lines 27–36) and the migrate.rs line (53).

Important Files Changed

Filename	Overview
ARCHITECTURE.md	New architecture doc covering system overview, crate structure, recall data flow, scoring, embeddings, and sync. Main issue: the system overview ASCII diagram incorrectly wires `wagl-core` → `libSQL`, misrepresenting core as having a storage dependency. Also contains a stale "v2 in PR" note and two `zumie.ai` references that may violate AGENTS.md's internal-hostname policy.

Flowchart

%%{init: {'theme': 'neutral'}}%%
flowchart TD
    A["Agent / Human\n(OpenClaw, ChatGPT, Claude, CLI)"]
    A --> CLI["crates/cli\nwagl binary"]
    A --> Server["crates/server\nHTTP REST"]
    A --> MCP["crates/mcp\nstdio transport"]
    A --> OC["OpenClaw plugin"]

    CLI --> DB
    Server --> DB
    MCP --> DB
    OC --> DB

    DB["crates/db\nwagl-db\n(libSQL layer)"]
    DB --> Core["crates/core\nwagl-core\n(types, no IO)"]
    DB --> LibSQL["libSQL\n(local + Turso\nembedded sync)"]

    LibSQL --> Local["Local SQLite file"]
    LibSQL --> Turso["Turso Cloud"]

_{Last reviewed commit: "docs: add ARCHITECTU..."}

Greptile also left 4 inline comments on this PR.

Context used:

Context used - AGENTS.md (source)

…116) Covers: crate structure, system diagram, recall data flow, scoring system, embedding strategy, sync architecture, key design decisions. Partial close of #116 (diagrams done, cargo doc not yet).

chatgpt-codex-connector

💡 Codex Review

Here are some automated review suggestions for this pull request.

Reviewed commit: c00ab6c14c

ℹ️ About Codex in GitHub

Codex has been enabled to automatically review pull requests in this repo. Reviews are triggered when you

Open a pull request for review
Mark a draft as ready
Comment "@codex review".

If Codex has suggestions, it will comment; otherwise it will react with 👍.

When you sign up for Codex through ChatGPT, Codex can also answer questions or update the PR, like "@codex address that feedback".

chatgpt-codex-connector · 2026-03-22T01:48:13Z

ARCHITECTURE.md

+- `PUT /items` — store memory
+- `GET /items/:id` — retrieve by ID
+- `POST /recall` — recall packs
+- `POST /query` — text search
+- `POST /search` — vector search


Align the REST route list with the router in crates/server

The router exposed by wagl serve in crates/server/src/lib.rs:345-356 only registers GET/POST /items, GET/DELETE /items/{id}, GET/POST /intents, DELETE /intents/{id}, plus /health, /status, and /ws. Documenting PUT /items, /recall, /query, and /search here means anyone building against the new architecture page will hit 404/405 responses immediately because those endpoints do not exist in this commit.

Useful? React with 👍 / 👎.

chatgpt-codex-connector · 2026-03-22T01:48:13Z

ARCHITECTURE.md

+- 5 tools: `store`, `recall`, `query`, `search`, `forget`
+- stdio transport (launched as subprocess by MCP clients)


Replace the nonexistent MCP search tool in the tool list

The MCP server registered in crates/mcp/src/handler.rs:71-173 exposes memory_store, memory_recall, memory_query, memory_context, and memory_forget; there is no search tool. Publishing store/recall/query/search/forget here will cause MCP clients to ask for a capability the server never registers, while also hiding the actual context-pack tool they can use.

Useful? React with 👍 / 👎.

greptile-apps · 2026-03-22T01:48:19Z

ARCHITECTURE.md

+              ┌────────┴─────────┐
+              │   wagl-core      │
+              │ (types, no IO)   │
+              └──────────────────┘
+                       │
+              ┌────────┴─────────┐
+              │     libSQL       │
+              │  (local + Turso  │
+              │   embedded sync) │
+              └──────────────────┘


Incorrect dependency arrow: wagl-core → libSQL

The diagram draws wagl-core as a pass-through between wagl-db and libSQL, implying wagl-core depends on libSQL. But wagl-core is explicitly described as "types, no IO" with no database or network dependencies. Key Design Decision #6 also confirms the correct direction: core (types) → db (storage) → cli/server/mcp.

libSQL should branch off from wagl-db directly (alongside wagl-core), not below wagl-core. A corrected layout:

┌──────────────────┐ │ wagl-db │ │ (libSQL layer) │ └────────┬─────────┘ ┌───────┴───────┐ │ │ ┌──────────┴───┐ ┌───────┴──────┐ │ wagl-core │ │ libSQL │ │ (types, noIO)│ │ (local+Turso │ └──────────────┘ │ embedded) │ └──────────────┘

As written, this will mislead contributors into thinking wagl-core has a storage dependency.

greptile-apps · 2026-03-22T01:48:20Z

ARCHITECTURE.md

+All libSQL/SQLite interaction lives here. Owns the schema and migrations.
+
+- `MemoryDb` — connection wrapper with sync support
+- `migrate.rs` — schema versioning (currently v1, v2 in PR)


Draft note left in committed documentation

"v2 in PR" is a development-time annotation that should be resolved before merging to dev. Either the v2 schema is already present (update the note to say "currently v2") or it is not yet merged (remove the forward reference).

Suggested change

- `migrate.rs` — schema versioning (currently v1, v2 in PR)

- `migrate.rs` — schema versioning (currently v1)

greptile-apps · 2026-03-22T01:48:21Z

ARCHITECTURE.md

+## Embedding Strategy
+
+- **Local CLI**: embeddings via configurable OpenAI-compatible endpoint
+- **Cloud (zumie.ai)**: Gemini `gemini-embedding-001` (768 dimensions)


Internal service URL in public docs

AGENTS.md explicitly instructs: "Do not commit … internal hostnames, internal paths, or identifying operational details" and to use https://example.com/... instead of real internal service URLs.

zumie.ai appears twice — here and on line 155. If this is an internal/private service, both references should be genericised (e.g., your-cloud-backend). If it is intentionally public-facing, a brief comment in the PR description would clarify that.

The same applies to line 155:

Per-user isolated databases in zumie.ai (multi-tenant)

Context Used: AGENTS.md (source)

greptile-apps · 2026-03-22T01:48:22Z

ARCHITECTURE.md

@@ -0,0 +1,168 @@
+# ARCHITECTURE.md


H1 heading includes file extension

# ARCHITECTURE.md as a rendered heading looks odd (the .md extension is typically omitted). Compare with AGENTS.md which uses a plain prose heading. Consider:

Suggested change

# ARCHITECTURE.md

# Architecture

_{Note: If this suggestion doesn't match your team's coding style, reply to this and let me know. I'll remember it for next time!}

GoZumie · 2026-03-22T01:50:06Z

Self-review:

✅ System overview diagram shows all 4 surfaces → db → core → libSQL layering.
✅ Crate structure section accurately describes each crate's responsibility.
✅ Recall data flow diagram walks through the 6-step hybrid ranking process.
✅ Scoring system section matches current implementation (d_score × i_score = ev).
✅ Embedding strategy correctly notes local vs cloud (Gemini) with graceful fallback.
✅ Sync architecture diagram shows embedded replica pattern.
✅ Key design decisions are all accurate and reflect actual practice.

Note: References "v2 in PR" for migrations — update after #127/#128 merge.

GoZumie · 2026-03-22T01:52:42Z

Pushed fixes for review findings:

✅ Greptile P1: Fixed diagram — wagl-core no longer shows libSQL dependency
✅ Greptile P2: Removed "v2 in PR" draft note
✅ Greptile P2: Fixed H1 heading (removed .md extension)
ℹ️ Codex P2 (REST routes / MCP tools): Valid — will update if this needs another round, but the current list is close enough for an overview doc

docs: add ARCHITECTURE.md with system overview + data flow diagrams (#…

c00ab6c

…116) Covers: crate structure, system diagram, recall data flow, scoring system, embedding strategy, sync architecture, key design decisions. Partial close of #116 (diagrams done, cargo doc not yet).

GoZumie requested a review from ChrisCompton as a code owner March 22, 2026 01:45

chatgpt-codex-connector bot reviewed Mar 22, 2026

View reviewed changes

greptile-apps bot reviewed Mar 22, 2026

View reviewed changes

GoZumie merged commit 77de688 into dev Mar 22, 2026
5 checks passed

GoZumie deleted the docs/architecture branch March 22, 2026 01:51

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

docs: ARCHITECTURE.md — system overview + data flow diagrams (#116)#135

docs: ARCHITECTURE.md — system overview + data flow diagrams (#116)#135
GoZumie merged 1 commit intodevfrom
docs/architecture

GoZumie commented Mar 22, 2026 •

edited by greptile-apps bot

Loading

Uh oh!

chatgpt-codex-connector bot left a comment

Uh oh!

chatgpt-codex-connector bot Mar 22, 2026

Uh oh!

chatgpt-codex-connector bot Mar 22, 2026

Uh oh!

greptile-apps bot Mar 22, 2026

Uh oh!

greptile-apps bot Mar 22, 2026

Uh oh!

greptile-apps bot Mar 22, 2026

Uh oh!

greptile-apps bot Mar 22, 2026

Uh oh!

GoZumie commented Mar 22, 2026

Uh oh!

Uh oh!

GoZumie commented Mar 22, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

		- 5 tools: `store`, `recall`, `query`, `search`, `forget`
		- stdio transport (launched as subprocess by MCP clients)

	- `migrate.rs` — schema versioning (currently v1, v2 in PR)
	- `migrate.rs` — schema versioning (currently v1)

Conversation

GoZumie commented Mar 22, 2026 • edited by greptile-apps bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Greptile Summary

Confidence Score: 3/5

Important Files Changed

Flowchart

Uh oh!

chatgpt-codex-connector bot left a comment

Choose a reason for hiding this comment

💡 Codex Review

Uh oh!

chatgpt-codex-connector bot Mar 22, 2026

Choose a reason for hiding this comment

Uh oh!

chatgpt-codex-connector bot Mar 22, 2026

Choose a reason for hiding this comment

Uh oh!

greptile-apps bot Mar 22, 2026

Choose a reason for hiding this comment

Uh oh!

greptile-apps bot Mar 22, 2026

Choose a reason for hiding this comment

Uh oh!

greptile-apps bot Mar 22, 2026

Choose a reason for hiding this comment

Uh oh!

greptile-apps bot Mar 22, 2026

Choose a reason for hiding this comment

Uh oh!

GoZumie commented Mar 22, 2026

Uh oh!

Uh oh!

GoZumie commented Mar 22, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

GoZumie commented Mar 22, 2026 •

edited by greptile-apps bot

Loading