|
8 | 8 | # ├── @create-expert/test-expert — single query execution (pure executor, no evaluation) |
9 | 9 | # └── @create-expert/verify-test — artifact inspection + execution + definition review |
10 | 10 | # ============================================================================= |
| 11 | +# |
| 12 | +# ============================================================================= |
| 13 | +# Design Principles |
| 14 | +# |
| 15 | +# 1. Built-in Verification |
| 16 | +# - Verifier EXECUTES and TESTS artifacts — it is not a code reviewer. |
| 17 | +# - One verifier per team, not one per executor. |
| 18 | +# - Verifier must be direct child of coordinator, not nested under executor. |
| 19 | +# - Verifier needs `exec` in pick list. Without it, verification degrades |
| 20 | +# to file reading, which cannot catch runtime failures. |
| 21 | +# |
| 22 | +# 2. Instruction Quality via Binary Checks |
| 23 | +# - Subjective self-checks ("would removing this make output worse?") |
| 24 | +# do not work — the LLM always judges its own output as necessary. |
| 25 | +# - Binary checks work: "code block present?" "library name present?" |
| 26 | +# have unambiguous yes/no answers with clear remediation actions. |
| 27 | +# - Pattern: structural checks (delegates array, pick list) have always |
| 28 | +# been followed. Content checks must follow the same binary pattern. |
| 29 | +# |
| 30 | +# 3. Domain Agnosticism |
| 31 | +# - create-expert must produce experts for ANY domain — coding, writing, |
| 32 | +# research, design, operations — not just software development. |
| 33 | +# - Rules and examples in the pipeline must use domain-neutral language. |
| 34 | +# - Coding-specific vocabulary in rules biases generated experts toward |
| 35 | +# code-only solutions even when the domain is broader. |
| 36 | +# |
| 37 | +# 4. Plan ↔ write-definition Boundary |
| 38 | +# - Plan outputs role assignments and domain knowledge, NOT instruction |
| 39 | +# content. Architecture section = tree structure only (name, one-line |
| 40 | +# purpose, role). Domain Knowledge section = raw material for |
| 41 | +# instructions. |
| 42 | +# - write-definition composes instructions by selecting relevant domain |
| 43 | +# constraints per expert. It must NOT copy plan details wholesale. |
| 44 | +# - Without this boundary, plan bloat leaks directly into instructions. |
| 45 | +# |
| 46 | +# 5. Failure Conditions |
| 47 | +# - Not the inverse of success criteria — they are hard reject rules |
| 48 | +# derived from deeply understanding the domain. |
| 49 | +# - Each must specify: what is wrong, which expert caused it, and |
| 50 | +# where to restart. |
| 51 | +# - These go into the verifier's instruction so it knows what to reject. |
| 52 | +# |
| 53 | +# 6. Instruction Content = Domain Constraints Only |
| 54 | +# - An instruction should contain ONLY what the LLM cannot derive on |
| 55 | +# its own: business rules, quality bars, domain-specific anti-patterns, |
| 56 | +# completion criteria, priority rules. |
| 57 | +# - Everything else — implementation techniques, library choices, file |
| 58 | +# structures, well-known algorithms — is noise that dilutes the signal. |
| 59 | +# - Enforced by a 6-item binary checklist (no code blocks, no library |
| 60 | +# names, no file paths, no procedures, no technique explanations, |
| 61 | +# ≤15 lines for non-coordinator experts). |
| 62 | +# |
| 63 | +# 7. Brevity = Focus |
| 64 | +# - Verbose instructions dilute LLM attention. The more you write, the |
| 65 | +# less each sentence weighs. Short, dense instructions outperform |
| 66 | +# long, thorough ones because the LLM concentrates on what matters. |
| 67 | +# - This applies at every level: plan.md, instruction fields, and this |
| 68 | +# pipeline's own definitions. If it can be said in fewer words, it |
| 69 | +# should be. |
| 70 | +# ============================================================================= |
11 | 71 |
|
12 | 72 | # ============================================================================= |
13 | 73 | # create-expert — Coordinator |
|
0 commit comments