Skip to content

Commit 2680aff

Browse files
FL4TLiN3claude
andauthored
docs: add design principles to create-expert perstack.toml (#775)
Document the seven design principles established through iterative refinement of the create-expert pipeline: 1. Built-in Verification (verifier executes, not reviews) 2. Instruction Quality via Binary Checks (subjective checks fail) 3. Domain Agnosticism (not just coding) 4. Plan ↔ write-definition Boundary (plan = raw material) 5. Failure Conditions (hard reject rules, not inverse of success) 6. Instruction Content = Domain Constraints Only 7. Brevity = Focus (verbose dilutes LLM attention) Co-authored-by: Claude Opus 4.6 <noreply@anthropic.com>
1 parent 3cbbf80 commit 2680aff

File tree

1 file changed

+60
-0
lines changed

1 file changed

+60
-0
lines changed

definitions/create-expert/perstack.toml

Lines changed: 60 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -8,6 +8,66 @@
88
# ├── @create-expert/test-expert — single query execution (pure executor, no evaluation)
99
# └── @create-expert/verify-test — artifact inspection + execution + definition review
1010
# =============================================================================
11+
#
12+
# =============================================================================
13+
# Design Principles
14+
#
15+
# 1. Built-in Verification
16+
# - Verifier EXECUTES and TESTS artifacts — it is not a code reviewer.
17+
# - One verifier per team, not one per executor.
18+
# - Verifier must be direct child of coordinator, not nested under executor.
19+
# - Verifier needs `exec` in pick list. Without it, verification degrades
20+
# to file reading, which cannot catch runtime failures.
21+
#
22+
# 2. Instruction Quality via Binary Checks
23+
# - Subjective self-checks ("would removing this make output worse?")
24+
# do not work — the LLM always judges its own output as necessary.
25+
# - Binary checks work: "code block present?" "library name present?"
26+
# have unambiguous yes/no answers with clear remediation actions.
27+
# - Pattern: structural checks (delegates array, pick list) have always
28+
# been followed. Content checks must follow the same binary pattern.
29+
#
30+
# 3. Domain Agnosticism
31+
# - create-expert must produce experts for ANY domain — coding, writing,
32+
# research, design, operations — not just software development.
33+
# - Rules and examples in the pipeline must use domain-neutral language.
34+
# - Coding-specific vocabulary in rules biases generated experts toward
35+
# code-only solutions even when the domain is broader.
36+
#
37+
# 4. Plan ↔ write-definition Boundary
38+
# - Plan outputs role assignments and domain knowledge, NOT instruction
39+
# content. Architecture section = tree structure only (name, one-line
40+
# purpose, role). Domain Knowledge section = raw material for
41+
# instructions.
42+
# - write-definition composes instructions by selecting relevant domain
43+
# constraints per expert. It must NOT copy plan details wholesale.
44+
# - Without this boundary, plan bloat leaks directly into instructions.
45+
#
46+
# 5. Failure Conditions
47+
# - Not the inverse of success criteria — they are hard reject rules
48+
# derived from deeply understanding the domain.
49+
# - Each must specify: what is wrong, which expert caused it, and
50+
# where to restart.
51+
# - These go into the verifier's instruction so it knows what to reject.
52+
#
53+
# 6. Instruction Content = Domain Constraints Only
54+
# - An instruction should contain ONLY what the LLM cannot derive on
55+
# its own: business rules, quality bars, domain-specific anti-patterns,
56+
# completion criteria, priority rules.
57+
# - Everything else — implementation techniques, library choices, file
58+
# structures, well-known algorithms — is noise that dilutes the signal.
59+
# - Enforced by a 6-item binary checklist (no code blocks, no library
60+
# names, no file paths, no procedures, no technique explanations,
61+
# ≤15 lines for non-coordinator experts).
62+
#
63+
# 7. Brevity = Focus
64+
# - Verbose instructions dilute LLM attention. The more you write, the
65+
# less each sentence weighs. Short, dense instructions outperform
66+
# long, thorough ones because the LLM concentrates on what matters.
67+
# - This applies at every level: plan.md, instruction fields, and this
68+
# pipeline's own definitions. If it can be said in fewer words, it
69+
# should be.
70+
# =============================================================================
1171

1272
# =============================================================================
1373
# create-expert — Coordinator

0 commit comments

Comments
 (0)