Skip to content

docs(research): optimization-space suite — axes, layer stress-tests, operator playbook#207

Merged
drewstone merged 1 commit into
mainfrom
docs/optimization-space
Jun 9, 2026
Merged

docs(research): optimization-space suite — axes, layer stress-tests, operator playbook#207
drewstone merged 1 commit into
mainfrom
docs/optimization-space

Conversation

@drewstone

Copy link
Copy Markdown
Contributor

The strategy map requested after the steering/GEPA gate series: rethink the layers, stress-test them against the vision docs, and write the product-integration operator playbook.

8 docs under docs/research/ (indexed in the README):

  • optimization-space.md — the 6-axis taxonomy (timescale · target · objective · validity-scope · serving-architecture · authorship) replacing the single-ladder frame; the evidence map showing the program over-sampled one cell (within-run × single-objective × itsm) while the canon's own success criterion (across-run, Gate B) has n=0; the canon-compatibility audit; the ranked experiment portfolio.
  • 6 layer-*.md stress tests — each with its evidence table, strongest objections, and concrete next experiments. Highlights: the within-run boundary law is settled (negative stateless / positive stateful+keep-best); across-run is the priority (the corpus A/B + 4 falsifiers); the multi-objective mandate is the largest practice-vs-canon inconsistency; tool augmentation (+70pp) is the largest effect measured anywhere; Tangle Intelligence is export-only today and is the natural home of the across-run memory (with the server-side judge firewall as the non-negotiable); defineStrategy makes agent-authored strategies feasible (R0→R3 ladder).
  • product-integration-playbook.md — the 8-step wiring for gtm/tax/creative/etc., the operator role table (humans own the deployable checks, thresholds, and the weekly ship decision; the system owns the rest), and the 3 packaging gaps (publish the suite from bench/ to src/, production-trace→corpus inflow, the first product Environment).

Canon verdict: the new framing is compatible with architecture.md/learning-flywheel.md — and the audit forced two corrections onto it (steering is negative-not-null on stateless; the ladder is one path through the axis space). Two documentation-debt items flagged for follow-up, not edited here.

…er stress tests, operator playbook

Answers "does GEPA/steerers/HALO contextualize everything we should think about?" — no.
Reframes the program as a 6-AXIS space (timescale · target · objective · validity scope ·
serving architecture · authorship) instead of a single ladder, maps every cell to its
evidence status, and stress-tests each layer against the canon (architecture.md /
learning-flywheel.md / eval-substrate.md / roadmap-rsi.md).

Index: optimization-space.md — the taxonomy, the evidence map (the program over-sampled
within-run × single-objective × itsm while the canon's own success criterion, the
across-run flywheel, has n=0), the canon-compatibility audit (compatible; two corrections
forced: "steering is NEGATIVE on stateless, not null"; the multi-objective mandate is the
largest practice-vs-canon inconsistency), and the ranked portfolio.

Layer docs: within-run (boundary law settled; topology = the one open lever),
across-run (the corpus A/B design + four falsifiers — THE priority), economics
(lift-per-dollar; tool augmentation +70pp dominates), domain-generality (n=1-domain
exposure; csm/hr replication nearly free), intelligence-serving (Tangle Intelligence is
export-only today; split by timescale — in-loop critic local, across-run memory served;
the server-side judge firewall is non-negotiable), agent-authored (defineStrategy
skillification, R0→R3 ladder, two structural safety properties).

product-integration-playbook.md: the 8-step product wiring (gtm first), the operator
role table (humans own "what good means" + the ship decision; the system owns the rest),
and the three packaging gaps (publish the suite from bench/ to src/, corpus inflow from
production traces, the first product Environment).

@tangletools tangletools left a comment

Copy link
Copy Markdown
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

✅ Auto-approved PR — 754bfb8a

Blanket team auto-approval is enabled for this reviewer service.
The full PR reviewer audit still runs separately and will publish findings if it detects issues.

tangletools · auto-approval · reason: blanket_auto_approve · 2026-06-09T22:43:42Z

@drewstone drewstone merged commit 83c9a80 into main Jun 9, 2026
1 check passed
@drewstone drewstone deleted the docs/optimization-space branch June 9, 2026 22:45
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants