claw-code/OPT_OUT_AUDIT.md
YeonGyu-Kim 3262cb3a87 docs: OPT_OUT_DEMAND_LOG.md — evidentiary base for governance decisions
Cycle #21 ships governance infrastructure, not implementation. Maintainership
mode means sometimes the right deliverable is a decision framework, not code.

Problem context:
OPT_OUT_AUDIT.md (cycle #18 bonus) established 'demand-backed audit' as the
next step. But without a structured way to record demand signals, 'demand-backed'
was just a slogan — the next audit cycle would have no evidence to work from.

This commit creates the evidentiary base:

New file: OPT_OUT_DEMAND_LOG.md
- Per-surface entries for all 12 OPT_OUT commands (Groups A/B/C)
- Current state: 0 signals across all surfaces (consistent with audit prediction)
- Signal entry template with required fields:
  - Source (who/what)
  - Use case (concrete orchestration problem)
  - Markdown-alternative-checked (why existing output insufficient)
  - Date
- Promotion thresholds:
  - 2+ independent signals for same surface → file promotion pinpoint
  - 1 signal + existing stable schema → file pinpoint for discussion
  - 0 signals → stays OPT_OUT (rationale preserved)

Decision framework for cycle #22 (audit close):
- If 0 signals total: move to PERMANENTLY_OPT_OUT, close audit
- If 1-2 signals: file individual promotion pinpoints with evidence
- If 3+ signals: reopen audit, question classification itself
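Read as pseudocode, the framework above is a three-way mapping from total signal count to outcome. A minimal sketch — the function and label names are illustrative, not part of the repo:

```python
def audit_decision(total_signals: int) -> str:
    """Map the cycle-#22 total demand-signal count to an audit outcome."""
    if total_signals == 0:
        return "PERMANENTLY_OPT_OUT"       # close the audit
    if total_signals <= 2:
        return "FILE_PROMOTION_PINPOINTS"  # individual pinpoints with evidence
    return "REOPEN_AUDIT"                  # question the classification itself
```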

Updated files:
- OPT_OUT_AUDIT.md: Added demand log reference in Related section
- CLAUDE.md: Added prerequisites for promotions (must have logged signals),
  added 'File a demand signal' workflow section

Philosophy:
'Prevent speculative expansion' — the discipline that protects against schema bloat.
Every new CLAWABLE surface is a maintenance tax. Evidence requirement keeps
the protocol lean. OPT_OUT surfaces are intentionally not-clawable until
proven otherwise by external demand.

Operational impact:
Next cycles can now:
1. Watch for real claws hitting OPT_OUT surface limits
2. Log signals in structured format (no ad-hoc filing)
3. Run audit at cycle #22 with actual data, not speculation

No code changes. No test changes. Pure governance infrastructure.

Related: cycle #18 (OPT_OUT_AUDIT.md), maintainership phase transition.
2026-04-22 20:34:35 +09:00


OPT_OUT Surface Audit Roadmap

Status: Pre-audit (decision table ready, survey pending)

This document governs the audit and potential promotion of 12 OPT_OUT surfaces (commands that currently do not support --output-format json).

OPT_OUT Classification Rationale

A surface is classified as OPT_OUT when at least one of the following holds:

  1. Human-first by nature: Rich Markdown prose / diagrams / structured text where JSON would be information loss
  2. Query-filtered alternative exists: Commands with internal --query / --limit don't need JSON (users already have an escape hatch)
  3. Simulation/debug only: Not meant for production orchestration (e.g., mode simulators)
  4. Future JSON work is planned: Documented in ROADMAP with clear upgrade path
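Since each group below rests on a single condition, the classification reads as an any-check. A hedged sketch — all names here are illustrative, not code from the repo:

```python
from dataclasses import dataclass

@dataclass
class Surface:
    """Illustrative model of one CLI surface's OPT_OUT traits."""
    human_first: bool = False      # rich Markdown prose / diagrams
    query_filtered: bool = False   # internal --query / --limit escape hatch
    simulation_only: bool = False  # debug / mode simulator
    json_planned: bool = False     # ROADMAP documents an upgrade path

def is_opt_out(s: Surface) -> bool:
    """A surface is OPT_OUT if at least one condition holds."""
    return any([s.human_first, s.query_filtered,
                s.simulation_only, s.json_planned])
```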

OPT_OUT Surfaces (12 Total)

Group A: Rich-Markdown Reports (4 commands)

Rationale: These emit structured narrative prose. JSON would require lossy serialization.

| Command | Output | Current use | JSON case |
|---|---|---|---|
| summary | Multi-section workspace summary (Markdown) | Human readability | Not applicable; Markdown is the output |
| manifest | Workspace manifest with project tree (Markdown) | Human readability | Not applicable; Markdown is the output |
| parity-audit | TypeScript/Python port comparison report (Markdown) | Human readability | Not applicable; Markdown is the output |
| setup-report | Preflight + startup diagnostics (Markdown) | Human readability | Not applicable; Markdown is the output |

Audit decision: These likely remain OPT_OUT long-term (Markdown-as-output is intentional). If a JSON version is needed in the future, it would be a separate --output-format json path generating structured data (project summary object, manifest array, audit deltas, setup checklist) — but that is a new contract, not an addition to the existing Markdown surfaces.

Pinpoint: #175 (deferred) — audit whether summary/manifest should emit JSON structured versions in parallel with Markdown, or if Markdown-only is the right UX.


Group B: List Commands with Query Filters (3 commands)

Rationale: These already support --query and --limit for filtering. JSON output would be redundant; users can narrow with --query and post-process the text output with standard line-oriented tools.

| Command | Filtering | Current output | JSON case |
|---|---|---|---|
| subsystems | --limit | Human-readable list | Use --query to filter; users can parse if needed |
| commands | --query, --limit, --no-plugin-commands, --no-skill-commands | Human-readable list | Use --query to filter; users can parse if needed |
| tools | --query, --limit, --simple-mode | Human-readable list | Use --query to filter; users can parse if needed |

Audit decision: --query / --limit are already the machine-friendly escape hatch. These commands are intentionally list-filter-based (not orchestration-primary). Promoting to CLAWABLE would require:

  1. Formalizing what the structured output is (command array? tool array?)
  2. Versioning the schema per command
  3. Updating tests to validate per-command schemas

Cost-benefit: Low. Users who need structured data can already use --query to narrow results, then parse. Effort to promote > value.

Pinpoint: #176 (backlog) — audit --query UX; consider if a --query-json escape hatch (output JSON of matching items) is worth the schema tax.
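For illustration, the parse-it-yourself escape hatch might look like this. The two-column output shape shown is an assumption, not the CLI's documented format:

```python
def parse_list_output(text: str) -> list[str]:
    """Pull entry names out of a human-readable, --query-narrowed list."""
    names = []
    for line in text.splitlines():
        line = line.strip()
        if line:                           # skip blank lines
            names.append(line.split()[0])  # first column = entry name
    return names

sample = (
    "summary       Multi-section workspace summary\n"
    "manifest      Workspace manifest with project tree\n"
)
print(parse_list_output(sample))  # ['summary', 'manifest']
```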


Group C: Simulation / Debug Surfaces (5 commands)

Rationale: These are intentionally not production-orchestrated. They simulate behavior, test modes, or debug scenarios. JSON output doesn't add value.

| Command | Purpose | Output | Use case |
|---|---|---|---|
| remote-mode | Simulate remote execution | Text (mock session) | Testing harness behavior under remote constraints |
| ssh-mode | Simulate SSH execution | Text (mock SSH session) | Testing harness behavior over SSH-like transport |
| teleport-mode | Simulate teleport hop | Text (mock hop session) | Testing harness behavior with teleport bouncing |
| direct-connect-mode | Simulate direct network | Text (mock session) | Testing harness behavior with direct connectivity |
| deep-link-mode | Simulate deep-link invocation | Text (mock deep-link) | Testing harness behavior from URL/deeplink |

Audit decision: These are intentionally simulation-only. Promoting to CLAWABLE would mean:

  1. "This simulated mode is now a valid orchestration surface"
  2. Need to define what JSON output means (mock session state? simulation log?)
  3. Need versioning + test coverage

Cost-benefit: Very low. These are debugging tools, not orchestration endpoints. Effort to promote >> value.

Pinpoint: #177 (backlog) — decide if mode simulators should ever be CLAWABLE (probably no).


Audit Workflow (Future Cycles)

For each surface:

  1. Survey: Check if any external claw actually uses --output-format with this surface
  2. Cost estimate: How much schema work + testing?
  3. Value estimate: How much demand for JSON version?
  4. Decision: CLAWABLE, remain OPT_OUT, or new pinpoint?
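One way to read the four steps as a decision function — the weighting and names are assumptions for illustration; the real audit is a manual review:

```python
def audit_surface(external_uses: int, cost: int, value: int) -> str:
    """Collapse survey results and estimates into one of three outcomes.

    external_uses: surveyed claws using --output-format on this surface
    cost: schema + testing effort estimate (arbitrary units)
    value: demand for a JSON version (same units)
    """
    if external_uses == 0:
        return "REMAIN_OPT_OUT"       # no demand, nothing to weigh
    if value > cost:
        return "PROMOTE_TO_CLAWABLE"
    return "NEW_PINPOINT"             # demand exists but cost dominates
```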

Promotion criteria (if promoting to CLAWABLE):

A surface moves from OPT_OUT → CLAWABLE only if:

  • Clear use case for JSON (not just "hypothetically could be JSON")
  • Schema is simple and stable (not 20+ fields)
  • At least one external claw has requested it
  • Tests can be added without major refactor
  • Maintainability burden is worth the value

Retention criteria (if staying OPT_OUT):

A surface stays OPT_OUT if:

  • JSON would be information loss (Markdown reports)
  • Equivalent filtering already exists (--query / --limit)
  • Use case is simulation/debug, not production
  • Promotion effort > value to users
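Taken together, promotion is an all-check and staying OPT_OUT is an any-check. A hedged sketch with illustrative parameter names:

```python
def may_promote(clear_use_case: bool, simple_stable_schema: bool,
                external_request: bool, tests_without_refactor: bool,
                burden_worth_value: bool) -> bool:
    """OPT_OUT -> CLAWABLE only when every criterion is met."""
    return all([clear_use_case, simple_stable_schema, external_request,
                tests_without_refactor, burden_worth_value])

def stays_opt_out(information_loss: bool, filtering_exists: bool,
                  simulation_only: bool, effort_exceeds_value: bool) -> bool:
    """Any single retention criterion keeps the surface OPT_OUT."""
    return any([information_loss, filtering_exists,
                simulation_only, effort_exceeds_value])
```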

Post-Audit Outcomes

Likely scenario (high confidence)

Group A (Markdown reports): Remain OPT_OUT

  • summary, manifest, parity-audit, setup-report are intentionally human-first
  • If JSON-like structure is needed in future, would be separate *-json commands or distinct --output-format, not added to Markdown surfaces

Group B (List filters): Remain OPT_OUT

  • subsystems, commands, tools have --query / --limit as query layer
  • Users who need structured data already have escape hatch

Group C (Mode simulators): Remain OPT_OUT

  • remote-mode, ssh-mode, etc. are debug tools, not orchestration endpoints
  • No demand for JSON version; promotion would be forced, not driven

Expected result: the OPT_OUT audit concludes that 12/12 surfaces remain OPT_OUT (no promotions).

If demand emerges

If external claws report needing JSON from any OPT_OUT surface:

  1. File pinpoint with use case + rationale
  2. Estimate cost + value
  3. If value > cost, promote to CLAWABLE with full test coverage
  4. Update SCHEMAS.md
  5. Update CLAUDE.md

Timeline

  • Post-#174 (now): OPT_OUT audit documented (this file)
  • Cycles #19–#21 (deferred): Survey period — collect data on external demand
  • Cycle #22 (deferred): Final audit decision + any promotions
  • Post-audit: Move to protocol maintenance mode (new commands/fields/surfaces)

Related

  • OPT_OUT_DEMAND_LOG.md — Active survey recording real demand signals (evidentiary base for any promotion decision)
  • SCHEMAS.md — Clawable surface contracts
  • CLAUDE.md — Development guidance
  • test_cli_parity_audit.py — Parametrized tests for CLAWABLE_SURFACES enforcement
  • ROADMAP.md — Macro phases (this audit is Phase 3 before Phase 2 closure)