Add release-safety evidence coverage to observability readiness and refresh rc.1 publication gate docs.
4.1 KiB
ECC 2.0 Observability Readiness
ECC 2.0 should be observable before it becomes more autonomous. The local default is an opt-in, repo-owned readiness gate that checks whether the core signals are present without sending telemetry anywhere.
Run:
npm run observability:ready
node scripts/observability-readiness.js --format json
The gate is deterministic and safe to run in CI. It only checks repository files and reports whether the release surface can expose the signals an operator needs.
Signal Model
- Live status:
scripts/loop-status.jscan emit JSON, watch active loops, and write snapshots for dashboards or handoffs. - HUD/status contract:
docs/architecture/hud-status-session-control.mdandexamples/hud-status-contract.jsondefine the portable payload for context, tool calls, active agents, todos, checks, cost, risk, queues, session controls, and tracker sync. - Session traces:
scripts/session-inspect.jscan inspect Claude, dmux, and adapter-backed sessions, then write canonical snapshots. - Harness baseline:
scripts/harness-audit.jsprovides a repeatable scorecard for tool coverage, context efficiency, quality gates, memory persistence, eval coverage, security guardrails, and cost efficiency. - Tool activity:
scripts/hooks/session-activity-tracker.jsrecords localtool-usage.jsonlevents that ECC2 can sync. - Risk ledger:
ecc2/src/observability/mod.rsscores tool calls and stores a paginated ledger for review. - Progress sync:
docs/architecture/progress-sync-contract.mddefines how GitHub, Linear, local handoffs, the repo roadmap, andscripts/work-items.jsstay aligned during merge batches and release-gate reviews. - Release safety:
docs/releases/2.0.0-rc.1/publication-readiness.md, post-hardening evidence, supply-chain incident response, workflow-security validation, npm pack checks, and release-surface tests must be present before any public tag, package publish, plugin submission, or announcement action.
Reference Pressure
The current agent-tooling ecosystem is converging on the same operating needs:
- dmux, Orca, and Superset emphasize isolated worktrees plus one place to see agent state and merge/review work.
- Claude HUD makes context, tool activity, agent activity, and todo progress visible inside the coding loop.
- Autocontext records every run as durable traces, reports, artifacts, and reusable improvements.
- Meta-Harness treats the harness itself as something to evaluate and improve, which requires clean logs of proposer behavior and outcomes.
- Zed and OpenCode emphasize agent control surfaces, reviewable changes, and harness-specific configuration that should still preserve portable project knowledge.
ECC's answer is not a hosted analytics dependency by default. The first release-candidate gate is local and file-backed. Hosted telemetry can come later, but only after the local event model is useful enough to trust.
Operator Workflow
- Run
npm run observability:ready. - Run
npm run harness:audit -- --format jsonfor the broader harness scorecard. - Run
node scripts/loop-status.js --json --write-dir .ecc/loop-statusduring longer autonomous batches. - Review
examples/hud-status-contract.jsonbefore wiring a new HUD or operator dashboard. - Run
node scripts/session-inspect.js --list-adaptersto confirm which session surfaces are available. - Run
node scripts/work-items.js sync-github --repo <owner/repo>before relying on local work-item status for a tracked repository. - Use ECC2 tool logs for risky operations, conflict analysis, and handoff review before increasing autonomy.
- Re-run the release-safety evidence checks before any public release action: publication readiness, supply-chain incident response, workflow-security validation, package surface, and release-surface tests.
The end-state is practical: before asking ECC to run larger multi-agent loops, the operator can prove the system has live status, durable session traces, baseline scorecards, a local risk ledger, and a progress-sync contract that keeps GitHub, Linear, handoffs, and roadmap evidence from drifting apart.