mirror of
https://github.com/affaan-m/everything-claude-code.git
synced 2026-05-14 02:10:07 +08:00
Make the ECC 2.0 GitHub/Linear/handoff/roadmap progress-sync model part of the local observability readiness gate instead of leaving it as roadmap prose only. - add `docs/architecture/progress-sync-contract.md` for GitHub, Linear, handoff, roadmap, and work-items sync - add a `Tracker Sync` check to `scripts/observability-readiness.js` - update observability tests with passing and missing-contract coverage - update observability and GA roadmap docs so the local readiness gate is now 18/18 and records #1848 supply-chain hardening evidence Validation: - node tests/scripts/observability-readiness.test.js (9 passed, 0 failed) - npm run observability:ready -- --format json (18/18, ready true) - npx markdownlint-cli 'docs/architecture/progress-sync-contract.md' 'docs/architecture/observability-readiness.md' 'docs/ECC-2.0-GA-ROADMAP.md' - git diff --check - node tests/docs/ecc2-release-surface.test.js (18 passed) - node tests/run-all.js (2378 passed, 0 failed) - GitHub CI for #1849 green across Ubuntu, Windows, and macOS No release, tag, npm publish, plugin tag, marketplace submission, or announcement was performed.
79 lines
3.6 KiB
Markdown
79 lines
3.6 KiB
Markdown
# ECC 2.0 Observability Readiness
|
|
|
|
ECC 2.0 should be observable before it becomes more autonomous. The local
|
|
default is an opt-in, repo-owned readiness gate that checks whether the core
|
|
signals are present without sending telemetry anywhere.
|
|
|
|
Run:
|
|
|
|
```bash
|
|
npm run observability:ready
|
|
node scripts/observability-readiness.js --format json
|
|
```
|
|
|
|
The gate is deterministic and safe to run in CI. It only checks repository
|
|
files and reports whether the release surface can expose the signals an
|
|
operator needs.
|
|
|
|
## Signal Model
|
|
|
|
- Live status: `scripts/loop-status.js` can emit JSON, watch active loops, and
|
|
write snapshots for dashboards or handoffs.
|
|
- HUD/status contract: `docs/architecture/hud-status-session-control.md` and
|
|
`examples/hud-status-contract.json` define the portable payload for context,
|
|
tool calls, active agents, todos, checks, cost, risk, queues, session
|
|
controls, and tracker sync.
|
|
- Session traces: `scripts/session-inspect.js` can inspect Claude, dmux, and
|
|
adapter-backed sessions, then write canonical snapshots.
|
|
- Harness baseline: `scripts/harness-audit.js` provides a repeatable scorecard
|
|
for tool coverage, context efficiency, quality gates, memory persistence,
|
|
eval coverage, security guardrails, and cost efficiency.
|
|
- Tool activity: `scripts/hooks/session-activity-tracker.js` records local
|
|
`tool-usage.jsonl` events that ECC2 can sync.
|
|
- Risk ledger: `ecc2/src/observability/mod.rs` scores tool calls and stores a
|
|
paginated ledger for review.
|
|
- Progress sync: `docs/architecture/progress-sync-contract.md` defines how
|
|
GitHub, Linear, local handoffs, the repo roadmap, and `scripts/work-items.js`
|
|
stay aligned during merge batches and release-gate reviews.
|
|
|
|
## Reference Pressure
|
|
|
|
The current agent-tooling ecosystem is converging on the same operating needs:
|
|
|
|
- dmux, Orca, and Superset emphasize isolated worktrees plus one place to see
|
|
agent state and merge/review work.
|
|
- Claude HUD makes context, tool activity, agent activity, and todo progress
|
|
visible inside the coding loop.
|
|
- Autocontext records every run as durable traces, reports, artifacts, and
|
|
reusable improvements.
|
|
- Meta-Harness treats the harness itself as something to evaluate and improve,
|
|
which requires clean logs of proposer behavior and outcomes.
|
|
- Zed and OpenCode emphasize agent control surfaces, reviewable changes, and
|
|
harness-specific configuration that should still preserve portable project
|
|
knowledge.
|
|
|
|
ECC's answer is not a hosted analytics dependency by default. The first
|
|
release-candidate gate is local and file-backed. Hosted telemetry can come
|
|
later, but only after the local event model is useful enough to trust.
|
|
|
|
## Operator Workflow
|
|
|
|
1. Run `npm run observability:ready`.
|
|
2. Run `npm run harness:audit -- --format json` for the broader harness
|
|
scorecard.
|
|
3. Run `node scripts/loop-status.js --json --write-dir .ecc/loop-status`
|
|
during longer autonomous batches.
|
|
4. Review `examples/hud-status-contract.json` before wiring a new HUD or
|
|
operator dashboard.
|
|
5. Run `node scripts/session-inspect.js --list-adapters` to confirm which
|
|
session surfaces are available.
|
|
6. Run `node scripts/work-items.js sync-github --repo <owner/repo>` before
|
|
relying on local work-item status for a tracked repository.
|
|
7. Use ECC2 tool logs for risky operations, conflict analysis, and handoff
|
|
review before increasing autonomy.
|
|
|
|
The end-state is practical: before asking ECC to run larger multi-agent loops,
|
|
the operator can prove the system has live status, durable session traces,
|
|
baseline scorecards, a local risk ledger, and a progress-sync contract that
|
|
keeps GitHub, Linear, handoffs, and roadmap evidence from drifting apart.
|