claw-code

mirror of https://github.com/ultraworkers/claw-code.git synced 2026-04-24 13:08:11 +08:00

Author	SHA1	Message	Date
YeonGyu-Kim	71e77290b9	ROADMAP #105 : claw status ignores .claw.json model, doctor mislabels alias as Resolved, 4 surfaces disagree Dogfooded 2026-04-18 on main HEAD 6580903 from /tmp/cdZ. .claw.json with {"model":"haiku"} produces: claw status → model: 'claude-opus-4-6' (DEFAULT_MODEL, config ignored) claw doctor → 'Resolved model haiku' (raw alias, label lies) turn dispatch → claude-haiku-4-5-20251213 (actually-resolved canonical) ANTHROPIC_MODEL=sonnet → status still says claude-opus-4-6 FOUR separate understandings of 'active model': 1. config file (alias as written) 2. doctor (alias mislabeled as 'Resolved') 3. status (hardcoded DEFAULT_MODEL ignoring config entirely) 4. turn dispatch (canonical, alias-resolved, what turns actually use) Trace: main.rs:59 DEFAULT_MODEL const = claude-opus-4-6 main.rs:400 parse_args starts model = DEFAULT_MODEL main.rs:753 Status dispatch: model.to_string() — never calls resolve_repl_model, never reads config or env main.rs:1125 resolve_repl_model: source of truth for actual model, consults ANTHROPIC_MODEL env + config + alias table. Called from Prompt and Repl dispatch. NOT from Status. main.rs:1701 check_config_health: 'Resolved model {model}' where model is raw configured string, not resolved. Label says Resolved, value is pre-resolution alias. Orchestration hazard: a claw picks tool strategy based on status.model assuming it reflects what turns will use. Status lies: always reports DEFAULT_MODEL unless --model flag was passed. Config and env var completely ignored by status. Fix shape (~30 lines): - call resolve_repl_model from print_status_snapshot - add effective_model field to status JSON (or rename/enrich) - fix doctor 'Resolved model' label (either rename to 'Configured' or actually alias-resolve before emitting) - honor ANTHROPIC_MODEL env in status - regression tests per model source with cross-surface equality Joins truth-audit (#80-#84, #86, #87, #89, #100, #102, #103). Joins two-paths-diverge (#91, #101, #104) — now 4-way with #105. Joins doctor-surface-coverage triangle (#100 + #102 + #105). Filed in response to Clawhip pinpoint nudge 1494819785676947543 in #clawcode-building-in-public.	2026-04-18 07:08:25 +09:00
YeonGyu-Kim	6580903d20	ROADMAP #104 : /export and claw export are two paths with incompatible filename semantics; slash silently .txt-rewrites Dogfooded 2026-04-18 on main HEAD 7447232 from /tmp/cdY. Two-path-diverge problem: A. /export slash command (resolve_export_path at main.rs:5990-6010): - If extension != 'txt', silently appends '.txt' - /export foo.md → writes foo.md.txt - /export report.json → writes report.json.txt - cwd.join(relative_path_with_dotdot) resolves outside cwd - No path-traversal rejection B. claw export CLI (run_export at main.rs:6021-6055): - fs::write(path, &markdown) directly, no suffix munging - /tmp/cli-export.md → writes /tmp/cli-export.md - Also no path-traversal check, absolute paths write wherever Same logical action, incompatible output contracts. A claw that switches between /export and claw export sees different output filenames for the same input. Compounded: - Content is Markdown (render_session_markdown emits '# Conversation Export', '## 1. User', fenced code blocks) but slash path forces .txt extension → content/extension mismatch. File-routing pipelines (archival by extension, syntax highlight, preview) misclassify. - --help says just '/export [file]'. No mention of .txt forcing, no mention of path-resolution semantics. - Claw pipelines that glob *.md won't find /export outputs. Trace: main.rs:5990 resolve_export_path: extension check + conditional .txt append main.rs:6021 run_export: fs::write direct, no path munging main.rs:5975 default_export_filename: hardcodes .txt fallback Content renderer is Markdown (render_session_markdown:6075) Fix shape (~70 lines): - unify both paths via shared export_session_to_path helper - respect caller's extension (pick renderer by extension or accept that content is Markdown and name accordingly) - path-traversal policy decision: restrict to project root or allow-with-warning - --help: document suffix preservation + path semantics - regression tests for extension preservation + dotdot rejection Joins silent-flag cluster (#96-#101) on silent-rewrite axis. New two-paths-diverge sub-cluster: #91 (permission-mode parser disagree) + #101 (CLI vs env asymmetry) + #104 (slash vs CLI export asymmetry) — three instances of parallel entry points doing subtly different things. Natural bundles: #91 + #101 + #104 (two-paths-diverge trio), #96 + #98 + #99 + #101 + #104 (silent-rewrite-or-noop quintet). Filed in response to Clawhip pinpoint nudge 1494812230372294849 in #clawcode-building-in-public.	2026-04-18 06:34:38 +09:00
YeonGyu-Kim	7447232688	ROADMAP #103 : claw agents silently drops every non-.toml file; claude-code convention .md files ignored, no content validation Dogfooded 2026-04-18 on main HEAD 6a16f08 from /tmp/cdX. Two-part gap on agent subsystem: 1. File-format gate silently discards .md (YAML frontmatter): commands/src/lib.rs:3180-3220 load_agents_from_roots filters extension() != 'toml' and silently continues. No log, no warn. .claw/agents/foo.md → agents list count: 0, doctor: ok. Same file renamed to .toml → discovered instantly. 2. No content validation inside accepted .toml: model='nonexistent/model-that-does-not-exist' → accepted. tools=['DoesNotExist', 'AlsoFake'] → accepted. reasoning_effort string → unvalidated. No check against model registry, tool registry, or reasoning-effort enum — all machinery exists elsewhere (#97 validates tools for --allowedTools flag). Compounded: - agents help JSON lists sources but NOT accepted file formats. Operators have zero documentation-surface way to diagnose 'why does my .md file not work?' - Doctor check set has no agents check. 3 files present with 1 silently skipped → summary: 'ok'. - Skills use .md (SKILL.md). MCP uses .json (.claw.json). Agents uses .toml. Three subsystems, three formats, no cross-subsystem consistency or documentation. - Claude Code convention is .md with YAML frontmatter. Migrating operators copy that and silently fail. Fix shape (~100 lines): - accept .md with YAML frontmatter via existing parse_skill_frontmatter helper - validate model/tools/reasoning_effort against existing registries; emit status: 'invalid' + validation_errors instead of silently accepting - agents list summary.skipped: [{path, reason}] - add agents doctor check (total/active/skipped/invalid) - agents help: accepted_formats list Joins truth-audit (#80-#84, #86, #87, #89, #100, #102) on silent-ok-while-ignoring axis. Joins silent-flag (#96-#101) at subsystem scale. Joins unplumbed-subsystem (#78, #96, #100, #102) as 5th unreachable surface: load_agents_from_roots present, parse_skill_frontmatter present, validation helpers present, agents path calls none of them. Also opens new 'Claude Code migration parity' cross-cluster: claw-code silently breaks the expected convention migration path for a first-class subsystem. Natural bundles: #102 + #103 (subsystem-doctor-coverage), #78 + #96 + #100 + #102 + #103 (unplumbed-surface quintet). Filed in response to Clawhip pinpoint nudge 1494804679962661187 in #clawcode-building-in-public.	2026-04-18 06:03:22 +09:00
YeonGyu-Kim	6a16f0824d	ROADMAP #102 : mcp list/show/doctor surface MCP config-time only; no preflight, no liveness, not even command-exists check Dogfooded 2026-04-18 on main HEAD eabd257 from /tmp/cdW2. A .claw.json pointing at command='/does/not/exist' as an MCP server cheerfully reports: mcp show unreachable → found: true mcp list → configured_servers: 1, status field absent doctor → config: ok, MCP servers: 1, has_failures: false The broken server is invisible until agent tries to call a tool from it mid-turn — burning tokens on failed tool call and forcing retry loop. Trace: main.rs:1701-1780 check_config_health counts via runtime_config.mcp().servers().len() No which(). No TcpStream::connect(). No filesystem touch. render_doctor_report has 6 checks (auth/config/install_source/ workspace/sandbox/system). No check_mcp_health exists. commands/src/lib.rs mcp list/show emit config-side repr only. No status field, no reachable field, no startup_state. runtime/mcp_stdio.rs HAS startup machinery with error types, but only invoked at turn-execution time — too late for preflight. Roadmap prescribes this exact surface: - Phase 1 §3.5 Boot preflight / doctor contract explicitly lists 'MCP config presence and server reachability expectations' - Phase 2 §4 canonical lane event schema includes lane.ready - Phase 4.4.4 event provenance / environment labeling - Product Principle #5 'Partial success is first-class' — 'MCP startup can succeed for some servers and fail for others, with structured degraded-mode reporting' All four unimplementable without preflight + per-server status. Fix shape (~110 lines): - check_mcp_health: which(command) for stdio, 1s TcpStream connect for http/sse. Aggregate ok/warn/fail with per-server detail lines. - mcp list/show: add status field (configured/resolved/command_not_found/connect_refused/ startup_failed). --probe flag for deeper handshake. - doctor top-level: degraded_mode: bool, startup_summary. - Wire preflight into prompt/repl bootstrap; emit one-time mcp_preflight event. Joins unplumbed-subsystem cross-cluster (#78, #100, #102) — subsystem exists, diagnostic surface JSON-invisible. Joins truth-audit (#80-#84, #86, #87, #89, #100) — doctor: ok lies when MCP broken. Natural bundle: #78 + #96 + #100 + #102 unplumbed-surface quartet. Also #100 + #102 as pure doctor-surface-coverage 2-way. Filed in response to Clawhip pinpoint nudge 1494797126041862285 in #clawcode-building-in-public.	2026-04-18 05:34:30 +09:00
YeonGyu-Kim	eabd257968	ROADMAP #101 : RUSTY_CLAUDE_PERMISSION_MODE env var silently fails OPEN to danger-full-access on any invalid value Dogfooded 2026-04-18 on main HEAD d63d58f from /tmp/cdV. Qualitatively worse than #96-#100 silent-flag class because this is fail-OPEN, not fail-inert: operator intent 'restrict this lane' silently becomes 'full access.' Tested matrix: VALID → correct mode: read-only → read-only workspace-write → workspace-write danger-full-access → danger-full-access ' read-only ' → read-only (trim works) INVALID → silent danger-full-access: '' → danger-full-access 'readonly' → danger-full-access (typo: missing hyphen) 'read_only' → danger-full-access (typo: underscore) 'READ-ONLY' → danger-full-access (case) 'ReadOnly' → danger-full-access (case) 'dontAsk' → danger-full-access (config alias not recognized by env parser, but ultimate default happens to be dfa) 'garbage' → danger-full-access (pure garbage) 'readonly\n' → danger-full-access CLI asymmetry: --permission-mode readonly → loud structured error. Same misspelling, same input, opposite outcomes via env vs CLI. Trace: main.rs:1099-1107 default_permission_mode: env::var(...).ok().and_then(normalize_permission_mode) .or_else(config...).unwrap_or(DangerFullAccess) → .and_then drops error context on invalid; .unwrap_or fail-OPEN to most permissive mode main.rs:5455-5462 normalize_permission_mode accepts 3 canonical; runtime/config.rs:855-863 parse_permission_mode_label accepts 7 including config aliases (default/plan/acceptEdits/auto/dontAsk). Two parsers, disagree on accepted set, no shared source of truth. Plus: env var RUSTY_CLAUDE_PERMISSION_MODE is UNDOCUMENTED. grep of README/docs/help returns zero hits. Fix shape (~60 lines total): - rewrite default_permission_mode to surface invalid values via Result - share ONE parser across CLI/config/env (extract from config.rs:855) - decide broad (7 aliases) vs narrow (3 canonical) accepted set - document the env var in --help Environment section - add doctor check surfacing permission_mode.source attribution - optional: rename to CLAW_PERMISSION_MODE with deprecation alias Joins permission-audit sweep (#50/#87/#91/#94/#97/#101) on the env axis. Completes the three-way input-surface audit: CLI + config + env. Cross-cluster with silent-flag #96-#100 (worse variant: fail-OPEN) and truth-audit (#80-#87, #89, #100) (operator can't verify source). Natural 6-way bundle: #50 + #87 + #91 + #94 + #97 + #101 closes the entire permission-input attack surface in one pass. Filed in response to Clawhip pinpoint nudge 1494789577687437373 in #clawcode-building-in-public.	2026-04-18 05:04:28 +09:00
YeonGyu-Kim	d63d58f3d0	ROADMAP #100 : claw status/doctor JSON expose no commit identity; stale-base subsystem unplumbed Dogfooded 2026-04-18 on main HEAD 63a0d30 from /tmp/cdU + /tmp/cdO*. Three-fold gap: 1. status/doctor JSON workspace object has 13 fields; none of them contain: head_sha, head_short_sha, expected_base, base_source, stale_base_state, upstream, ahead, behind, merge_base, is_detached, is_bare, is_worktree. A claw cannot answer 'is this lane at the expected base?' from the JSON surface alone. 2. --base-commit flag is silently accepted by status/doctor/sandbox/ init/export/mcp/skills/agents and silently dropped on dispatch. Same silent-no-op class as #98. A claw running 'claw --base-commit $expected status' gets zero effect — flag parses into a local, discharged at dispatch. 3. runtime::stale_base subsystem is FULLY implemented with 30+ tests (BaseCommitState, BaseCommitSource, resolve_expected_base, read_claw_base_file, check_base_commit, format_stale_base_warning). run_stale_base_preflight at main.rs:3058 calls it from Prompt/Repl only, writes output to stderr as human prose. .claw-base file is honored internally but invisible to status/doctor JSON. Complete implementation, wrong dispatch points. Plus: detached HEAD reported as magic string 'git_branch: "detached HEAD"' without accompanying SHA. Bare repo/worktree/submodule indistinguishable from regular repo in JSON. parse_git_status_branch has latent dot-split truncation bug on branch names like 'feat.ui' with upstream. Hits roadmap Product Principle #4 (Branch freshness before blame) and Phase 2 §4.2 (branch.stale_against_main event) directly — both unimplementable without commit identity in the JSON surface. Fix shape (~80 lines plumbing): - add head_sha/head_short_sha/is_detached/head_ref/is_bare/is_worktree - add base_commit: {source, expected, state} - add upstream: {ref, ahead, behind, merge_base} - wire --base-commit into CliAction::Status + CliAction::Doctor - add stale_base doctor check - fix parse_git_status_branch dot-split at :2541 Cross-cluster: truth-audit/diagnostic-integrity (#80-#87, #89) + silent-flag (#96-#99) + unplumbed-subsystem (#78). Natural bundles: #89+#100 (git-state completeness) and #78+#96+#100 (unplumbed surface). Milestone: ROADMAP #100. Filed in response to Clawhip pinpoint nudge 1494782026660712672 in #clawcode-building-in-public.	2026-04-18 04:36:47 +09:00
YeonGyu-Kim	63a0d30f57	ROADMAP #99 : claw system-prompt --cwd/--date unvalidated, prompt-injection via newline Dogfooded 2026-04-18 on main HEAD 0e263be from /tmp/cdN. parse_system_prompt_args at main.rs:1162-1190 does: cwd = PathBuf::from(value); date.clone_from(value); Zero validation. Both values flow through to SystemPromptBuilder::render_env_context (prompt.rs:175-186) and render_project_context (prompt.rs:289-293) where they are formatted into the system prompt output verbatim via format!(). Two injection points per value: - # Environment context - 'Working directory: {cwd}' - 'Date: {date}' - # Project context - 'Working directory: {cwd}' - 'Today's date is {date}.' Demonstrated attacks: --date 'not-a-date' → accepted --date '9999-99-99' → accepted --date '1900-01-01' → accepted --date "2025-01-01'; DROP TABLE users;--" → accepted verbatim --date $'2025-01-01\nMALICIOUS: ignore all previous rules' → newline breaks out of bullet into standalone system-prompt instruction line that the LLM will read as separate guidance --cwd '/does/not/exist' → silently accepted, rendered verbatim --cwd '' → empty 'Working directory: ' line --cwd $'/tmp\nMALICIOUS: pwn' → newline injection same pattern --help documents format as '[--cwd PATH] [--date YYYY-MM-DD]'. Parser enforces neither. Same class as #96 / #98 — documented constraint, unenforced at parse boundary. Severity note: most severe of the #96/#97/#98/#99 silent-flag class because the failure mode is prompt injection, not a silent feature no-op. A claw or CI pipeline piping tainted $REPO_PATH / $USER_INPUT into claw system-prompt is a vector for LLM manipulation. Fix shape: 1. parse --date as chrono::NaiveDate::parse_from_str(value, '%Y-%m-%d') 2. validate --cwd via std::fs::canonicalize(value) 3. defense-in-depth: debug_assert no-newlines at render boundary 4. regression tests for each rejected case Cross-cluster: sibling of #83 (system-prompt date = build date) and #84 (dump-manifests bakes abs path) — all three are about the system-prompt / manifest surface trusting compile-time or operator-supplied values that should be validated. Filed in response to Clawhip pinpoint nudge 1494774477009981502 in #clawcode-building-in-public.	2026-04-18 04:03:29 +09:00
YeonGyu-Kim	0e263bee42	ROADMAP #98 : --compact silently ignored in 9 dispatch paths + stdin-piped Prompt hardcodes compact=false Dogfooded 2026-04-18 on main HEAD 7a172a2 from /tmp/cdM. --help at main.rs:8251 documents --compact as 'text mode only; useful for piping.' The implementation knows the constraint but never enforces it at the parse boundary — the flag is silently dropped in every non-{Prompt+Text} dispatch path: 1. --output-format json prompt: run_turn_with_output (:3807-3817) has no CliOutputFormat::Json if compact arm; JSON branch ignores compact entirely 2. status/sandbox/doctor/init/export/mcp/skills/agents: those CliAction variants have no compact field at all; parse_args parses --compact into a local bool and then discharges it with nowhere to go on dispatch 3. claw --compact with piped stdin: the stdin fallthrough at main.rs:614 hardcodes compact: false regardless of the user-supplied --compact — actively overriding operator intent No error, no warning, no diagnostic. A claw using claw --compact --output-format json '...' to pipe-friendly output gets full verbose JSON silently. Fix shape: - reject --compact + --output-format json at parse time (~5 lines) - reject --compact on non-Prompt subcommands with a named error (~15 lines) - honor --compact in stdin-piped Prompt fallthrough: change compact: false to compact at :614 (1 line) - optionally add CliOutputFormat::Json if compact arm if compact-JSON is desirable Joins silent-flag no-op class with #96 (Resume-safe leak) and #97 (silent-empty allow-set). Natural bundle #96+#97+#98 covers the --help/flag-validation hygiene triangle. Filed in response to Clawhip pinpoint nudge 1494766926826700921 in #clawcode-building-in-public.	2026-04-18 03:32:57 +09:00
YeonGyu-Kim	7a172a2534	ROADMAP #97 : --allowedTools empty-string silently blocks all tools, no observable signal Dogfooded 2026-04-18 on main HEAD 3ab920a from /tmp/cdL. Silent vs loud asymmetry for equivalent mis-input at the tool-allow-list knob: - `--allowedTools "nonsense"` → loud structured error naming every valid tool (works as intended) - `--allowedTools ""` (shell-expansion failure, $TOOLS expanded empty) → silent Ok(Some(BTreeSet::new())) → all tools blocked - `--allowedTools ",,"` → same silent empty set - `.claw.json` with `allowedTools` → fails config load with 'unknown key allowedTools' — config-file surface locked out, CLI flag is the only knob, and the CLI flag has the footgun Trace: tools/src/lib.rs:192-248 normalize_allowed_tools. Input values=[""] is NOT empty (len=1) so the early None guard at main.rs:1048 skips. Inner split/filter on empty-only tokens produces zero elements; the error-producing branch never runs. Returns Ok(Some(empty)), which downstream filter treats as 'allow zero tools' instead of 'allow all tools.' No observable recovery: status JSON exposes kind/model/ permission_mode/sandbox/usage/workspace but no allowed_tools field. doctor check set has no tool_restrictions category. A lane that silently restricted itself to zero tools gets no signal until an actual tool call fails at runtime. Fix shape: reject empty-token input at parse time with a clear error. Add explicit --allowedTools none opt-in if zero-tool lanes are desirable. Surface active allow-set in status JSON and as a doctor check. Consider supporting allowedTools in .claw.json or improving its rejection message. Joins permission-audit sweep (#50/#87/#91/#94) on the tool-allow-list axis. Sibling of #86 on the truth-audit side: both are 'misconfigured claws have no observable signal.' Filed in response to Clawhip pinpoint nudge 1494759381068419115 in #clawcode-building-in-public.	2026-04-18 03:04:08 +09:00
YeonGyu-Kim	3ab920ac30	ROADMAP #96 : claw --help Resume-safe summary leaks 62 STUB_COMMANDS entries Dogfooded 2026-04-18 on main HEAD 8db8e49 from /tmp/cdK. Partial regression of ROADMAP #39 / #54 at the help-output layer. 'claw --help' emits two separate slash-command enumerations: (1) Interactive slash commands block -- correctly filtered via render_slash_command_help_filtered(STUB_COMMANDS) at main.rs:8268 (2) Resume-safe commands one-liner -- UNFILTERED, emits every entry from resume_supported_slash_commands() at main.rs:8270-8278 Programmatic cross-check: intersect the Resume-safe listing with STUB_COMMANDS (60+ entries at main.rs:7240-7320) returns 62 overlaps: budget, rate-limit, metrics, diagnostics, workspace, reasoning, changelog, bookmarks, allowed-tools, tool-details, language, max-tokens, temperature, system-prompt, output-style, privacy-settings, keybindings, thinkback, insights, stickers, advisor, brief, summary, vim, and more. All advertised as resume-safe; all produce 'Did you mean /X' stub-guard errors when actually invoked in resume mode. Fix shape: one-line filter at main.rs:8270 adding .filter(\|spec\| !STUB_COMMANDS.contains(&spec.name)) or extract shared helper resume_supported_slash_commands_filtered. Add regression test parallel to stub_commands_absent_from_repl_ completions that parses the Resume-safe line and asserts no entry matches STUB_COMMANDS. Filed in response to Clawhip pinpoint nudge 1494751832399024178 in #clawcode-building-in-public.	2026-04-18 02:35:06 +09:00
YeonGyu-Kim	8db8e4902b	ROADMAP #95 : skills install is user-scope only, no uninstall, leaks across workspaces Dogfooded 2026-04-18 on main HEAD b7539e6 from /tmp/cdJ. Three stacked gaps on the skill-install surface: (1) User-scope only install. default_skill_install_root at commands/src/lib.rs returns CLAW_CONFIG_HOME/skills -> CODEX_HOME/skills -> HOME/.claw/skills -- all user-level. No project-scope code path. Installing from workspace A writes to ~/.claw/skills/X and makes X active:true in every other workspace with source.id=user_claw. (2) No uninstall. claw --help enumerates /skills [list\|install\|help\|<skill>] -- no uninstall. 'claw skills uninstall X' falls through to prompt-dispatch. REPL /skill is identical. Removing a bad skill requires manual rm -rf on the installed path parsed out of install receipt output. (3) No scope signal. Install receipt shows 'Registry /Users/yeongyu/.claw/skills' but the operator is never asked project vs user, and JSON receipt does not distinguish install scope. Doubly compounds with #85 (skill discovery ancestor walk): an attacker who can write under an ancestor OR can trick the operator into one bad 'skills install' lands a skill in the user-level registry that's active in every future claw invocation. Runs contrary to the project/user/local three-tier scope settings already use (User / Project / Local via ConfigSource). Skills collapse all three onto User at install time. Fix shape (~60 lines): --scope user\|project\|local flag on skills install (no default in --output-format json mode, prompt interactively); claw skills uninstall + /skills uninstall slash-command; installed_path per skill record in --output-format json skills output. Filed in response to Clawhip pinpoint nudge 1494744278423961742 in #clawcode-building-in-public.	2026-04-18 02:03:10 +09:00
YeonGyu-Kim	b7539e679e	ROADMAP #94 : permission rules accept typos, case-sensitive match disagrees with ecosystem convention, invisible in all diagnostic surfaces Dogfooded 2026-04-18 on main HEAD 7f76e6b from /tmp/cdI. Three stacked failures on the permission-rule surface: (1) Typo tolerance. parse_optional_permission_rules at runtime/src/config.rs:780-798 is just optional_string_array with no per-entry validation. Typo rules like 'Reed', 'Bsh(echo:)', 'WebFech' load silently; doctor reports config: ok. (2) Case-sensitive match against lowercase runtime names. PermissionRule::matches does self.tool_name != tool_name strict compare. Runtime registers tools lowercase (bash). Claude Code convention / MCP docs use capitalized (Bash). So 'deny: ["Bash(rm:)"]' never fires because tool_name='bash' != rule.tool_name='Bash'. Cross-harness config portability fails open, not closed. (3) Loaded rules invisible. status JSON has no permission_rules field. doctor has no rules check. A clawhip preflight asking 'does this lane actually deny Bash(rm:*)?' has no machine-readable answer; has to re-parse .claw.json and re-implement parse semantics. Contrast: --allowedTools CLI flag HAS tool-name validation with a 50+ tool registry. The same registry is not consulted when parsing permissions.allow/deny/ask. Asymmetric validation, same shape as #91 (config accepts more permission-mode labels than CLI). Fix shape (~30-45 lines): validate rule tool names against the same registry --allowedTools uses; case-fold tool_name compare in PermissionRule::matches; expose loaded rules in status/doctor JSON with unknown_tool flag. Filed in response to Clawhip pinpoint nudge 1494736729582862446 in #clawcode-building-in-public.	2026-04-18 01:34:15 +09:00
YeonGyu-Kim	7f76e6bbd6	ROADMAP #93 : --resume reference heuristic forks silently; no workspace scoping Dogfooded 2026-04-18 on main HEAD bab66bb from /tmp/cdH. SessionStore::resolve_reference at runtime/src/session_control.rs: 86-116 branches on a textual heuristic -- looks_like_path = direct.extension().is_some() \|\| direct.components().count() > 1. Same-looking reference triggers two different code paths: Repros: - 'claw --resume session-123' -> managed store lookup (no extension, no slash) -> 'session not found: session-123' - 'claw --resume session-123.jsonl' -> workspace-relative file path (extension triggers path branch) -> opens /cwd/session-123.jsonl, succeeds if present - 'claw --resume /etc/passwd' -> absolute path opened verbatim, fails only because JSONL parse errors ('invalid JSONL record at line 1: unexpected character: #') - 'claw --resume /etc/hosts' -> same; file is read, structural details (first char, line number) leak in error - symlink inside .claw/sessions/<fp>/passwd-symlink.jsonl pointing at /etc/passwd -> claw --resume passwd-symlink follows it Clawability impact: operators copying session ids from /session list naturally try adding .jsonl and silently hit the wrong branch. Orchestrators round-tripping session ids through --resume cannot do any path normalization without flipping lookup modes. No workspace scoping, so any readable file on disk is a valid target. Symlinks inside managed path escape the workspace silently. Fix shape (~15 lines minimum): canonicalize the resolved candidate and assert prefix match with workspace_root before opening; return OutsideWorkspace typed error otherwise. Optional cleanup: split --resume <id> and --resume-file <path> into explicit shapes. Filed in response to Clawhip pinpoint nudge 1494729188895359097 in #clawcode-building-in-public.	2026-04-18 01:04:37 +09:00
YeonGyu-Kim	bab66bb226	ROADMAP #92 : MCP config does not expand ${VAR} or ~/ — standard configs fail silently Dogfooded 2026-04-18 on main HEAD d0de86e from /tmp/cdE. MCP command, args, url, headers, headersHelper config fields are loaded and passed to execve/URL-parse verbatim. No ${VAR} interpolation, no ~/ home expansion, no preflight check, no doctor warning. Repros: - {'command':'~/bin/my-server','args':['~/config/file.json']} -> execve('~/bin/my-server', ['~/config/file.json']) -> ENOENT at MCP connect time. - {'command':'${HOME}/bin/my-server','args':['--tenant=${TENANT_ID}']} -> literal ${HOME}/bin/my-server handed to execve; literal ${TENANT_ID} passed to the server as tenant argument. - {'headers':{'Authorization':'Bearer ${API_TOKEN}'}} -> literal string 'Bearer ${API_TOKEN}' sent as HTTP header. Trace: parse_mcp_server_config in runtime/src/config.rs stores strings raw; McpStdioProcess::spawn at mcp_stdio.rs:1150-1170 is Command::new(&transport.command).args(&transport.args).spawn(). grep interpolate/expand_env/substitute/${ across runtime/src/ returns empty outside format-string literals. Clawability impact: every public MCP server README uses ${VAR}/~/ in examples; copy-pasted configs load with doctor:ok and fail opaquely at spawn with generic ENOENT that has lost the context about why. Operators forced to hardcode secrets in .claw.json (triggering #90) or wrap commands in shell scripts -- both worse security postures than the ecosystem norm. Cross-harness round-trip from Claude Code /.mcp.json breaks when interpolation is present. Fix shape (~50 lines): config-load-time interpolation of ${VAR} and leading ~/ in command/args/url/headers/headers_helper; missing- variable warnings captured into ConfigLoader all_warnings; optional {'config':{'expand_env':false}} toggle; mcp_config_interpolation doctor check that flags literal ${ / ~/ remaining after substitution. Filed in response to Clawhip pinpoint nudge 1494721628917989417 in #clawcode-building-in-public.	2026-04-18 00:35:44 +09:00
YeonGyu-Kim	d0de86e8bc	ROADMAP #91 : permission-mode parsers disagree; dontAsk silently means danger-full-access Dogfooded 2026-04-18 on main HEAD 478ba55 from /tmp/cdC. Two permission-mode parsers disagree on valid labels: - Config parse_permission_mode_label (runtime/src/config.rs:851-862) accepts 8 labels and collapses 5 aliases onto 3 canonical modes. - CLI normalize_permission_mode (rusty-claude-cli/src/main.rs:5455- 5461) accepts only the 3 canonical labels. Same binary, same intent, opposite verdicts: .claw.json {"defaultMode":"plan"} -> silent ReadOnly + doctor ok --permission-mode plan -> rejected with 'unsupported permission mode' Semantic collapses of note: - 'default' -> ReadOnly (name says nothing about what default means) - 'plan' -> ReadOnly (upstream plan-mode semantics don't exist in claw; ExitPlanMode tool exists but has no matching PermissionMode variant) - 'acceptEdits'/'auto' -> WorkspaceWrite (ambiguous names) - 'dontAsk' -> DangerFullAccess (FOOTGUN: sounds like 'quiet mode', actually the most permissive; community copy-paste bypasses every danger-keyword audit) Status JSON exposes canonicalized permission_mode only; original label lost. Claw reading status cannot distinguish 'plan' from explicit 'read-only', or 'dontAsk' from explicit 'danger-full-access'. Fix shape (~20-30 lines): align the two parsers to accept/reject identical labels; add permission_mode_raw to status JSON (paired with permission_mode_source from #87); either remove the 'dontAsk' alias or trigger a doctor warn when raw='dontAsk'; optionally introduce a real PermissionMode::Plan runtime variant. Filed in response to Clawhip pinpoint nudge 1494714078965403848 in #clawcode-building-in-public.	2026-04-18 00:05:13 +09:00
YeonGyu-Kim	478ba55063	ROADMAP #90 : claw mcp surface redacts env but dumps args/url/headersHelper Dogfooded 2026-04-17 on main HEAD 64b29f1 from /tmp/cdB. The MCP details surface correctly redacts env -> env_keys and headers -> header_keys (deliberate precedent for 'show config without secrets'), but dumps args, url, and headersHelper verbatim even though all three standardly carry inline credentials. Repros: (1) args leak: {'args':['--api-key','sk-secret-ABC123','--token=...', '--url=https://user:password@host/db']} appears unredacted in both details.args and the summary string. (2) URL leak: 'url':'https://user:SECRET@api.example.com/mcp' and matching summary. (3) headersHelper leak: helper command path + its secret-bearing argv emitted whole. Trace: mcp_server_details_json at commands/src/lib.rs:3972-3999 is the single redaction point. env/headers get key-only projection; args/url/headers_helper carve-out with no explaining comment. Text surface at :3873-3920 mirrors the same leak. Clawability shape: mcp list --output-format json is exactly the surface orchestrators scrape for preflight and that logs / Discord announcements / claw export / CI artifacts will carry. Asymmetric redaction sends the wrong signal -- consumers assume secret-aware, the leak is unexpected and easy to miss. Standard MCP wiring patterns (--api-key, postgres://user:pass@, token helper scripts) all hit the leak. Fix shape (~40-60 lines): redact args with secret heuristic (--api-key, --token, --password, high-entropy tails, user:pass@); redact URL basic-auth + query-string secrets; split headersHelper argv and apply args heuristic; add optional --show-sensitive opt-in; add mcp_secret_posture doctor check. No MCP runtime behavior changes -- only reporting surface. Filed in response to Clawhip pinpoint nudge 1494706529918517390 in #clawcode-building-in-public.	2026-04-17 23:32:40 +09:00
YeonGyu-Kim	64b29f16d5	ROADMAP #89 : claw blind to mid-rebase/merge/cherry-pick git states Dogfooded 2026-04-17 on main HEAD 9882f07. A rebase halted on conflict leaves .git/rebase-merge/ on disk + HEAD detached on the rebase intermediate commit. 'claw --output-format json status' reports git_state='dirty ... 1 conflicted', git_branch='detached HEAD', no rebase flag. 'claw --output-format json doctor' reports workspace: {status:ok, summary:'project root detected on branch detached HEAD'}. Trace: parse_git_workspace_summary at rusty-claude-cli/src/main.rs: 2550-2587 scans git status --short output only; no .git/rebase- merge, .git/rebase-apply, .git/MERGE_HEAD, .git/CHERRY_PICK_HEAD, .git/BISECT_LOG check anywhere in rust/crates/. check_workspace_ health emits Ok so long as a project root was detected. Clawability impact: preflight blindness (doctor ok on paused lane), stale-branch detection breaks (freshness vs base is meaningless when HEAD is a rebase intermediate), no recovery surface (no abort/resume hints), same 'surface lies about runtime truth' family as #80-#87. Fix shape (~20 lines): detect marker files, expose typed workspace.git_operation field (kind/paused/abort_hint/resume_hint), flip workspace doctor verdict to warn when git_operation != null. Filed in response to Clawhip pinpoint nudge 1494698980091756678 in #clawcode-building-in-public.	2026-04-17 23:03:53 +09:00
YeonGyu-Kim	9882f07e7d	ROADMAP #88 : unbounded CLAUDE.md ancestor walk = prompt injection via /tmp Dogfooded 2026-04-17 on main HEAD 82bd8bb from /tmp/claude-md-injection/inner/work. discover_instruction_files at runtime/src/prompt.rs:203-224 walks cursor.parent() until None with no project-root bound, no HOME containment, no git boundary. Four candidate paths per ancestor (CLAUDE.md, CLAUDE.local.md, .claw/CLAUDE.md, .claw/instructions.md) are loaded and inlined verbatim into the agent's system prompt under '# Claude instructions'. Repro: /tmp/claude-md-injection/CLAUDE.md containing adversarial guidance appears under 'CLAUDE.md (scope: /private/tmp/claude-md- injection)' in claw system-prompt from any nested CWD. git init inside the worker does not terminate the walk. /tmp/CLAUDE.md alone is sufficient -- /tmp is world-writable with sticky bit on macOS/ Linux, so any local user can plant agent guidance for every other user's claw invocation under /tmp/anything. Worse than #85 (skills ancestor walk): no agent action required (injection fires on every turn before first user message), lower bar for the attacker (raw Markdown, no frontmatter), standard world-writable drop point (/tmp), no doctor signal. Same structural fix family though: prompt.rs:203, commands/src/lib.rs:2795 (skills), and commands/src/lib.rs:2724 (agents) all need the same project_root / HOME bound. Fix shape (~30-50 lines): bound ancestor walk at project root / HOME; add doctor check that surfaces loaded instruction files with paths; add settings.json opt-in toggle for monorepo ancestor inheritance with 'source: ancestor' annotation. Filed in response to Clawhip pinpoint nudge 1494691430096961767 in #clawcode-building-in-public.	2026-04-17 22:33:13 +09:00
YeonGyu-Kim	82bd8bbf77	ROADMAP #87 : fresh-workspace permission default is danger-full-access, doctor silent Dogfooded 2026-04-17 on main HEAD d6003be against /tmp/cd8. Fresh workspace, no config, no env, no CLI flag: claw status reports 'Permission mode danger-full-access'. 'claw doctor' has no permission-mode check at all -- zero lines mention it. Trace: rusty-claude-cli/src/main.rs:1099-1107 default_permission_mode falls back to PermissionMode::DangerFullAccess when env/config miss. runtime/src/permissions.rs:7-15 PermissionMode ordinal puts DangerFullAccess above WorkspaceWrite/ReadOnly, so current_mode >= required_mode gate at :260-264 auto-approves every tool spec requiring DangerFullAccess or below -- including bash and PowerShell. check_sandbox_health exists at :1895-1910 but no parallel check_permission_health. Status JSON exposes permission_mode but no permission_mode_source field -- fallback indistinguishable from deliberate choice. Interacts badly with #86: corrupt .claw.json silently drops the user's 'plan' choice AND escalates to danger-full-access fallback, and doctor reports Config: ok across both failures. Fix shape (~30-40 lines): add permission doctor check (warn when effective=DangerFullAccess via fallback); add permission_mode_source to status JSON; optionally flip fallback to WorkspaceWrite/Prompt for non-interactive invocations. Filed in response to Clawhip pinpoint nudge 1494683886658257071 in #clawcode-building-in-public.	2026-04-17 22:06:49 +09:00
YeonGyu-Kim	d6003be373	ROADMAP #86 : corrupt .claw.json silently dropped, doctor says config ok Dogfooded 2026-04-17 on main HEAD 586a92b against /tmp/cd7. A valid .claw.json with permissions.defaultMode=plan applies correctly (claw status shows Permission mode read-only). Corrupt the same file to junk text and: (1) claw status reverts to danger-full-access, (2) claw doctor still reports Config: status=ok, summary='runtime config loaded successfully', with loaded_config_files=0 and discovered_files_count=1 side by side in the same check. Trace: read_optional_json_object at runtime/src/config.rs:674-692 sets is_legacy_config = (file_name == '.claw.json') and on parse failure returns Ok(None) instead of Err(ConfigError::Parse). No warning, no eprintln. ConfigLoader::load() continues past the None, reports overall success. Doctor check at rusty-claude-cli/src/main.rs:1725-1754 emits DiagnosticLevel::Ok whenever load() returned Ok, even with loaded 0/1. Compare a non-legacy settings path at .claw/settings.json with identical corruption: doctor correctly fails loudly. Same file contents, different filename -> opposite diagnostic verdict. Intent was presumably legacy compat with stale historical .claw.json. Implementation now masks live user-written typos. A clawhip preflight that gates on 'status != ok' never sees this. Same surface-lies- about-runtime-truth shape as #80-#84, at the config layer. Fix shape (~20-30 lines): replace silent skip with warn-and-skip carrying the parse error; flip doctor verdict when loaded_count < present_count; expose skipped_files in JSON surface. Filed in response to Clawhip pinpoint nudge 1494676332507041872 in #clawcode-building-in-public.	2026-04-17 21:33:44 +09:00
YeonGyu-Kim	586a92ba79	ROADMAP #85 : unbounded ancestor walk enumerates attacker-placed skills Dogfooded 2026-04-17 on main HEAD 2eb6e0c. discover_skill_roots at commands/src/lib.rs:2795 iterates cwd.ancestors() unbounded -- no project-root check, no HOME containment, no git boundary. Any .claw/skills, .omc/skills, .agents/skills, .codex/skills, .claude/skills directory on any ancestor path up to / is enumerated and marked active: true in 'claw --output-format json skills'. Repro 1 (cross-tenant skill injection): write /tmp/trap/.agents/skills/rogue/SKILL.md; cd /tmp/trap/inner/work and 'claw skills' shows rogue as active, sourced as Project roots. git init inside the inner CWD does NOT stop the walk. Repro 2 (CWD-dependent skill set): CWD under $HOME yields ~/.agents/skills contents; CWD outside $HOME hides them. Same user, same binary, 26-skill delta driven by CWD alone. Security shape: any attacker-writable ancestor becomes a skill injection primitive. Skill descriptions are free-form Markdown fed into the agent context -- crafted descriptions become prompt injection. tools/src/lib.rs:3295 independently walks ancestors for dispatch, so the injected skill is also executable via slash command, not just listed. Fix shape (~30-50 lines): bound ancestor walk at project root (ConfigLoader::project_root), optionally also at $HOME; require explicit settings.json toggle for monorepo ancestor inheritance; mirror fix in tools/src/lib.rs::push_project_skill_lookup_roots so listed and dispatchable skill surfaces match. Filed in response to Clawhip pinpoint nudge 1494668784382771280 in #clawcode-building-in-public.	2026-04-17 21:07:10 +09:00
YeonGyu-Kim	2eb6e0c1ee	ROADMAP #84 : dump-manifests bakes build machine's absolute path into binary Dogfooded 2026-04-17 on main HEAD 70a0f0c from /tmp/cd4. 'claw dump-manifests' with no arguments emits: error: Manifest source files are missing. repo root: /Users/yeongyu/clawd/claw-code missing: src/commands.ts, src/tools.ts, src/entrypoints/cli.tsx That path is the build machine's absolute filesystem layout, baked in via env!('CARGO_MANIFEST_DIR') at rusty-claude-cli/src/main.rs:2016. strings on the binary reveals the raw path verbatim. JSON surface (--output-format json) leaks the same path identically. Three problems: (1) broken default for any user running a distributed binary because the path won't exist on their machine; (2) privacy leak -- build user's $HOME segment embedded in the binary and surfaced to every recipient; (3) reproducibility violation -- two binaries built from the same commit on different machines produce different runtime behavior. Same compile-time-vs-runtime family as ROADMAP #83 (build date injected as 'today'). Fix shape (<=20 lines): drop env!('CARGO_MANIFEST_DIR') from the runtime default, require CLAUDE_CODE_UPSTREAM / --manifests-dir / settings entry, reword error to name the required config instead of leaking a path the user never asked for. Optional polish: add a settings.json [upstream] entry. Acceptance: strings <binary> \| grep '^/Users/' returns empty for the shipped binary. Default error surface contains zero absolute paths from the build machine. Filed in response to Clawhip pinpoint nudge 1494661235336282248 in #clawcode-building-in-public.	2026-04-17 20:36:51 +09:00
YeonGyu-Kim	70a0f0cf44	ROADMAP #83 : DEFAULT_DATE injects build date as 'today' in live system prompt Dogfooded 2026-04-17 on main HEAD e58c194 against /tmp/cd3. Binary built 2026-04-10; today is 2026-04-17. 'claw system-prompt' emits 'Today's date is 2026-04-10.' The same DEFAULT_DATE constant (rusty-claude-cli/src/main.rs:69-72) is threaded into build_system_prompt() at :6173-6180 and every ClaudeCliSession / StreamingCliSession / non-interactive runner (lines 3649, 3746, 4165, 4211, ...), so the stale date lives in the LIVE agent prompt, not just the system-prompt subcommand. Agents reason from 'today = compile day,' which silently breaks any task that depends on real time (freshness, deadlines, staleness, expiry). Violates ROADMAP principle #4 (branch freshness before blame) and mixes compile-time context into runtime behavior, producing different prompts for two agents on the same main HEAD built a week apart. Fix shape (~30 lines): compute current_date at runtime via chrono::Utc::now().date_naive(), sweep DEFAULT_DATE call sites in main.rs, keep --date override and --version's build-date meaning, add CLAWD_OVERRIDE_DATE env escape for reproducible tests. Filed in response to Clawhip pinpoint nudge 1494653681222811751 in #clawcode-building-in-public.	2026-04-17 20:02:37 +09:00
YeonGyu-Kim	e58c1947c1	ROADMAP #82 : macOS sandbox filesystem_active=true is a lie Dogfooded 2026-04-17 on main HEAD 1743e60 against /tmp/claw-dogfood-2. claw --output-format json sandbox on macOS reports filesystem_active= true, filesystem_mode=workspace-only but the actual enforcement is only HOME/TMPDIR env-var rebasing at bash.rs:205-209 / :228-232. build_linux_sandbox_command is cfg(target_os=linux)-gated and returns None on macOS, so the fallback path is sh -lc <command> with env tweaks and nothing else. Direct escape proof: a child with HOME=/ws/.sandbox-home TMPDIR=/ws/.sandbox-tmp writes /tmp/claw-escape-proof.txt and mkdir /tmp/claw-probe-target without error. Clawability problem: claws/orchestrators read SandboxStatus JSON and branch on filesystem_active && filesystem_mode=='workspace-only' to decide whether a worker can safely touch /tmp or $HOME. Today that branch lies on macOS. Fix shape option A (low-risk, ~15 lines): compute filesystem_active only where an enforcement path exists, so macOS reports false by default and fallback_reason surfaces the real story. Option B: wire a Seatbelt (sandbox-exec) profile for actual macOS enforcement. Filed in response to Clawhip pinpoint nudge 1494646135317598239 in #clawcode-building-in-public.	2026-04-17 19:33:06 +09:00
YeonGyu-Kim	1743e600e1	ROADMAP #81 : claw status Project root lies about session scope Dogfooded 2026-04-17 on main HEAD a48575f inside claw-code itself and reproduced on /tmp/claw-split-17. SessionStore::from_cwd at session_control.rs:32-40 uses the raw CWD as input to workspace_fingerprint() (line 295-303), not the project root surfaced in claw status. Result: two CWDs in the same git repo (e.g. ~/clawd/claw-code vs ~/clawd/claw-code/rust) report the same Project root in status but land in two disjoint .claw/sessions/ <fp>/ partitions. claw --resume latest from one CWD returns 'no managed sessions found' even though the adjacent CWD has a live session visible via /session list. Status-layer truth (Project root) and session-layer truth (fingerprint-of-CWD) disagree and neither surface exposes the disagreement -- classic split-truth per ROADMAP pain point #2. Fix shape (<=40 lines): (a) fingerprint the project root instead of raw CWD, or (b) surface partition key explicitly in status. Filed in response to Clawhip pinpoint nudge 1494638583481372833 in #clawcode-building-in-public.	2026-04-17 19:05:12 +09:00
Jobdori	a48575fd83	ROADMAP #80 : session-lookup error copy lies about on-disk layout Dogfooded 2026-04-17 on main HEAD 688295e against /tmp/claw-d4. SessionStore::from_cwd at session_control.rs:32-40 places sessions under .claw/sessions/<workspace_fingerprint>/ (16-char FNV-1a hex at line 295-303), but format_no_managed_sessions and format_missing_session_reference at line 516-526 advertise plain .claw/sessions/ with no fingerprint context. Concrete repro: fresh workspace, no sessions yet, .claw/sessions/ contains foo/ (hash dir, empty) + ffffffffffffffff/foreign.jsonl (foreign workspace session). 'claw --resume latest' still says 'no managed sessions found in .claw/sessions/' even though that directory is not empty -- the sessions just belong to other workspace partitions. Fix shape is ~30 lines: plumb the resolved sessions_root/workspace into the two format helpers, optionally enumerate sibling partitions so error copy tells the operator where sessions from other workspaces are and why they're invisible. Filed in response to Clawhip pinpoint nudge 1494615932222439456 in #clawcode-building-in-public.	2026-04-17 17:33:05 +09:00
Jobdori	688295ea6c	ROADMAP #79 : claw --output-format json init discards structured InitReport Dogfooded 2026-04-17 on main HEAD 9deaa29. init.rs:38-113 already builds a fully-typed InitReport { project_root, artifacts: Vec< InitArtifact { name, status: InitStatus }> } but main.rs:5436-5454 calls .render() on it and throws the structure away, emitting only {kind, message: '<prose>'} via init_json_value(). Downstream claws have to regex 'created\|updated\|skipped' out of the message string to know per-artifact state. version/system-prompt/acp/bootstrap-plan all emit structured payloads on the same binary -- init is the sole odd-one-out. Fix shape is ~20 lines: add InitReport::to_json_value + InitStatus::as_str, switch run_init to hold the report instead of .render()-ing it eagerly, preserve message for backward compat, add output_format_contract regression. Filed in response to Clawhip pinpoint nudge 1494608389068558386 in #clawcode-building-in-public.	2026-04-17 17:02:58 +09:00
Jobdori	9deaa29710	ROADMAP #78 : claw plugins CLI route is a dead constructor Dogfooded 2026-04-17 on main HEAD d05c868. CliAction::Plugins variant is declared at main.rs:303-307 and wired to LiveCli::print_plugins at main.rs:202-206, but parse_args has no "plugins" arm, so claw plugins / claw plugins list / claw --output-format json plugins all fall through to the LLM-prompt catch-all and emit a missing Anthropic credentials error. This is the sole documented-shaped subcommand that does NOT resolve to a local CLI route: agents, mcp, skills, acp, init, dump-manifests, bootstrap-plan, system-prompt, export all work. grep confirms CliAction::Plugins has exactly one hit in crates/ (the handler), not a constructor anywhere. Filed with a ~15 line parser fix shape plus help/test wiring, matching the pattern already used by agents/mcp/skills. Filed in response to Clawhip pinpoint nudge 1494600832652546151 in #clawcode-building-in-public.	2026-04-17 16:33:09 +09:00
Jobdori	d05c8686b8	ROADMAP #77 : typed error-kind contract for --output-format json errors Dogfooded 2026-04-17 against main HEAD 00d0eb6. Five distinct failure classes (missing credentials, missing manifests, missing worker state, session not found, CLI parse) all emit the same {type,error} envelope with no machine-readable kind/code, so downstream claws have to regex the prose to route failures. Success payloads already carry a stable 'kind' discriminator; error payloads do not. Fix shape proposes an ErrorKind discriminant plus hint/context fields to match the success side contract. Filed in response to Clawhip pinpoint nudge 1494593284180414484 in #clawcode-building-in-public.	2026-04-17 16:08:41 +09:00
Yeachan-Heo	ac45bbec15	Make ACP/Zed status obvious before users go source-diving ROADMAP #21, #22, and #23 were already closed on current main, so the next real repo-local backlog item was the ACP/Zed discoverability gap. This adds a local `claw acp` status surface plus aliases, updates help/docs, and separates the shipped discoverability fix from the still-open daemon/protocol follow-up so editor-first users get a crisp answer immediately. Constraint: No ACP/Zed daemon or protocol server exists in claw-code yet, so the new surface must be explicit status guidance rather than a fake implementation Rejected: Add a pretend `acp serve` daemon path \| would imply supported protocol behavior that does not exist Rejected: Docs-only clarification \| still leaves `claw --help` unable to answer the editor-launch question directly Confidence: high Scope-risk: narrow Reversibility: clean Directive: Keep ROADMAP discoverability fixes separate from future ACP daemon/protocol work so help text and backlog IDs stay unambiguous Tested: cargo fmt --check; cargo clippy --workspace --all-targets -- -D warnings; cargo test --workspace; cargo run -q -p rusty-claude-cli -- acp; cargo run -q -p rusty-claude-cli -- --output-format json acp; architect review APPROVED Not-tested: Real ACP/Zed daemon launch because no protocol-serving surface exists yet	2026-04-16 03:13:50 +00:00
Yeachan-Heo	64e058f720	refresh	2026-04-16 02:50:54 +00:00
Yeachan-Heo	6a957560bd	Make recovery handoffs explain why a lane resumed instead of leaking control prose Recent OMX dogfooding kept surfacing raw `[OMX_TMUX_INJECT]` messages as lane results, which told operators that tmux reinjection happened but not why or what lane/state it applied to. The lane-finished persistence path now recognizes that control prose, stores structured recovery metadata, and emits a human-meaningful fallback summary instead of preserving the raw marker as the primary result. Constraint: Keep the fix in the existing lane-finished metadata surface rather than inventing a new runtime channel Rejected: Treat all reinjection prose as ordinary quality-floor mush \| loses the recovery cause and target lane operators actually need Confidence: high Scope-risk: narrow Reversibility: clean Directive: Recovery classification is heuristic; extend the parser only when new operator phrasing shows up in real dogfood evidence Tested: cargo fmt --all --check Tested: cargo clippy --workspace --all-targets -- -D warnings Tested: cargo test --workspace Tested: LSP diagnostics on rust/crates/tools/src/lib.rs (0 errors) Tested: Architect review (APPROVE) Not-tested: Additional reinjection phrasings beyond the currently observed `[OMX_TMUX_INJECT]` / current-mode-state variants Related: ROADMAP #68	2026-04-12 15:50:39 +00:00
Yeachan-Heo	42bb6cdba6	Keep local clawhip artifacts from tripping routine repo work Dogfooding kept reproducing OMX team merge conflicts on `.clawhip/state/prompt-submit.json`, so the init bootstrap now teaches repos to ignore `.clawhip/` alongside the existing local `.claw/` artifacts. This also updates the current repo ignore list so the fix helps immediately instead of only on future `claw init` runs. Constraint: Keep the fix narrow and centered on repo-local ignore hygiene Rejected: Broader team merge-hygiene changes \| unnecessary for the proven local root cause Confidence: high Scope-risk: narrow Reversibility: clean Directive: If more runtime-local artifact directories appear, extend the shared init gitignore list instead of patching repos ad hoc Tested: cargo fmt --all --check Tested: cargo clippy --workspace --all-targets -- -D warnings Tested: cargo test --workspace Tested: Architect review (APPROVE) Not-tested: Existing clones with already-tracked `.clawhip` files still need manual cleanup Related: ROADMAP #75	2026-04-12 14:47:40 +00:00
Yeachan-Heo	f91d156f85	Keep poisoned test locks from cascading across unrelated regressions The repo-local backlog was effectively exhausted, so this sweep promoted the newly observed test-lock poisoning pain point into ROADMAP #74 and fixed it in place. Test-only env/cwd lock acquisition now recovers poisoned mutexes in the remaining strict call sites, and each affected surface has a regression that proves a panic no longer permanently poisons later tests. Constraint: Keep the fix test-only and avoid widening runtime behavior changes Rejected: Refactor shared helper signatures across broader call paths \| unnecessary churn beyond the remaining strict test sites Confidence: high Scope-risk: narrow Reversibility: clean Directive: These guards only recover the mutex; tests that mutate env or cwd still must restore process-global state explicitly Tested: cargo fmt --all --check Tested: cargo clippy --workspace --all-targets -- -D warnings Tested: cargo test --workspace Tested: Architect review (APPROVE) Not-tested: Additional fault-injection around partially restored env/cwd state after panic Related: ROADMAP #74	2026-04-12 13:52:41 +00:00
Yeachan-Heo	6b4bb4ac26	Keep finished lanes from leaving stale reminders armed The next repo-local sweep target was ROADMAP #66: reminder/cron state could stay enabled after the associated lane had already finished, which left stale nudges firing into completed work. The fix teaches successful lane persistence to disable matching enabled cron entries and record which reminder ids were shut down on the finished event. Constraint: Preserve existing cron/task registries and add the shutdown behavior only on the successful lane-finished path Rejected: Add a separate reminder-cleanup command that operators must remember to run \| leaves the completion leak unfixed at the source Confidence: high Scope-risk: narrow Reversibility: clean Directive: If cron-matching heuristics change later, update `disable_matching_crons`, its regression, and the ROADMAP closeout together Tested: cargo fmt --all --check; cargo clippy --workspace --all-targets -- -D warnings; cargo test --workspace; architect review APPROVE Not-tested: Cross-process cron/reminder persistence beyond the in-memory registry used in this repo	2026-04-12 12:52:27 +00:00
Yeachan-Heo	e75d67dfd3	Make successful lanes explain what artifacts they actually produced The next repo-local sweep target was ROADMAP #64: downstream consumers still had to infer artifact provenance from prose even though the repo already emitted structured lane events. The fix extends `lane.finished` metadata with structured artifact provenance so successful completions can report roadmap ids, files, diff stat, verification state, and commit sha without relying on narration alone. Constraint: Preserve the existing commit-created event and lane-finished metadata paths while adding structured provenance to successful completions Rejected: Introduce a separate artifact event type first \| unnecessary for this focused closeout because `lane.finished` already carries structured data and existing consumers can read it there Confidence: high Scope-risk: narrow Reversibility: clean Directive: If artifact provenance extraction rules change later, update `extract_artifact_provenance`, its regression payload, and the ROADMAP closeout together Tested: cargo fmt --all --check; cargo clippy --workspace --all-targets -- -D warnings; cargo test --workspace; architect review APPROVE Not-tested: Downstream consumers that ignore `lane.finished.data.artifactProvenance` and still parse only prose output	2026-04-12 11:56:00 +00:00
Yeachan-Heo	2e34949507	Keep latest-session timestamps increasing under tight loops The next repo-local sweep target was ROADMAP #73: repeated backlog sweeps exposed that session writes could share the same wall-clock millisecond, which made semantic recency fragile and forced the resume-latest regression to sleep between saves. The fix makes session timestamps monotonic within the process and removes the timing hack from the test so latest-session selection stays stable under tight loops. Constraint: Preserve the existing session file format while changing only the timestamp source semantics Rejected: Keep the sleep-based test workaround \| hides the real ordering hazard instead of fixing timestamp generation Confidence: high Scope-risk: narrow Reversibility: clean Directive: Any future session-recency logic must keep `current_time_millis`, ordering tests, and latest-session expectations aligned Tested: cargo fmt --all --check; cargo clippy --workspace --all-targets -- -D warnings; cargo test --workspace; architect review APPROVE Not-tested: Cross-process monotonicity when multiple binaries write sessions concurrently	2026-04-12 10:51:19 +00:00
Yeachan-Heo	8f53524bd3	Make backlog-scan lanes say what they actually selected The next repo-local sweep target was ROADMAP #65: backlog-scanning lanes could stop with prose-only summaries naming roadmap items, but there was no machine-readable record of which items were chosen, which were skipped, or whether the lane intended to execute, review, or no-op. The fix teaches completed lane persistence to extract a structured selection outcome while preserving the existing quality- floor and review-verdict behavior for other lanes. Constraint: Keep selection-outcome extraction on the existing `lane.finished` metadata path instead of inventing a separate event stream Rejected: Add a dedicated selection event type first \| unnecessary for this focused closeout because `lane.finished` already persists structured data downstream can read Confidence: high Scope-risk: narrow Reversibility: clean Directive: If backlog-scan summary conventions change later, update `extract_selection_outcome`, its regression test, and the ROADMAP closeout wording together Tested: cargo fmt --all --check; cargo clippy --workspace --all-targets -- -D warnings; cargo test --workspace; architect review APPROVE after roadmap closeout update Not-tested: Downstream consumers that may still ignore `lane.finished.data.selectionOutcome`	2026-04-12 09:54:37 +00:00
Yeachan-Heo	b5e30e2975	Make completed review lanes emit machine-readable verdicts The next repo-local sweep target was ROADMAP #67: scoped review lanes could stop with prose-only output, leaving downstream consumers to infer approval or rejection from later chatter. The fix teaches completed lane persistence to recognize review-style `APPROVE`/`REJECT`/`BLOCKED` results, attach structured verdict metadata to `lane.finished`, and keep ordinary non-review lanes on the existing quality-floor path. Constraint: Preserve the existing non-review lane summary path while enriching only review-style completions Rejected: Add a brand-new lane event type just for review results \| unnecessary when `lane.finished` already carries structured metadata and downstream consumers can read it there Confidence: high Scope-risk: narrow Reversibility: clean Directive: If review verdict parsing changes later, update `extract_review_outcome`, the finished-event payload fields, and the review-lane regression together Tested: cargo fmt --all --check; cargo clippy --workspace --all-targets -- -D warnings; cargo test --workspace; architect review APPROVE Not-tested: External consumers that may still ignore `lane.finished.data.reviewVerdict`	2026-04-12 08:49:40 +00:00
Yeachan-Heo	dbc2824a3e	Keep latest session selection tied to real session recency The next repo-local sweep target was ROADMAP #72: the `latest` managed-session alias could depend on filesystem mtime before the session's own persisted recency markers, which made the selection path vulnerable to coarse or misleading file timestamps. The fix promotes `updated_at_ms` into the summary/order path, keeps CLI wrappers in sync, and locks the mtime-vs-session-recency case with regression coverage. Constraint: Preserve existing managed-session storage layout while changing only the ordering signal Rejected: Keep sorting by filesystem mtime and just sleep longer in tests \| hides the semantic ordering bug instead of fixing it Confidence: high Scope-risk: narrow Reversibility: clean Directive: Any future managed-session ordering change must keep runtime and CLI summary structs aligned on the same recency fields Tested: cargo fmt --all --check; cargo clippy --workspace --all-targets -- -D warnings; cargo test --workspace; architect review APPROVE Not-tested: Cross-filesystem behavior where persisted session JSON cannot be read and fallback ordering uses mtime only	2026-04-12 07:49:32 +00:00
Yeachan-Heo	f309ff8642	Stop repo lanes from executing the wrong task payload The next repo-local sweep target was ROADMAP #71: a claw-code lane accepted an unrelated KakaoTalk/image-analysis prompt even though the lane itself was supposed to be repo-scoped work. This extends the existing prompt-misdelivery guardrail with an optional structured task receipt so worker boot can reject visible wrong-task context before the lane continues executing. Constraint: Keep the fix inside the existing worker_boot / WorkerSendPrompt control surface instead of inventing a new external OMX-only protocol Rejected: Treat wrong-task receipts as generic shell misdelivery \| loses the expected-vs-observed task context needed to debug contaminated lanes Confidence: high Scope-risk: narrow Reversibility: clean Directive: If task-receipt fields change later, update the WorkerSendPrompt schema, worker payload serialization, and wrong-task regression together Tested: cargo fmt --all --check; cargo clippy --workspace --all-targets -- -D warnings; cargo test --workspace; architect review APPROVE Not-tested: External orchestrators that have not yet started populating the optional task_receipt field	2026-04-12 07:00:07 +00:00
Yeachan-Heo	3b806702e7	Make the CLI point users at the real install source The next repo-local backlog item was ROADMAP #70: users could mistake third-party pages or the deprecated `cargo install claw-code` path for the official install route. The CLI now surfaces the source of truth directly in `claw doctor` and `claw --help`, and the roadmap closeout records the change. Constraint: Keep the fix inside repo-local Rust CLI surfaces instead of relying on docs alone Rejected: Close #70 with README-only wording \| the bug was user-facing CLI ambiguity, so the warning needed to appear in runtime help/doctor output Confidence: high Scope-risk: narrow Reversibility: clean Directive: If install guidance changes later, update both the doctor check payload and the help-text warning together Tested: cargo fmt --all --check; cargo clippy --workspace --all-targets -- -D warnings; cargo test --workspace; architect review APPROVE Not-tested: Third-party websites outside this repo that may still present stale install instructions	2026-04-12 04:50:03 +00:00
Yeachan-Heo	26b89e583f	Keep completed lanes from ending on mushy stop summaries The next repo-local sweep target was ROADMAP #69: completed lane runs could persist vague control text like “commit push everyting, keep sweeping $ralph”, which made downstream stop summaries operationally useless. The fix adds a lane-finished quality floor that preserves strong summaries, rewrites empty/control-only/too- short-without-context summaries into a contextual fallback, and records structured metadata explaining when the fallback fired. Constraint: Keep legitimate concise lane summaries intact while improving only low-signal completions Rejected: Blanket-rewrite every completed summary into a templated sentence \| would erase useful model-authored detail from good lane outputs Confidence: high Scope-risk: narrow Reversibility: clean Directive: If lane-finished summary heuristics change later, update the structured `qualityFloorApplied/rawSummary/reasons/wordCount` contract and its regression tests together Tested: cargo fmt --all --check; cargo clippy --workspace --all-targets -- -D warnings; cargo test --workspace; architect review APPROVE Not-tested: External OMX consumers that may still ignore the new lane.finished data payload	2026-04-12 03:23:39 +00:00
YeonGyu-Kim	17e21bc4ad	docs(roadmap): add #70 — install-source ambiguity misleads users User treated claw-code.io as official, hit clawcode vs deprecated claw-code naming collision. Adding requirement for canonical docs to explicitly state official source and warn against deprecated crate. Source: gaebal-gajae community watch 2026-04-12	2026-04-12 12:08:52 +09:00
Yeachan-Heo	4f83a81cf6	Make dump-manifests recoverable outside the inferred build tree The backlog sweep found that the user-cited #21-#23 items were already closed, and the next real pain point was `claw dump-manifests` failing without a direct way to point at the upstream manifest source. This adds an explicit `--manifests-dir` path, upgrades the failure messages to say whether the source root or required files are missing, and updates the ROADMAP closeout to reflect that #45 is now fixed. Constraint: Preserve existing dump-manifests behavior when no explicit override is supplied Rejected: Require CLAUDE_CODE_UPSTREAM for every invocation \| breaks existing build-tree workflows and is unnecessarily rigid Confidence: high Scope-risk: narrow Reversibility: clean Directive: Keep manifest-source override guidance centralized so future error-path edits do not drift Tested: cargo fmt --all; cargo clippy --workspace --all-targets -- -D warnings; cargo test --workspace; architect review APPROVE Not-tested: Manual invocation against every legacy env-based manifest lookup layout	2026-04-12 02:57:11 +00:00
Yeachan-Heo	1d83e67802	Keep the backlog sweep from chasing external executor notes ROADMAP #31 described acpx/droid executor quirks, but a fresh repo-local search showed no implementation surface outside ROADMAP.md. This rewrites the local unpushed team checkpoint commits into one docs-only closeout so the branch reflects the real claw-code backlog instead of runtime-generated state. Constraint: Current evidence is limited to repo-local search plus existing prior closeouts Rejected: Leave team auto-checkpoint commits intact \| they pollute the branch with runtime state and obscure the actual closeout Confidence: high Scope-risk: narrow Reversibility: clean Directive: Keep generated .clawhip prompt-submit artifacts out of backlog closeout commits Tested: Repo-local grep evidence for #31/#63-#68 terms; ROADMAP.md line review; architect approval x2 Not-tested: Fresh remote/backlog audit beyond the current repo-local evidence set	2026-04-12 02:57:11 +00:00
YeonGyu-Kim	763437a0b3	docs(roadmap): add #69 — lane stop summary quality floor clawcode-human session stopped with sloppy summary ('commit push everyting, keep sweeping '). Adding requirement for minimum stop/result summary standards. Source: gaebal-gajae dogfood analysis 2026-04-12	2026-04-12 11:18:18 +09:00
Yeachan-Heo	491386f0a5	Keep external orchestration gaps out of the claw-code sweep path ROADMAP #63-#68 describe OMX/Ultraclaw orchestration behavior, but a repo-local search shows those implementation markers do not exist in claw-code source. Marking that scope boundary directly in the roadmap keeps future backlog sweeps from repeatedly targeting the wrong repository. Constraint: Stay within claw-code repo scope while continuing the user-requested backlog sweep Rejected: Attempt repo-local fixes for #63-#68 \| implementation surface is absent from this repository Confidence: high Scope-risk: narrow Reversibility: clean Directive: Treat #63-#68 as external tracking notes unless claw-code later grows the corresponding orchestration/runtime surface Tested: Repo-local search for acpx/ultraclaw/roadmap-nudge-10min/OMX_TMUX_INJECT outside ROADMAP.md Not-tested: No code/test/static-analysis rerun because the change is docs-only	2026-04-12 02:14:43 +00:00
Yeachan-Heo	5c85e5ad12	Keep the worker-state backlog honest with current main behavior ROADMAP #62 was stale. Current main already emits `.claw/worker-state.json` on worker status transitions and exposes the documented `claw state` reader surface, so leaving the item open would keep sending future backlog passes after already-landed work. Fresh verification on the exact branch confirmed the implementation and left the workspace green, so this commit closes the item with current proof instead of duplicating the feature. Constraint: User required fresh cargo fmt, cargo clippy --workspace --all-targets -- -D warnings, and cargo test --workspace before push Constraint: OMX team runtime was explicitly requested, but the verification lane stalled before producing any diff Rejected: Re-implement the worker-state feature from scratch \| current main already contains the runtime hook, CLI surface, and regression coverage Confidence: high Scope-risk: narrow Reversibility: clean Directive: Reopen #62 only with a fresh repro showing missing `.claw/worker-state.json` writes or a broken `claw state` surface on current main Tested: cargo test -p runtime emit_state_file_writes_worker_status_on_transition -- --nocapture; cargo test -p tools recovery_loop_state_file_reflects_transitions -- --nocapture; cargo test -p rusty-claude-cli removed_login_and_logout_subcommands_error_helpfully -- --nocapture; cargo fmt; cargo clippy --workspace --all-targets -- -D warnings; cargo test --workspace; architect review APPROVE Not-tested: No dedicated automated end-to-end CLI regression for reading `.claw/worker-state.json` beyond parser coverage and focused smoke validation	2026-04-12 01:51:15 +00:00
Yeachan-Heo	b825713db3	Retire the stale slash-command backlog item without breaking verification ROADMAP #39 was stale: current main already hides the unimplemented slash commands from the help/completion surfaces that triggered the original report, so the backlog entry should be marked done with current evidence instead of staying open forever. While rerunning the user's required Rust verification gates on the exact commit we planned to push, clippy exposed duplicate and unused imports in the plugin state-isolation files. Folding those cleanup fixes into the same closeout keeps the proof honest and restores a green workspace before the backlog retirement lands. Constraint: User required fresh cargo fmt, cargo clippy --workspace --all-targets -- -D warnings, and cargo test --workspace before push Rejected: Push the roadmap-only closeout without fixing the workspace \| would violate the required verification gate and leave main red Confidence: high Scope-risk: narrow Reversibility: clean Directive: Re-run the full Rust workspace gates on the exact commit you intend to push when retiring stale roadmap items Tested: cargo fmt; cargo clippy --workspace --all-targets -- -D warnings; cargo test --workspace Not-tested: No manual interactive REPL completion/help smoke test beyond the existing automated coverage	2026-04-12 00:59:29 +00:00

1 2 3 4

151 Commits