claw-code

mirror of https://github.com/ultraworkers/claw-code.git synced 2026-04-24 13:08:11 +08:00

Author	SHA1	Message	Date
YeonGyu-Kim	79a9f0e6f6	fix: #163 — remove [turn N] suffix pollution from run_turn_loop; file #164 timeout-cancellation followup #163: run_turn_loop no longer injects f'{prompt} [turn N]' into follow-up prompts. The suffix was never defined or interpreted anywhere — not by the engine, not by the system prompt, not by any LLM. It looked like a real user-typed annotation in the transcript and made replay/analysis fragile. New behaviour: - turn 0 submits the original prompt (unchanged) - turn > 0 submits caller-supplied continuation_prompt if provided, else the loop stops cleanly — no fabricated user turn - added continuation_prompt: str \| None = None parameter to run_turn_loop - added --continuation-prompt CLI flag for claws scripting multi-turn loops - zero '[turn' strings ever appear in mutable_messages or stdout now Behaviour change for existing callers: - Before: run_turn_loop(prompt, max_turns=3) submitted 3 turns ('prompt', 'prompt [turn 2]', 'prompt [turn 3]') - After: run_turn_loop(prompt, max_turns=3) submits 1 turn ('prompt') - To preserve old multi-turn behaviour, pass continuation_prompt='Continue.' or any structured follow-up text One existing timeout test (test_budget_is_cumulative_across_turns) updated to pass continuation_prompt so the cumulative-budget contract is actually exercised across turns instead of trivially satisfied by a one-turn loop. #164 filed: addresses reviewer feedback on #161. The wall-clock timeout bounds the caller-facing wait, but the underlying submit_message worker thread keeps running and can mutate engine state after the timeout TurnResult is returned. A cooperative cancel_event pattern is sketched in the pinpoint; real asyncio.Task.cancel() support will come once provider IO is async-native (larger refactor). Tests (tests/test_run_turn_loop_continuation.py, 8 tests): - TestNoTurnSuffixInjection (2): zero '[turn' strings in any submitted prompt, both default and explicit-continuation paths - TestContinuationDefaultStopsAfterTurnZero (2): default loops run exactly one turn; engine.submit_message called exactly once despite max_turns=10 - TestExplicitContinuationBehaviour (2): turn 0 = original, turn N = continuation verbatim; max_turns still respected - TestCLIContinuationFlag (2): CLI default emits only '## Turn 1'; --continuation-prompt wires through to multi-turn behaviour Full suite: 67/67 passing. Closes ROADMAP #163. Files #164.	2026-04-22 17:37:22 +09:00
YeonGyu-Kim	41a6091355	file: #163 — run_turn_loop injects [turn N] suffix into follow-up prompts; multi-turn sessions semantically broken	2026-04-22 10:07:35 +09:00
YeonGyu-Kim	bc94870a54	file: #162 — submit_message appends budget-exceeded turn before returning max_budget_reached; session state corrupted on overflow	2026-04-22 09:38:00 +09:00
YeonGyu-Kim	ee3aa29a5e	file: #161 — run_turn_loop has no wall-clock timeout, stalled turn blocks indefinitely	2026-04-22 08:57:38 +09:00
YeonGyu-Kim	a389f8dff1	file: #160 — session_store missing list_sessions, delete_session, session_exists — claw cannot enumerate or clean up sessions without filesystem hacks	2026-04-22 08:47:52 +09:00
YeonGyu-Kim	7a014170ba	file: #159 — run_turn_loop hardcodes empty denied_tools, permission denials absent from multi-turn sessions	2026-04-22 06:48:03 +09:00
YeonGyu-Kim	986f8e89fd	file: #158 — compact_messages_if_needed drops turns silently, no structured compaction event	2026-04-22 06:37:54 +09:00
YeonGyu-Kim	ef1cfa1777	file: #157 — structured remediation registry for error hints (Phase 3 of #77 ) ## Gap #77 Phase 1 added machine-readable error kind discriminants and #156 extended them to text-mode output. However, the hint field is still prose derived from splitting existing error text — not a stable registry-backed remediation contract. Downstream claws inspecting the hint field still need to parse human wording to decide whether to retry, escalate, or terminate. ## Fix Shape 1. Remediation registry: remediation_for(kind, operation) -> Remediation struct with action (retry/escalate/terminate/configure), target, and stable message 2. Stable hint outputs per error class (no more prose splitting) 3. Golden fixture tests replacing split_error_hint() string hacks ## Source gaebal-gajae dogfood sweep 2026-04-22 05:30 KST	2026-04-22 05:31:00 +09:00
YeonGyu-Kim	14c5ef1808	file: #156 — error classification for text-mode output (Phase 2 of #77 ) ROADMAP entry for natural Phase 2 follow-up to #77 Phase 1 (JSON error kind classification). Text-mode errors currently prose-only with no structured class; observability tools parsing stderr need the kind token. Two implementation options: - Prefix line before error prose: [error-kind: missing_credentials] - Suffix comment: # error_class=missing_credentials Scope: ~20 lines. Non-breaking (adds classification, doesn't change error text). Source: Cycle 11 dogfood probe at 23:18 KST — product surface clean after today's batch, identified natural next step for error-classification symmetry.	2026-04-21 23:19:58 +09:00
YeonGyu-Kim	4b53b97e36	docs: #155 — add USAGE.md documentation for /ultraplan, /teleport, /bughunter commands ## Problem Three interactive slash commands are documented in `claw --help` but have no corresponding section in USAGE.md: - `/ultraplan [task]` — Run a deep planning prompt with multi-step reasoning - `/teleport <symbol-or-path>` — Jump to a file or symbol by searching the workspace - `/bughunter [scope]` — Inspect the codebase for likely bugs New users see these commands in the help output but don't know: - What each command does - How to use it - When to use it vs. other commands - What kind of results to expect ## Fix Added new section "Advanced slash commands (Interactive REPL only)" to USAGE.md with documentation for all three commands: 1. `/ultraplan` — multi-step reasoning for complex tasks - Example: `/ultraplan refactor the auth module to use async/await` - Output: structured plan with numbered steps and reasoning 2. `/teleport` — navigate to a file or symbol - Example: `/teleport UserService`, `/teleport src/auth.rs` - Output: file content with the requested symbol highlighted 3. `/bughunter` — scan for likely bugs - Example: `/bughunter src/handlers`, `/bughunter` (all) - Output: list of suspicious patterns with explanations ## Impact Users can now discover these commands and understand when to use them without having to guess or search external sources. Bridges the gap between `--help` output and full documentation. Also filed ROADMAP #155 documenting the gap. Closes ROADMAP #155.	2026-04-21 21:49:04 +09:00
YeonGyu-Kim	3cfe6e2b14	feat: #154 — hint provider prefix and env var when model name looks like different provider ## Problem When a user types `claw --model gpt-4` or `--model qwen-plus`, they get: ``` error: invalid model syntax: 'gpt-4'. Expected provider/model (e.g., anthropic/claude-opus-4-6) or known alias ``` USAGE.md documents that "The error message now includes a hint that names the detected env var" — but this hint does not actually exist. The user has to re-read USAGE.md or guess the correct prefix. ## Fix Enhance `validate_model_syntax` to detect when a model name looks like it belongs to a different provider: 1. OpenAI models (starts with `gpt-` or `gpt_`): ``` Did you mean `openai/gpt-4`? (Requires OPENAI_API_KEY env var) ``` 2. Qwen/DashScope models (starts with `qwen`): ``` Did you mean `qwen/qwen-plus`? (Requires DASHSCOPE_API_KEY env var) ``` 3. Grok/xAI models (starts with `grok`): ``` Did you mean `xai/grok-3`? (Requires XAI_API_KEY env var) ``` Unrelated invalid models (e.g., `asdfgh`) do not get a spurious hint. ## Verification - `claw --model gpt-4` → hints `openai/gpt-4` + `OPENAI_API_KEY` - `claw --model qwen-plus` → hints `qwen/qwen-plus` + `DASHSCOPE_API_KEY` - `claw --model grok-3` → hints `xai/grok-3` + `XAI_API_KEY` - `claw --model asdfgh` → generic error (no hint) ## Tests Added 3 new assertions in `parses_multiple_diagnostic_subcommands`: - GPT model error hints openai/ prefix and OPENAI_API_KEY - Qwen model error hints qwen/ prefix and DASHSCOPE_API_KEY - Unrelated models don't get a spurious hint All 177 rusty-claude-cli tests pass. Closes ROADMAP #154.	2026-04-21 21:40:48 +09:00
YeonGyu-Kim	71f5f83adb	feat: #153 — add post-build binary location and verification guide to README ## Problem Users frequently ask after building: - "Where is the claw binary?" - "Did the build actually work?" - "Why can't I run \`claw\` from anywhere?" This happens because \`cargo build\` puts the binary in \`rust/target/debug/claw\` (or \`rust/target/release/claw\`), and new users don't know: 1. Where to find it 2. How to test it 3. How to add it to PATH (optional but common follow-up) ## Fix Added new section "Post-build: locate the binary and verify" to README covering: 1. Binary location table: debug vs. release, macOS/Linux vs. Windows paths 2. Verification commands: Test the binary with \`--help\` and \`doctor\` 3. Three ways to add to PATH: - Symlink (macOS/Linux): \`ln -s ... /usr/local/bin/claw\` - cargo install: \`cargo install --path . --force\` - Shell profile update: add rust/target/debug to \$PATH 4. Troubleshooting: Common errors ("command not found", "permission denied", debug vs. release build speed) ## Impact New users can now: - Find the binary immediately after build - Run it and verify with \`claw doctor\` - Know their options for system-wide access Also filed ROADMAP #153 documenting the gap. Closes ROADMAP #153.	2026-04-21 21:29:59 +09:00
YeonGyu-Kim	dddbd78dbd	file: #152 — diagnostic verb suffixes allow arbitrary positional args, double error prefix Filed from nudge directive at 21:17 KST. Implementation exists on worktree `jobdori-127-verb-suffix` but needs rebase due to merge with #141. Ready for Phase 1 implementation once conflicts resolved.	2026-04-21 21:19:51 +09:00
YeonGyu-Kim	7bc66e86e8	feat: #151 — canonicalize workspace path in SessionStore::from_cwd/data_dir ## Problem `workspace_fingerprint(path)` hashes the raw path string without canonicalization. Two equivalent paths (e.g. `/tmp/foo` vs `/private/tmp/foo` on macOS) produce different fingerprints and therefore different session stores. #150 fixed the test-side symptom; this fixes the underlying product contract. ## Discovery path #150 fix (canonicalize in test) was a workaround. Q's ack on #150 surfaced the deeper gap: the function itself is still fragile for any caller passing a non-canonical path: 1. Embedded callers with a raw `--data-dir` path 2. Programmatic `SessionStore::from_cwd(user_path)` calls 3. NixOS store paths, Docker bind mounts, case-insensitive normalization The REPL's default flow happens to work because `env::current_dir()` returns canonical paths on macOS. But any caller passing a raw path risks silent session-store divergence. ## Fix Canonicalize inside `SessionStore::from_cwd()` and `from_data_dir()` before computing the fingerprint. Kept `workspace_fingerprint()` itself as a pure function for determinism — canonicalization is the entry point's responsibility. ```rust let canonical_cwd = fs::canonicalize(cwd).unwrap_or_else(\|_\| cwd.to_path_buf()); let sessions_root = canonical_cwd.join(".claw").join("sessions").join(workspace_fingerprint(&canonical_cwd)); ``` Falls back to the raw path if canonicalize fails (directory doesn't exist yet). ## Test-side updates Three legacy-session tests expected the non-canonical base path to match the store's workspace_root. Updated them to canonicalize `base` after creation — same defensive pattern as #150, now explicit across all three tests. ## Regression test Added `session_store_from_cwd_canonicalizes_equivalent_paths` that creates two stores from equivalent paths (raw vs canonical) and asserts they resolve to the same sessions_dir. ## Verification - `cargo test -p runtime session_store_` — 9/9 pass - `cargo test --workspace` — all green, no FAILED markers - No behavior change for existing users (REPL default flow already used canonical paths) ## Backward compatibility Users on macOS who always went through `env::current_dir()`: no hash change, sessions resume identically. Users who ever called with a non-canonical path: hash would change, but those sessions were already broken (couldn't be resumed from a canonical-path cwd). Net improvement. Closes ROADMAP #151.	2026-04-21 21:06:09 +09:00
YeonGyu-Kim	eaa077bf91	fix: #150 — eliminate symlink canonicalization flake in resume_latest test + file #246 (reminder outcome ambiguity) ## #150 Fix: resume_latest test flake Problem: `resume_latest_restores_the_most_recent_managed_session` intermittently fails when run in the workspace suite or multiple times in sequence, but passes in isolation. Root cause: `workspace_fingerprint(path)` hashes the path string without canonicalization. On macOS, `/tmp` is a symlink to `/private/tmp`. The test creates a temp dir via `std::env::temp_dir().join(...)` which returns `/var/folders/...` (non-canonical). When the subprocess spawns, `env::current_dir()` returns the canonical path `/private/var/folders/...`. The two fingerprints differ, so the subprocess looks in `.claw/sessions/<hash1>` while files are in `.claw/sessions/<hash2>`. Session discovery fails. Fix: Call `fs::canonicalize(&project_dir)` after creating the directory to ensure test and subprocess use identical path representations. Verification: 5 consecutive runs of the full test suite — all pass. Previously: 5/5 failed when run in sequence. ## #246 Filing: Reminder cron outcome ambiguity (control-loop blocker) The `clawcode-dogfood-cycle-reminder` cron times out repeatedly with no structured feedback on whether the nudge was delivered, skipped, or died in-flight. Phase 1 outcome schema — add explicit field to cron result: - `delivered` — nudge posted to Discord - `timed_out_before_send` — died before posting - `timed_out_after_send` — posted but cleanup timed out - `skipped_due_to_active_cycle` — previous cycle active - `aborted_gateway_draining` — daemon shutdown Assigned to gaebal-gajae (cron/orchestration domain). Unblocks trustworthy dogfood cycle observability. Closes ROADMAP #150. Filed ROADMAP #246.	2026-04-21 21:01:09 +09:00
YeonGyu-Kim	bc259ec6f9	fix: #149 — eliminate parallel-test flake in runtime::config tests ## Problem `runtime::config::tests::validates_unknown_top_level_keys_with_line_and_field_name` intermittently fails during `cargo test --workspace` (witnessed during #147 and #148 workspace runs) but passes deterministically in isolation. Example failure from workspace run: test result: FAILED. 464 passed; 1 failed ## Root cause `runtime/src/config.rs::tests::temp_dir()` used nanosecond timestamp alone for namespace isolation: std::env::temp_dir().join(format!("runtime-config-{nanos}")) Under parallel test execution on fast machines with coarse clock resolution, two tests start within the same nanosecond bucket and collide on the same path. One test's `fs::remove_dir_all(root)` then races another's in-flight `fs::create_dir_all()`. Other crates already solved this pattern: - plugins::tests::temp_dir(label) — label-parameterized - runtime::git_context::tests::temp_dir(label) — label-parameterized runtime/src/config.rs was missed. ## Fix Added process id + monotonically-incrementing atomic counter to the namespace, making every callsite provably unique regardless of clock resolution or scheduling: static COUNTER: AtomicU64 = AtomicU64::new(0); let pid = std::process::id(); let seq = COUNTER.fetch_add(1, Ordering::Relaxed); std::env::temp_dir().join(format!("runtime-config-{pid}-{nanos}-{seq}")) Chose counter+pid over the label-parameterized pattern to avoid touching all 20 callsites in the same commit (mechanical noise with no added safety — counter alone is sufficient). ## Verification Before: one failure per workspace run (config test flake). After: 5 consecutive `cargo test --workspace` runs — zero config test failures. Only pre-existing `resume_latest` flake remains (orthogonal, unrelated to this change). for i in 1 2 3 4 5; do cargo test --workspace; done # All 5 runs: config tests green. Only resume_latest flake appears. cargo test -p runtime # 465 passed; 0 failed ## ROADMAP.md Added Pinpoint #149 documenting the gap, root cause, and fix. Closes ROADMAP #149.	2026-04-21 20:54:12 +09:00
YeonGyu-Kim	f84c7c4ed5	feat: #148 + #128 closure — model provenance in claw status JSON/text ## Scope Two deltas in one commit: ### #128 closure (docs) Re-verified on main HEAD `4cb8fa0`: malformed `--model` strings already rejected at parse time (`validate_model_syntax` in parse_args). All historical repro cases now produce specific errors: claw --model '' → error: model string cannot be empty claw --model 'bad model' → error: invalid model syntax: 'bad model' contains spaces claw --model 'sonet' → error: invalid model syntax: 'sonet'. Expected provider/model or known alias claw --model '@invalid' → error: invalid model syntax: '@invalid'. Expected provider/model ... claw --model 'totally-not-real-xyz' → error: invalid model syntax: ... claw --model sonnet → ok, resolves to claude-sonnet-4-6 claw --model anthropic/claude-opus-4-6 → ok, passes through Marked #128 CLOSED in ROADMAP with repro block. Residual provenance gap split off as #148. ### #148 implementation Problem. After #128 closure, `claw status --output-format json` still surfaces only the resolved model string. No way for a claw to distinguish whether `claude-sonnet-4-6` came from `--model sonnet` (alias resolution) vs `--model claude-sonnet-4-6` (pass-through) vs `ANTHROPIC_MODEL` env vs `.claw.json` config vs compiled-in default. Debug forensics had to re-read argv instead of reading a structured field. Clawhip orchestrators sending `--model` couldn't confirm the flag was honored vs falling back to default. Fix. Added two fields to status JSON envelope: - `model_source`: "flag" \| "env" \| "config" \| "default" - `model_raw`: user's input before alias resolution (null on default) Text mode appends a `Model source` line under `Model`, showing the source and raw input (e.g. `Model source flag (raw: sonnet)`). Resolution order (mirrors resolve_repl_model but with source attribution): 1. If `--model` / `--model=` flag supplied → source: flag, raw: flag value 2. Else if ANTHROPIC_MODEL set → source: env, raw: env value 3. Else if `.claw.json` model key set → source: config, raw: config value 4. Else → source: default, raw: null ## Changes ### rust/crates/rusty-claude-cli/src/main.rs - Added `ModelSource` enum (Flag/Env/Config/Default) with `as_str()`. - Added `ModelProvenance` struct (resolved, raw, source) with three constructors: `default_fallback()`, `from_flag(raw)`, and `from_env_or_config_or_default(cli_model)`. - Added `model_flag_raw: Option<String>` field to `CliAction::Status`. - Parse loop captures raw input in `--model` and `--model=` arms. - Extended `parse_single_word_command_alias` to thread `model_flag_raw: Option<&str>` through. - Extended `print_status_snapshot` signature to accept `model_flag_raw: Option<&str>`. Resolves provenance at dispatch time (flag provenance from arg; else probe env/config/default). - Extended `status_json_value` signature with `provenance: Option<&ModelProvenance>`. On Some, adds `model_source` and `model_raw` fields; on None (legacy resume paths), omits them for backward compat. - Extended `format_status_report` signature with optional provenance. On Some, renders `Model source` line after `Model`. - Updated all existing callers (REPL /status, resume /status, tests) to pass None (legacy paths don't carry flag provenance). - Added 2 regression assertions in parse_args test covering both `--model sonnet` and `--model=...` forms. ### ROADMAP.md - Marked #128 CLOSED with re-verification block. - Filed #148 documenting the provenance gap split, fix shape, and acceptance criteria. ## Live verification $ claw --model sonnet --output-format json status \| jq '{model,model_source,model_raw}' {"model": "claude-sonnet-4-6", "model_source": "flag", "model_raw": "sonnet"} $ claw --output-format json status \| jq '{model,model_source,model_raw}' {"model": "claude-opus-4-6", "model_source": "default", "model_raw": null} $ ANTHROPIC_MODEL=haiku claw --output-format json status \| jq '{model,model_source,model_raw}' {"model": "claude-haiku-4-5-20251213", "model_source": "env", "model_raw": "haiku"} $ echo '{"model":"claude-opus-4-7"}' > .claw.json && claw --output-format json status \| jq '{model,model_source,model_raw}' {"model": "claude-opus-4-7", "model_source": "config", "model_raw": "claude-opus-4-7"} $ claw --model sonnet status Status Model claude-sonnet-4-6 Model source flag (raw: sonnet) Permission mode danger-full-access ... ## Tests - rusty-claude-cli bin: 177 tests pass (2 new assertions for #148) - Full workspace green except pre-existing resume_latest flake (unrelated) Closes ROADMAP #128, #148.	2026-04-21 20:48:46 +09:00
YeonGyu-Kim	4cb8fa059a	feat: #147 — reject empty / whitespace-only prompts at CLI fallthrough ## Problem The `"prompt"` subcommand arm enforced `if prompt.trim().is_empty()` and returned a specific error. The fallthrough `other` arm in the same match block — which routes any unrecognized first positional arg to `CliAction::Prompt` — had no such guard. Result: $ claw "" error: missing Anthropic credentials; export ANTHROPIC_AUTH_TOKEN ... $ claw " " error: missing Anthropic credentials; ... $ claw "" "" error: missing Anthropic credentials; ... $ claw --output-format json "" {"error":"missing Anthropic credentials; ...","type":"error"} An empty prompt should never reach the credentials check. Worse: with valid credentials, the literal empty string gets sent to Claude as a user prompt, either burning tokens for nothing or triggering a model- side refusal. Same prompt-misdelivery family as #145. ## Root cause In `parse_subcommand()`, the final `other =>` arm in the top-level match only guards against typos (#108 guard via `looks_like_subcommand_typo`) and then unconditionally builds `CliAction::Prompt { prompt: rest.join(" ") }`. An empty/whitespace-only join passes through. ## Changes ### rust/crates/rusty-claude-cli/src/main.rs Added the same `if joined.trim().is_empty()` guard already used in the `"prompt"` arm to the fallthrough path. Error message distinguishes it from the `prompt` subcommand path: empty prompt: provide a subcommand (run `claw --help`) or a non-empty prompt string Runs AFTER the typo guard (so `claw sttaus` still suggests `status`) and BEFORE CliAction::Prompt construction (so no network call ever happens for empty inputs). ### Regression tests Added 4 assertions in the existing parse_args test: - parse_args([""]) → Err("empty prompt: ...") - parse_args([" "]) → Err("empty prompt: ...") - parse_args(["", ""]) → Err("empty prompt: ...") - parse_args(["sttaus"]) → Err("unknown subcommand: ...") [verifies #108 typo guard still takes precedence] ### ROADMAP.md Added Pinpoint #147 documenting the gap, verification, root cause, fix shape, and acceptance. Joins the prompt-misdelivery cluster alongside #145. ## Live verification $ claw "" error: empty prompt: provide a subcommand (run `claw --help`) or a non-empty prompt string $ claw " " error: empty prompt: provide a subcommand (run `claw --help`) or a non-empty prompt string $ claw --output-format json "" {"error":"empty prompt: provide a subcommand ...","type":"error"} $ claw prompt "" # unchanged: subcommand-specific error preserved error: prompt subcommand requires a prompt string $ claw hello # unchanged: typo guard still fires error: unknown subcommand: hello. Did you mean help $ claw "real prompt here" # unchanged: real prompts still reach API error: api returned 401 Unauthorized (with dummy key, as expected) All empty/whitespace-only paths exit 1. No network call. No misleading credentials error. ## Tests - rusty-claude-cli bin: 177 tests pass (4 new assertions) - Full workspace green except pre-existing resume_latest flake (unrelated) Closes ROADMAP #147.	2026-04-21 20:35:17 +09:00
YeonGyu-Kim	f877acacbf	feat: #146 — wire `claw config` and `claw diff` as standalone subcommands ## Problem `claw config` and `claw diff` are pure-local read-only introspection commands (config merges .claw.json + .claw/settings.json from disk; diff shells out to `git diff --cached` + `git diff`). Neither needs a session context, yet both rejected direct CLI invocation: $ claw config error: `claw config` is a slash command. Use `claw --resume SESSION.jsonl /config` ... $ claw diff error: `claw diff` is a slash command. ... This forced clawing operators to spin up a full session just to inspect static disk state, and broke natural pipelines like `claw config --output-format json \| jq`. ## Root cause Sibling of #145: `SlashCommand::Config { section }` and `SlashCommand::Diff` had working renderers (`render_config_report`, `render_config_json`, `render_diff_report`, `render_diff_json_for`) exposed for resume sessions, but the top-level CLI parser in `parse_subcommand()` had no arms for them. Zero-arg `config`/`diff` hit `parse_single_word_command_alias`'s fallback to `bare_slash_command_guidance`, producing the misleading guidance. ## Changes ### rust/crates/rusty-claude-cli/src/main.rs - Added `CliAction::Config { section, output_format }` and `CliAction::Diff { output_format }` variants. - Added `"config"` / `"diff"` arms to the top-level parser in `parse_subcommand()`. `config` accepts an optional section name (env\|hooks\|model\|plugins) matching SlashCommand::Config semantics. `diff` takes no positional args. Both reject extra trailing args with a clear error. - Added `"config" \| "diff" => None` to `parse_single_word_command_alias` so bare invocations fall through to the new parser arms instead of the slash-guidance error. - Added dispatch in run() that calls existing renderers: text mode uses `render_config_report` / `render_diff_report`; JSON mode uses `render_config_json` / `render_diff_json_for` with `serde_json::to_string_pretty`. - Added 5 regression assertions in parse_args test covering: parse_args(["config"]), parse_args(["config", "env"]), parse_args(["config", "--output-format", "json"]), parse_args(["diff"]), parse_args(["diff", "--output-format", "json"]). ### ROADMAP.md Added Pinpoint #146 documenting the gap, verification, root cause, fix shape, and acceptance. Explicitly notes which other slash commands (`hooks`, `usage`, `context`, etc.) are NOT candidates because they are session-state-modifying. ## Live verification $ claw config # no config files Config Working directory /private/tmp/cd-146-verify Loaded files 0 Merged keys 0 Discovered files user missing ... project missing ... local missing ... Exit 0. $ claw config --output-format json { "cwd": "...", "files": [...], ... } $ claw diff # no git Diff Result no git repository Detail ... Exit 0. $ claw diff --output-format json # inside claw-code { "kind": "diff", "result": "changes", "staged": "", "unstaged": "diff --git ..." } Exit 0. ## Tests - rusty-claude-cli bin: 177 tests pass (5 new assertions in parse_args) - Full workspace green except pre-existing resume_latest flake (unrelated) ## Not changed `hooks`, `usage`, `context`, `tasks`, `theme`, `voice`, `rename`, `copy`, `color`, `effort`, `branch`, `rewind`, `ide`, `tag`, `output-style`, `add-dir` — all session-mutating or interactive-only; correctly remain slash-only. Closes ROADMAP #146.	2026-04-21 20:07:28 +09:00
YeonGyu-Kim	7d63699f9f	feat: #145 — wire `claw plugins` subcommand to CLI parser (prompt misdelivery fix) ## Problem `claw plugins` (and `claw plugins list`, `claw plugins --help`, `claw plugins info <name>`, etc.) fell through the top-level subcommand match and got routed into the prompt-execution path. Result: a purely local introspection command triggered an Anthropic API call and surfaced `missing Anthropic credentials` to the user. With valid credentials, it would actually send the literal string "plugins" as a user prompt to Claude, burning tokens for a local query. $ claw plugins error: missing Anthropic credentials; export ANTHROPIC_AUTH_TOKEN or ANTHROPIC_API_KEY before calling the Anthropic API $ ANTHROPIC_API_KEY=dummy claw plugins ⠋ 🦀 Thinking... ✘ ❌ Request failed error: api returned 401 Unauthorized Meanwhile siblings (`agents`, `mcp`, `skills`) all worked correctly: $ claw agents No agents found. $ claw mcp MCP Working directory ... Configured servers 0 ## Root cause `CliAction::Plugins` exists, has a working dispatcher (`LiveCli::print_plugins`), and is produced inside the REPL via `SlashCommand::Plugins`. But the top-level CLI parser in `parse_subcommand()` had arms for `agents`, `mcp`, `skills`, `status`, `doctor`, `init`, `export`, `prompt`, etc., and no arm for `plugins`. The dispatch never ran from the CLI entry point. ## Changes ### rust/crates/rusty-claude-cli/src/main.rs Added a `"plugins"` arm to the top-level match in `parse_subcommand()` that produces `CliAction::Plugins { action, target, output_format }`, following the same positional convention as `mcp` (`action` = first positional, `target` = second). Rejects >2 positional args with a clear error. Added four regression assertions in the existing `parse_args` test: - `plugins` alone → `CliAction::Plugins { action: None, target: None }` - `plugins list` → action: Some("list"), target: None - `plugins enable <name>` → action: Some("enable"), target: Some(...) - `plugins --output-format json` → action: None, output_format: Json ### ROADMAP.md Added Pinpoint #145 documenting the gap, verification, root cause, fix shape, and acceptance. ## Live verification $ claw plugins # no credentials set Plugins example-bundled v0.1.0 disabled sample-hooks v0.1.0 disabled $ claw plugins --output-format json # no credentials set { "action": "list", "kind": "plugin", "message": "Plugins\n example-bundled ...\n sample-hooks ...", "reload_runtime": false, "target": null } Exit 0 in all modes. No network call. No "missing credentials" error. ## Tests - rusty-claude-cli bin: 177 tests pass (new plugin assertions included) - Full workspace green except pre-existing resume_latest flake (unrelated) Closes ROADMAP #145.	2026-04-21 19:36:49 +09:00
YeonGyu-Kim	faeaa1d30c	feat: #144 phase 1 + ROADMAP filing — claw mcp degrades gracefully on malformed config Filing + Phase 1 fix in one commit (sibling of #143). ## Context With #143 Phase 1 landed (`claw status` degrades), `claw mcp` was the remaining diagnostic surface that hard-failed on a malformed `.claw.json`. Same input, same parse error, same partial-success violation. Fresh dogfood at 18:59 KST caught it on main HEAD `e2a43fc`. ## Changes ### ROADMAP.md Added Pinpoint #144 documenting the gap and acceptance criteria. Joins the partial-success / Principle #5 cluster with #143. ### rust/crates/commands/src/lib.rs `render_mcp_report_for()` + `render_mcp_report_json_for()` now catch the ConfigError at loader.load() instead of propagating: - Text mode prepends a "Config load error" block (same shape as #143's status output) before the MCP listing. The listing still renders with empty servers so the output structure is preserved. - JSON mode adds top-level `status: "ok" \| "degraded"` + `config_load_error: string \| null` fields alongside existing fields (`kind`, `action`, `working_directory`, `configured_servers`, `servers[]`). On clean runs, `status: "ok"` and `config_load_error: null`. On parse failure, `status: "degraded"`, `config_load_error: "..."`, `servers: []`, exit 0. - Both list and show actions get the same treatment. ### Regression test `commands::tests::mcp_degrades_gracefully_on_malformed_mcp_config_144`: - Injects the same malformed .claw.json as #143 (one valid + one broken mcpServers entry). - Asserts mcp list returns Ok (not Err). - Asserts top-level status: "degraded" and config_load_error names the malformed field path. - Asserts show action also degrades. - Asserts clean path returns status: "ok" with config_load_error null. ## Live verification $ claw mcp --output-format json { "action": "list", "kind": "mcp", "status": "degraded", "config_load_error": ".../.claw.json: mcpServers.missing-command: missing string field command", "working_directory": "/Users/yeongyu/clawd", "configured_servers": 0, "servers": [] } Exit 0. ## Contract alignment after this commit All three diagnostic surfaces match now: - `doctor` — degraded envelope with typed check entries ✅ - `status` — degraded envelope with config_load_error ✅ (#143) - `mcp` — degraded envelope with config_load_error ✅ (this commit) Phase 2 (typed-error object joining taxonomy §4.44) tracked separately across all three surfaces. Full workspace test green except pre-existing resume_latest flake (unrelated). Closes ROADMAP #144 phase 1.	2026-04-21 19:07:17 +09:00
YeonGyu-Kim	fcd5b49428	ROADMAP #143 : claw status hard-fails on malformed MCP config while doctor degrades gracefully	2026-04-21 18:32:09 +09:00
YeonGyu-Kim	2665ada94e	ROADMAP #142 : claw init --output-format json emits unstructured message string instead of created/skipped fields	2026-04-21 17:31:11 +09:00
YeonGyu-Kim	21b377d9c0	ROADMAP #141 : claw <subcommand> --help has 5 different behaviors — inconsistent help surface	2026-04-21 17:01:46 +09:00
YeonGyu-Kim	0cf8241978	ROADMAP #140 : deprecated permissionMode migration silently downgrades DangerFullAccess to WorkspaceWrite — 1 test failure on main HEAD 36b3a09	2026-04-21 16:23:00 +09:00
YeonGyu-Kim	36b3a09818	ROADMAP #139 : claw state error references undocumented 'worker' concept (unactionable for claws)	2026-04-21 16:01:54 +09:00
YeonGyu-Kim	883cef1a26	docs: #138 add concrete evidence — feat/134-135 branch pushed but no PR (closure-state gap)	2026-04-21 15:02:33 +09:00
YeonGyu-Kim	768c1abc78	ROADMAP #138 : dogfood cycle report-gate opacity — nudge surface needs explicit closure state	2026-04-21 14:49:36 +09:00
YeonGyu-Kim	724a78604d	ROADMAP #137 : model-alias shorthand regression in test suite — bare alias parsing broken on feat/134-135-session-identity; 3 tests fail with invalid model syntax error after #134/#135 validation tightening	2026-04-21 13:27:10 +09:00
YeonGyu-Kim	91ba54d39f	ROADMAP #136 : --compact flag silently overrides --output-format json — compact turn always emits plain text even when JSON requested; unreachable Json arm in run_with_output() match; joins output-format completeness cluster #90/#91/#92/#127/#130 and CLI/REPL parity §7.1	2026-04-21 12:27:06 +09:00
YeonGyu-Kim	8b52e77f23	ROADMAP #135 : claw status --json missing active_session bool and session.id cross-reference — status query side of #134 round-trip; joins session identity completeness §4.7 and status surface completeness cluster #80/#83/#114/#122; natural bundle #134+#135 closes session-identity round-trip	2026-04-21 06:55:09 +09:00
YeonGyu-Kim	2c42f8bcc8	docs: remove duplicate ROADMAP #134 entry	2026-04-21 04:50:43 +09:00
YeonGyu-Kim	f266505546	ROADMAP #134 : no run/correlation ID at session boundary — session.id missing from startup event and status JSON; observer must infer session identity from timing	2026-04-21 01:55:42 +09:00
YeonGyu-Kim	5c579e4a09	§4.44.5.1: file ship event wiring pinpoint (schema landed, wiring missing) Dogfood cycle 2026-04-20 identified that §4.44.5 ship/provenance event schema is implemented (ShipProvenance struct, ship.* constructors, tests pass) but actual git push/merge/commit-range operations do not yet emit these events. Events remain dead code—constructors exist but are never called during real workflows. This pinpoint tracks the missing wiring: locating actual git operation call sites in main.rs/tools/lib.rs/worker_boot.rs and intercepting to emit ship.prepared/commits_selected/merged/pushed_main with real metadata (source_branch, commit_range, merge_method, actor, pr_number). Acceptance: at least one real git push emits all 4 events with actual payload values, claw state JSON surfaces ship provenance. Ref: dogfood gaebal-gajae @ 1495672954573291571 (15:30 KST)	2026-04-20 15:30:34 +09:00
YeonGyu-Kim	8a8ca8a355	ROADMAP #4.44.5: Ship/provenance events — implement §4.44.5 Adds structured ship provenance surface to eliminate delivery-path opacity: New lane events: - ship.prepared — intent to ship established - ship.commits_selected — commit range locked - ship.merged — merge completed with provenance - ship.pushed_main — delivery to main confirmed ShipProvenance struct carries: - source_branch, base_commit - commit_count, commit_range - merge_method (direct_push/fast_forward/merge_commit/squash_merge/rebase_merge) - actor, pr_number Constructor methods added to LaneEvent for all four ship events. Tests: - Wire value serialization for ship events - Round-trip deserialization - Canonical event name coverage Runtime: 465 tests pass ROADMAP updated with IMPLEMENTED status This closes the gap where 56 commits pushed to main had no structured provenance trail — now emits first-class events for clawhip consumption.	2026-04-20 15:06:50 +09:00
YeonGyu-Kim	b0b579ebe9	ROADMAP #133 : Blocked-state subphase contract — implement §6.5 Adds BlockedSubphase enum with 7 variants for structured blocked-state reporting: - blocked.trust_prompt — trust gate blockers - blocked.prompt_delivery — prompt misdelivery - blocked.plugin_init — plugin startup failures - blocked.mcp_handshake — MCP connection issues - blocked.branch_freshness — stale branch blockers - blocked.test_hang — test timeout/hang - blocked.report_pending — report generation stuck LaneEventBlocker now carries optional subphase field that gets serialized into LaneEvent data. Enables clawhip to route recovery without pane scraping. Updates: - lane_events.rs: BlockedSubphase enum, LaneEventBlocker.subphase field - lane_events.rs: blocked()/failed() constructors with subphase serialization - lib.rs: Export BlockedSubphase - tools/src/lib.rs: classify_lane_blocker() with subphase: None - Test imports and fixtures updated Backward-compatible: subphase is Option<>, existing events continue to work.	2026-04-20 15:04:08 +09:00
YeonGyu-Kim	c956f78e8a	ROADMAP #4.44.5: Ship/provenance opacity — filed from dogfood Added structured delivery-path contract to surface branch → merge → main-push provenance as first-class events. Filed from the 56-commit 2026-04-20 push that exposed the gap. Also fixes: ApiError test compilation — add suggested_action: None to 4 sites - Line ~8414: opaque_provider_wrapper_surfaces_failure_class_session_and_trace - Line ~8436: retry_exhaustion_uses_retry_failure_class_for_generic_provider_wrapper - Line ~8499: provider_context_window_errors_are_reframed_with_same_guidance - Line ~8533: retry_wrapped_context_window_errors_keep_recovery_guidance	2026-04-20 14:35:07 +09:00
YeonGyu-Kim	dd73962d0b	ROADMAP #122 : doctor invocation does not check stale-base condition — run_stale_base_preflight() only invoked in Prompt + REPL paths, missing in doctor action handler; inconsistency: doctor says 'ok' but prompt warns 'stale base'; joins boot preflight / doctor contract family (#80-#83/#114) and silent-state inventory (#102/#127/#129/#245)	2026-04-20 13:11:12 +09:00
YeonGyu-Kim	027efb2f9f	ROADMAP §4.44: Typed-error envelope contract (Silent-state inventory roll-up) — locks in structured error.kind/operation/target/errno/hint/retryable contract that closes the family of pinpoints currently scattered across #102 + #121 + #127 + #129 + #130 + #245 ; backward-compat additive; regression locked via golden-fixture; gates 'Run claw --help for usage' trailer on error.kind == usage; drafted jointly with gaebal-gajae during 2026-04-20 dogfood cycle	2026-04-20 13:03:50 +09:00
YeonGyu-Kim	866f030713	ROADMAP #130 : claw export --output filesystem errors surface raw OS errno strings with zero context — 5 distinct failure modes all produce different errno strings but the same zero-context shape; no path echoed, no operation named, no io::ErrorKind classification, no actionable hint; JSON envelope flattens to {error, type} losing all structure; Run claw --help for usage trailer misleads on non-usage errors; joins JSON-envelope asymmetry family #90/#91/#92/#110/#115/#116 and truth-audit #80-#127/#129	2026-04-20 12:52:22 +09:00
YeonGyu-Kim	d2a83415dc	ROADMAP #129 : MCP server startup blocks credential validation in Prompt path — cred check ordered AFTER MCP child handshake await; misbehaved/slow MCP wedges every claw <prompt> invocation indefinitely; npx restart loop wastes resources; runtime-side companion to #102 's config-time MCP gap; PARITY.md Lane 7 acceptance gap	2026-04-20 12:43:11 +09:00
YeonGyu-Kim	8122029eba	ROADMAP #128 : claw --model <malformed> (spaces, empty string, invalid syntax) silently accepted at parse time, falls through to cred-error misdirection; joins parser-level trust gap family #108/#117/#119/#122/#127; joins token-burn family #99/#127	2026-04-20 12:32:56 +09:00
YeonGyu-Kim	d284ef774e	ROADMAP #127 : claw <subcommand> --json silently falls through to LLM Prompt dispatch — diagnostic verbs (doctor, status, sandbox, skills, version, help) reject --json with cred-error misdirection; valid verb + unrecognized suffix arg = Prompt fall-through; 18th silent-flag, 5th parser-level trust gap, joins #108 + #117 + #119 + #122	2026-04-20 12:05:05 +09:00
YeonGyu-Kim	7370546c1c	ROADMAP #126 : /config [env\|hooks\|model\|plugins] ignores section argument — all 4 subcommands return bit-identical file-list envelope; 4-way dispatch collapse Dogfooded 2026-04-18 on main HEAD b56841c from /tmp/cdFF2. /config model, /config hooks, /config plugins, /config env all return: {kind:'config', cwd, files:[...], loaded_files, merged_keys} — BIT-IDENTICAL. diff /config model vs /config hooks → empty. Section argument parsed at slash-command level but not branched on in the handler. Help: '/config [env\|hooks\|model\|plugins] Inspect Claude config files or merged sections [resume]' → 'merged sections' never shown. Same file-list for all. Third dispatch-collapse finding: #111: /providers → Doctor (2-way, wildly wrong) #118: /stats + /tokens + /cache → Stats (3-way, distinct) #126: /config env + hooks + model + plugins → file-list (4-way) Fix shape (~60 lines): - Section-specific handlers: /config model → resolved model, source, aliases /config hooks → pre_tool_use, post_tool_use arrays /config plugins → enabled_plugins list /config env → current file-list (already correct) - Bare /config → current file-list envelope - Regression per section Joins Silent-flag/documented-but-unenforced. Joins Truth-audit — help promises section inspection. Joins Dispatch-collapse family: #111 + #118 + #126. Natural bundle: #111 + #118 + #126 — dispatch-collapse trio. Complete parser-dispatch-collapse audit across slash commands. Filed in response to Clawhip pinpoint nudge 1495023618529300580 in #clawcode-building-in-public.	2026-04-18 20:32:52 +09:00
YeonGyu-Kim	b56841c5f4	ROADMAP #125 : git_state 'clean' emitted for non-git directories; GitWorkspaceSummary default all-zeros → is_clean() → 'clean' even when in_git_repo: false; contradictory doctor fields Dogfooded 2026-04-18 on main HEAD debbcbe from /tmp/cdBB2. Non-git directory: $ mkdir /tmp/cdBB2 && cd /tmp/cdBB2 # NO git init $ claw --output-format json status \| jq .workspace.git_state 'clean' # should be null — not in a git repo $ claw --output-format json doctor \| jq '.checks[] \| select(.name=="workspace") \| {in_git_repo, git_state}' {"in_git_repo": false, "git_state": "clean"} # CONTRADICTORY: not in git BUT git is 'clean' Trace: main.rs:2550-2554 parse_git_workspace_summary: let Some(status) = status else { return summary; // all-zero default when no git }; All-zero GitWorkspaceSummary → is_clean() (changed_files==0) → true → headline() = 'clean' main.rs:4950 status JSON: git_summary.headline() for git_state main.rs:1856 doctor workspace: same headline() for git_state Fix shape (~25 lines): - Return Option<GitWorkspaceSummary> when status is None - headline() returns Option<String>: None when no git - Status JSON: git_state: null when not in git - Doctor: omit git_state when in_git_repo: false, or set null - Optional: claw init skip .gitignore in non-git dirs - Regression: non-git → null, git clean → 'clean', detached HEAD → 'clean' + 'detached HEAD' Joins Truth-audit — 'clean' is a lie for non-git dirs. Adjacent to #89 (claw blind to mid-rebase) — same field, different missing state. Joins #100 (status/doctor JSON gaps) — another field whose value doesn't reflect reality. Natural bundle: #89 + #100 + #125 — git-state-completeness triple: rebase/merge invisible (#89) + stale-base unplumbed (#100) + non-git 'clean' lie (#125). Complete git_state field failure coverage. Filed in response to Clawhip pinpoint nudge 1495016073085583442 in #clawcode-building-in-public.	2026-04-18 20:03:32 +09:00
YeonGyu-Kim	debbcbe7fb	ROADMAP #124 : --model accepts any string with zero validation; typos silently pass through; empty string accepted; status JSON has no model provenance Dogfooded 2026-04-18 on main HEAD bb76ec9 from /tmp/cdAA2. --model flag has zero validation: claw --model sonet status → model:'sonet' (typo passthrough) claw --model '' status → model:'' (empty accepted) claw --model garbage status → model:'garbage' (any string) Valid aliases do resolve: sonnet → claude-sonnet-4-6 opus → claude-opus-4-6 Config aliases also resolve via resolve_model_alias_with_config But unresolved strings pass through silently. Typo 'sonet' becomes literal model ID sent to API → fails late with 'model not found' after full context assembly. Compare: --reasoning-effort: validates low\|medium\|high. Has guard. --permission-mode: validates against known set. Has guard. --model: no guard. Any string. --base-commit: no guard (#122). Same pattern. status JSON: {model: 'sonet'} — shows resolved name only. No model_source (flag/config/default). No model_raw (pre-resolution input). No model_valid (known to any provider). Claw can't distinguish typo from exact model from alias. Trace: main.rs:470-480 --model parsing: model = value.clone(); index += 2; No validation. Raw string stored. main.rs:1032-1046 resolve_model_alias_with_config: resolves known aliases. Unknown strings pass through. main.rs:~4951 status JSON builder: reports resolved model. No source/raw/valid fields. Fix shape (~65 lines): - Reject empty string at parse time - Warn on unresolved aliases with fuzzy-match suggestion - Add model_source, model_raw to status JSON - Add model-validity check to doctor - Regression per failure mode Joins #105 (4-surface model disagreement) — model pair: #105 status ignores config model, doctor mislabels #124 --model flag unvalidated, no provenance in JSON Joins #122 (--base-commit zero validation) — unvalidated-flag pair: same parser pattern, no guards. Joins Silent-flag/documented-but-unenforced as 17th. Joins Truth-audit — status model field has no provenance. Joins Parallel-entry-point asymmetry as 10th. Filed in response to Clawhip pinpoint nudge 1495000973914144819 in #clawcode-building-in-public.	2026-04-18 19:03:02 +09:00
YeonGyu-Kim	bb76ec9730	ROADMAP #123 : --allowedTools tool-name normalization asymmetric; snake_case canonicals accept variants, PascalCase canonicals reject snake_case; whitespace+comma split undocumented; allowed_tools not surfaced in JSON Dogfooded 2026-04-18 on main HEAD 2bf2a11 from /tmp/cdZZ. Asymmetric normalization: normalize_tool_name(value) = trim + lowercase + replace -→_ Canonical 'read_file' (snake_case): accepts: read_file, READ_FILE, Read-File, read-file, Read (alias), read (alias) rejects: ReadFile, readfile, READFILE → Because normalize('ReadFile')='readfile', and name_map has key 'read_file' not 'readfile'. Canonical 'WebFetch' (PascalCase): accepts: WebFetch, webfetch, WEBFETCH rejects: web_fetch, web-fetch, Web-Fetch → Because normalize('WebFetch')='webfetch' (no underscore). User input 'web_fetch' normalizes to 'web_fetch' (keeps underscore). Keys don't match. The normalize function ADDS underscores (hyphen→underscore) but DOESN'T REMOVE them. So PascalCase canonicals have underscore- free normalized keys; user input with explicit underscores keeps them, creating key mismatch. Result: 'bash,Bash,BASH,Read,read_file,Read-File,WebFetch' all accepted, but 'web_fetch,web-fetch' rejected. Additional silent-flag issues: - Splits on commas OR whitespace (undocumented — help says TOOL[,TOOL...]) - 'bash,Bash,BASH' silently accepts all 3 case variants, no dedup warning - Allowed tools NOT in status/doctor JSON — claw passing --allowedTools has no way to verify what runtime accepted Trace: tools/src/lib.rs:192-244 normalize_allowed_tools: canonical_names from mvp_tool_specs + plugin_tools + runtime name_map: (normalize_tool_name(canonical), canonical) for token in value.split(\|c\| c==',' \|\| c.is_whitespace()): lookup normalize_tool_name(token) in name_map tools/src/lib.rs:370-372 normalize_tool_name: fn normalize_tool_name(value: &str) -> String { value.trim().replace('-', '_').to_ascii_lowercase() } Replaces - with _. Lowercases. Does NOT remove _. Asymmetry source: normalize('WebFetch')='webfetch', normalize('web_fetch')='web_fetch'. Different keys. --allowedTools NOT plumbed into Status JSON output (no 'allowed_tools' field). Fix shape (~50 lines): - Symmetric normalization: strip underscores from both canonical and input, OR don't normalize hyphens in input either. Pick one convention. - claw tools list / --allowedTools help subcommand that prints canonical names + accepted variants. - Surface allowed_tools in status/doctor JSON when flag set. - Document comma+whitespace split semantics in --help. - Warn on duplicate tokens (bash,Bash,BASH = 3 tokens, 1 unique). - Regression per normalization pair + status surface + duplicate. Joins Silent-flag/documented-but-unenforced (#96-#101, #104, #108, #111, #115, #116, #117, #118, #119, #121, #122) as 16th. Joins Permission-audit/tool-allow-list (#94, #97, #101, #106, #115, #120) as 7th. Joins Truth-audit — status/doctor JSON hides what allowed-tools set actually is. Joins Parallel-entry-point asymmetry (#91, #101, #104, #105, #108, #114, #117, #122) as 9th — --allowedTools vs .claw.json permissions.allow likely disagree on normalization. Natural bundles: #97 + #123 — --allowedTools trust-gap pair: empty silently blocks (#97) + asymmetric normalization + invisible runtime state (#123) Permission-audit 7-way (grown): #94 + #97 + #101 + #106 + #115 + #120 + #123 Flagship permission-audit sweep 8-way (grown): #50 + #87 + #91 + #94 + #97 + #101 + #115 + #123 Filed in response to Clawhip pinpoint nudge 1494993419536306176 in #clawcode-building-in-public.	2026-04-18 18:38:24 +09:00
YeonGyu-Kim	2bf2a11943	ROADMAP #122 : --base-commit greedy-consumes next arg with zero validation; subcommand/flag swallow; stale-base signal missing from status/doctor JSON surfaces Dogfooded 2026-04-18 on main HEAD d1608ae from /tmp/cdYY. Three related findings: 1. --base-commit has zero validation: $ claw --base-commit doctor warning: worktree HEAD (...) does not match expected base commit (doctor). Session may run against a stale codebase. error: missing Anthropic credentials; ... # 'doctor' used as base-commit value literally. # Subcommand absorbed. Prompt fallthrough. Billable. 2. Greedy swallow of next flag: $ claw --base-commit --model sonnet status warning: ...does not match expected base commit (--model) # '--model' taken as value. status never dispatched. 3. Garbage values silently accepted: $ claw --base-commit garbage status Status ... # No validation. No warning (status path doesn't run check). 4. Stale-base signal missing from JSON surfaces: $ claw --output-format json --base-commit $BASE status {"kind":"status", ...} # no stale_base, no base_commit, no base_commit_mismatch. Stale-base check runs ONLY on Prompt path, as stderr prose. Trace: main.rs:487-494 --base-commit parsing: 'base-commit' => { let value = args.get(index + 1).ok_or_else(...)?; base_commit = Some(value.clone()); index += 2; } No format check. No reject-on-flag-prefix. No reject-on- known-subcommand. Compare main.rs:498-510 --reasoning-effort: validates 'low' \| 'medium' \| 'high'. Has guard. stale_base.rs check_base_commit runs on Prompt/turn path only. No Status/Doctor handler includes base_commit field. grep 'stale_base\|base_commit_matches\|base_commit:' rust/crates/rusty-claude-cli/src/main.rs \| grep status\|doctor → zero matches. Fix shape (~40 lines): - Reject values starting with '-' (flag-like) - Reject known-subcommand names as values - Optionally run 'git cat-file -e {value}' to verify real commit - Plumb base_commit + base_commit_matches + stale_base_warning into Status and Doctor JSON surfaces - Emit warning as structured JSON event too (not just stderr) - Regression per failure mode Joins Silent-flag/documented-but-unenforced (#96-#101, #104, #108, #111, #115, #116, #117, #118, #119, #121) as 15th. Joins Parser-level trust gaps: #108 + #117 + #119 + #122 — billable-token silent-burn via parser too-eager consumption. Joins Parallel-entry-point asymmetry (#91, #101, #104, #105, #108, #114, #117) as 8th — stale-base implemented for Prompt but absent from Status/Doctor. Joins Truth-audit — 'expected base commit (doctor)' lies by including user's mistake as truth. Cross-cluster with Unplumbed-subsystem (#78, #96, #100, #102, #103, #107, #109, #111, #113, #121) — stale-base signal in runtime but not JSON. Natural bundles: Parser-level trust gap quintet (grown): #108 + #117 + #119 + #122 — billable-token silent-burn via parser too-eager consumption. #100 + #122 — stale-base diagnostic-integrity pair: #100 stale-base subsystem unplumbed (general) #122 --base-commit accepts anything, greedy, Status/Doctor JSON unplumbed (specific) Filed in response to Clawhip pinpoint nudge 1494978319920136232 in #clawcode-building-in-public.	2026-04-18 18:03:35 +09:00
YeonGyu-Kim	d1608aede4	ROADMAP #121 : hooks schema incompatible with Claude Code; error message misleading; doctor JSON emits 2 objects on failure breaking single-doc parsing; doctor has duplicate message+report fields Dogfooded 2026-04-18 on main HEAD b81e642 from /tmp/cdWW. Four related findings in one: 1. hooks schema incompatible with Claude Code (primary): claw-code: {'hooks':{'PreToolUse':['cmd1','cmd2']}} Claude Code: {'hooks':{'PreToolUse':[ {'matcher':'Bash','hooks':[{'type':'command','command':'...'}]} ]}} Flat string array vs matcher-keyed object array. Incompatible. User copying .claude.json hooks to .claw.json hits parse-fail. 2. Error message misleading: 'field hooks.PreToolUse must be an array of strings, got an array' Both input and expected are arrays. Correct diagnosis: 'got an array of objects where array of strings expected' 3. Missing Claude Code hook event types: claw-code supports: PreToolUse, PostToolUse, PostToolUseFailure Claude Code supports: above + UserPromptSubmit, Notification, Stop, SubagentStop, PreCompact, SessionStart 5+ event types missing. matcher regex not supported. type: 'command' vs type: 'http' extensibility not supported. 4. doctor NDJSON output on failures: With failures present, --output-format json emits TWO concatenated JSON objects on stdout: Object 1: {kind:'doctor', has_failures:true, ...} Object 2: {type:'error', error:'doctor found failing checks'} python json.load() fails: 'Extra data: line 133 column 1' Flag name 'json' violated — NDJSON is not JSON. 5. doctor message + report byte-duplicated: .message and .report top-level fields have identical prose content. Parser ambiguity + byte waste. Trace: config.rs:750-771 parse_optional_hooks_config_object: optional_string_array(hooks, 'PreToolUse', context) Expects ['cmd1', 'cmd2']. Claude Code gives [{matcher,hooks:[{type,command}]}]. Schema-incompatible. config.rs:775-779 validate_optional_hooks_config: calls same parser. Error bubbles up. Message comes from optional_string_array path — technically correct but misleading. Fix shape (~200 lines + migration docs): - Dual-schema hooks parser: accept native + Claude Code forms - Add missing event types to RuntimeHookConfig - Implement matcher regex - Fix error message to distinguish array-element types - Fix doctor: single JSON object regardless of failure state - De-duplicate message + report (keep report, drop message) - Regression per schema form + event type + matcher Joins Claude Code migration parity (#103, #109, #116, #117, #119, #120) as 7th — most severe parity break since hooks is load-bearing automation infrastructure. Joins Truth-audit on misleading error message. Joins Silent-flag on --output-format json emitting NDJSON. Cross-cluster with Unplumbed-subsystem (#78, #96, #100, #102, #103, #107, #109, #111, #113) — hooks subsystem exists but schema incompatible with reference implementation. Natural bundles: Claude Code migration parity septet (grown flagship): #103 + #109 + #116 + #117 + #119 + #120 + #121 Complete coverage of every migration failure mode. #107 + #121 — hooks-subsystem pair: #107 hooks invisible to JSON diagnostics #121 hooks schema incompatible with migration source Filed in response to Clawhip pinpoint nudge 1494963222157983774 in #clawcode-building-in-public.	2026-04-18 17:03:14 +09:00
YeonGyu-Kim	b81e6422b4	ROADMAP #120 : .claw.json custom JSON5-partial parser accepts trailing commas but silently drops comments/unquoted/BOM; combined with alias table 'default'→ReadOnly + no-config→DangerFullAccess creates security-critical user-intent inversion Dogfooded 2026-04-18 on main HEAD 7859222 from /tmp/cdVV. Extends #86 (silent-drop general case) with two new angles: 1. JSON5-partial acceptance matrix: ACCEPTED (loaded correctly): - trailing comma (one) SILENTLY DROPPED (loaded_config_files=0, zero stderr, exit 0): - line comments (//) - block comments (/* */) - unquoted keys - UTF-8 BOM - single quotes - hex numbers - leading commas - multiple trailing commas 8 cases tested, 1 accepted, 7 silently dropped. The 1 accepted gives false signal of JSON5 tolerance. 2. Alias table creates user-intent inversion: config.rs:856-858: 'default' \| 'plan' \| 'read-only' => ReadOnly 'acceptEdits' \| 'auto' \| 'workspace-write' => WorkspaceWrite 'dontAsk' \| 'danger-full-access' => DangerFullAccess CRITICAL: 'default' in the config file = ReadOnly no config at all = DangerFullAccess (per #87) These are OPPOSITE modes. Security-inversion chain: user writes: {'// comment', 'defaultMode': 'default'} user intent: read-only parser: rejects comment read_optional_json_object: silently returns Ok(None) config loader: no config present permission_mode: falls back to no-config default = DangerFullAccess ACTUAL RESULT: opposite of intent. ZERO warning. Trace: config.rs:674-692 read_optional_json_object: is_legacy_config = (file_name == '.claw.json') match JsonValue::parse(&contents) { Ok(parsed) => parsed, Err(_error) if is_legacy_config => return Ok(None), Err(error) => return Err(ConfigError::Parse(...)), } is_legacy silent-drop. (#86 covers general case) json.rs JsonValue::parse — custom parser: accepts trailing comma rejects everything else JSON5-ish Fix shape (~80 lines, overlaps with #86): - Pick policy: strict JSON or explicit JSON5. Enforce consistently. - Apply #86 fix here: replace silent-drop with warn-and-continue, structured warning in stderr + JSON surface. - Rename 'default' alias OR map to 'ask' (matches English meaning). - Structure status output: add config_parse_errors:[] field so claws detect silent drops via JSON without stderr-parsing. - Regression matrix per JSON5 feature + security-invariant test. Joins Permission-audit/tool-allow-list (#94, #97, #101, #106, #115) as 6th — this is the CONFIG-PARSE anchor of the permission- posture problem. Complete matrix: #87 absence → DangerFullAccess #101 env-var fail-OPEN → DangerFullAccess #115 init-generated dangerous default → DangerFullAccess #120 config parse-drops → DangerFullAccess Joins Truth-audit on loaded_config_files=0 + permission_mode= danger-full-access inconsistency without config_parse_errors[]. Joins Reporting-surface/config-hygiene (#90, #91, #92, #110, #115, #116) on silent-drop-no-stderr-exit-0 axis. Joins Claude Code migration parity (#103, #109, #116, #117, #119) as 6th — claw-code is strict-where-Claude-was-lax (#116) AND lax-where-Claude-was-strict (#120). Maximum migration confusion. Natural bundles: #86 + #120 — config-parse reliability pair: silent-drop general case (#86) + JSON5-partial-acceptance + alias-inversion (#120) Permission-drift-at-every-boundary 4-way: #87 + #101 + #115 + #120 — absence + env-var + init + config-drop. Complete coverage of every path to DangerFullAccess. Security-critical permission drift audit mega-bundle: #86 + #87 + #101 + #115 + #116 + #120 — five-way sweep of every path to wrong permissions. Filed in response to Clawhip pinpoint nudge 1494955670791913508 in #clawcode-building-in-public.	2026-04-18 16:34:19 +09:00

1 2 3 4 5

215 Commits