claw-code

mirror of https://github.com/ultraworkers/claw-code.git synced 2026-04-29 00:02:01 +08:00

Author	SHA1	Message	Date
Yeachan-Heo	6db68a2baa	Expose tool permission gates as structured worker blockers Worker boot could previously stall on an interactive MCP/tool permission prompt while readiness and startup-timeout surfaces only had generic idle/no-evidence shapes. This adds a first-class blocked lifecycle state, structured event payload, startup evidence fields, and regression coverage so callers can report the exact server/tool gate instead of pane-scraping. Constraint: ROADMAP #200 requires tool/server identity, prompt age, and session-only versus always-allow capability in status/evidence surfaces Rejected: Treat MCP/tool prompts as trust gates \| conflates distinct prompts and loses tool identity Rejected: Leave allow-scope as pane text only \| clawhip still could not classify the blocker without scraping Confidence: high Scope-risk: moderate Directive: Keep tool_permission_required distinct from trust_required; downstream claws rely on server/tool payload plus allow-scope metadata Tested: cargo test -p runtime tool_permission Tested: cargo fmt -p runtime -- --check && cargo clippy -p runtime --all-targets -- -D warnings && cargo test -p runtime Tested: cargo test --workspace Not-tested: live interactive MCP permission prompt in tmux	2026-04-27 09:28:09 +00:00
Yeachan-Heo	5b910356a2	Preserve trust boundaries during pulled follow-up The pull brought the branch current with origin/main while replaying local follow-up work. Conflict resolution kept the roadmap/progress additions and integrated the runtime event/trust changes with upstream's newer surfaces. The trust allowlist now treats worktree_pattern as an additional required predicate, including the missing-worktree case, so auto-trust cannot fall back to cwd-only matching when a worktree constraint was declared. The runtime formatting cleanup keeps clippy/fmt green after the merge. Constraint: Local branch was 109 commits behind origin/main with dirty tracked follow-up work. Rejected: Drop the autostash after conflict resolution \| keeping it preserves a reversible safety backup for unrelated recovery. Confidence: high Scope-risk: moderate Directive: Do not relax worktree_pattern matching without preserving the missing-worktree regression. Tested: git diff --cached --check; cargo fmt -p runtime -- --check; cargo clippy -p runtime --all-targets -- -D warnings; cargo test -p runtime; cargo test --workspace; architect verification approved Not-tested: Live tmux/worker auto-trust behavior outside unit/integration tests	2026-04-27 09:05:50 +00:00
Yeachan-Heo	87b982ece5	US-011: Performance optimization for API request serialization Added criterion benchmarks and optimized flatten_tool_result_content: - Added criterion dev-dependency and request_building benchmark suite - Optimized flatten_tool_result_content to pre-allocate capacity and avoid intermediate Vec construction (was collecting to Vec then joining) - Made key functions public for benchmarking: translate_message, build_chat_completion_request, flatten_tool_result_content, is_reasoning_model, model_rejects_is_error_field Benchmark results: - flatten_tool_result_content/single_text: ~17ns - translate_message/text_only: ~200ns - build_chat_completion_request/10 messages: ~16.4µs - is_reasoning_model detection: ~26-42ns All 119 unit tests and 29 integration tests pass. cargo clippy passes Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-04-16 11:11:45 +00:00
Yeachan-Heo	f65d15fb2f	US-010: Add model compatibility documentation Created comprehensive MODEL_COMPATIBILITY.md documenting: - Kimi models is_error exclusion (prevents 400 Bad Request) - Reasoning models tuning parameter stripping (o1, o3, o4, grok-3-mini, qwen-qwq) - GPT-5 max_completion_tokens requirement - Qwen model routing through DashScope Includes implementation details, key functions table, guide for adding new models, and testing commands. Cross-referenced with existing code comments. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-04-16 10:55:58 +00:00
Yeachan-Heo	3e4e1585b5	US-009: Add comprehensive unit tests for kimi model compatibility fix Added 4 unit tests to verify is_error field handling for kimi models: - model_rejects_is_error_field_detects_kimi_models: Detects kimi-k2.5, kimi-k1.5, dashscope/kimi-k2.5 (case insensitive) - translate_message_includes_is_error_for_non_kimi_models: Verifies gpt-4o, grok-3, claude include is_error - translate_message_excludes_is_error_for_kimi_models: Verifies kimi models exclude is_error (prevents 400 Bad Request) - build_chat_completion_request_kimi_vs_non_kimi_tool_results: Full integration test for request building All 119 unit tests and 29 integration tests pass. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-04-16 10:54:48 +00:00
Yeachan-Heo	6376694669	Mark all 7 roadmap stories complete with PRD and progress record - Update prd.json: mark US-001 through US-007 as passes: true - Add progress.txt: detailed implementation summary for all stories All acceptance criteria verified: - US-001: Startup failure evidence bundle + classifier - US-002: Lane event schema with provenance and deduplication - US-003: Stale branch detection with policy integration - US-004: Recovery recipes with ledger - US-005: Typed task packet format with TaskScope - US-006: Policy engine for autonomous coding - US-007: Plugin/MCP lifecycle maturity	2026-04-16 09:31:47 +00:00

6 Commits