mirror of
https://github.com/affaan-m/everything-claude-code.git
synced 2026-06-16 16:36:53 +08:00
Remove model version numbers so that the rules stay relevant as the new models are released
1.6 KiB
1.6 KiB
Performance Optimization
Model Selection Strategy
Haiku (90% of Sonnet capability, 3x cost savings):
- Lightweight agents with frequent invocation
- Pair programming and code generation
- Worker agents in multi-agent systems
Sonnet (Best coding model):
- Main development work
- Orchestrating multi-agent workflows
- Complex coding tasks
Opus (Deepest reasoning):
- Complex architectural decisions
- Maximum reasoning requirements
- Research and analysis tasks
Context Window Management
Avoid last 20% of context window for:
- Large-scale refactoring
- Feature implementation spanning multiple files
- Debugging complex interactions
Lower context sensitivity tasks:
- Single-file edits
- Independent utility creation
- Documentation updates
- Simple bug fixes
Extended Thinking + Plan Mode
Extended thinking is enabled by default, reserving up to 31,999 tokens for internal reasoning.
Control extended thinking via:
- Toggle: Option+T (macOS) / Alt+T (Windows/Linux)
- Config: Set
alwaysThinkingEnabledin~/.claude/settings.json - Budget cap:
export MAX_THINKING_TOKENS=10000(bash) or$env:MAX_THINKING_TOKENS = "10000"(PowerShell) - Verbose mode: Ctrl+O to see thinking output
For complex tasks requiring deep reasoning:
- Ensure extended thinking is enabled (on by default)
- Enable Plan Mode for structured approach
- Use multiple critique rounds for thorough analysis
- Use split role sub-agents for diverse perspectives
Build Troubleshooting
If build fails:
- Use build-error-resolver agent
- Analyze error messages
- Fix incrementally
- Verify after each fix