mirror of
https://github.com/affaan-m/everything-claude-code.git
synced 2026-06-16 16:36:53 +08:00
The official Agent Skills spec (agentskills.io/specification) whitelists exactly 6 top-level frontmatter keys (name/description/license/compatibility/metadata/ allowed-tools). A top-level `origin` key fails the official validator (anthropics/skills quick_validate.py ALLOWED_PROPERTIES; skills-ref validate). This moves `origin: X` -> `metadata.origin: X` across the canonical skills/ tree, preserving each value verbatim. Frontmatter-only, minimal diff. - 251 SKILL.md updated (242 new metadata block, 9 appended to existing metadata) - origin values preserved verbatim (verified 251/251) - YAML validated on all changed files - scoped to canonical skills/ only (docs/<lang> translations + tool mirrors .cursor/.kiro/.agents left untouched; presumably regenerated from canonical) Addresses #2233
65 lines
1.7 KiB
Markdown
65 lines
1.7 KiB
Markdown
---
|
|
name: agentic-engineering
|
|
description: Operate as an agentic engineer using eval-first execution, decomposition, and cost-aware model routing.
|
|
metadata:
|
|
origin: ECC
|
|
---
|
|
|
|
# Agentic Engineering
|
|
|
|
Use this skill for engineering workflows where AI agents perform most implementation work and humans enforce quality and risk controls.
|
|
|
|
## Operating Principles
|
|
|
|
1. Define completion criteria before execution.
|
|
2. Decompose work into agent-sized units.
|
|
3. Route model tiers by task complexity.
|
|
4. Measure with evals and regression checks.
|
|
|
|
## Eval-First Loop
|
|
|
|
1. Define capability eval and regression eval.
|
|
2. Run baseline and capture failure signatures.
|
|
3. Execute implementation.
|
|
4. Re-run evals and compare deltas.
|
|
|
|
## Task Decomposition
|
|
|
|
Apply the 15-minute unit rule:
|
|
- each unit should be independently verifiable
|
|
- each unit should have a single dominant risk
|
|
- each unit should expose a clear done condition
|
|
|
|
## Model Routing
|
|
|
|
- Haiku: classification, boilerplate transforms, narrow edits
|
|
- Sonnet: implementation and refactors
|
|
- Opus: architecture, root-cause analysis, multi-file invariants
|
|
|
|
## Session Strategy
|
|
|
|
- Continue session for closely-coupled units.
|
|
- Start fresh session after major phase transitions.
|
|
- Compact after milestone completion, not during active debugging.
|
|
|
|
## Review Focus for AI-Generated Code
|
|
|
|
Prioritize:
|
|
- invariants and edge cases
|
|
- error boundaries
|
|
- security and auth assumptions
|
|
- hidden coupling and rollout risk
|
|
|
|
Do not waste review cycles on style-only disagreements when automated format/lint already enforce style.
|
|
|
|
## Cost Discipline
|
|
|
|
Track per task:
|
|
- model
|
|
- token estimate
|
|
- retries
|
|
- wall-clock time
|
|
- success/failure
|
|
|
|
Escalate model tier only when lower tier fails with a clear reasoning gap.
|