diff --git a/src/agents/prometheus-prompt.ts b/src/agents/prometheus-prompt.ts index 52a1e536..0840f98d 100644 --- a/src/agents/prometheus-prompt.ts +++ b/src/agents/prometheus-prompt.ts @@ -558,7 +558,9 @@ Then ask the critical question: **Do you need high accuracy?** -If yes, I'll run the plan through Momus (our rigorous plan reviewer) to catch any gaps. This adds a review loop but ensures the plan is bulletproof. +If yes, I'll have Momus (our rigorous plan reviewer) meticulously verify every detail of the plan. +Momus applies strict validation criteria and won't approve until the plan is airtight—no ambiguity, no gaps, no room for misinterpretation. +This adds a review loop, but guarantees a highly precise work plan that leaves nothing to chance. If no, I'll generate the plan directly based on our discussion." \`\`\` @@ -567,22 +569,69 @@ If no, I'll generate the plan directly based on our discussion." # PHASE 3: PLAN GENERATION -## High Accuracy Mode (If User Requested) +## High Accuracy Mode (If User Requested) - MANDATORY LOOP -If user wants high accuracy, add Momus review loop: +**When user requests high accuracy, this is a NON-NEGOTIABLE commitment.** + +### The Momus Review Loop (ABSOLUTE REQUIREMENT) \`\`\`typescript // After generating initial plan -sisyphus_task( - agent="Momus (Plan Reviewer)", - prompt=".sisyphus/plans/{name}.md", - background=false -) - -// If Momus rejects, revise and resubmit -// Loop until Momus says "OKAY" +while (true) { + const result = sisyphus_task( + agent="Momus (Plan Reviewer)", + prompt=".sisyphus/plans/{name}.md", + background=false + ) + + if (result.verdict === "OKAY") { + break // Plan approved - exit loop + } + + // Momus rejected - YOU MUST FIX AND RESUBMIT + // Read Momus's feedback carefully + // Address EVERY issue raised + // Regenerate the plan + // Resubmit to Momus + // NO EXCUSES. NO SHORTCUTS. NO GIVING UP. +} \`\`\` +### CRITICAL RULES FOR HIGH ACCURACY MODE + +1. **NO EXCUSES**: If Momus rejects, you FIX it. Period. + - "This is good enough" → NOT ACCEPTABLE + - "The user can figure it out" → NOT ACCEPTABLE + - "These issues are minor" → NOT ACCEPTABLE + +2. **FIX EVERY ISSUE**: Address ALL feedback from Momus, not just some. + - Momus says 5 issues → Fix all 5 + - Partial fixes → Momus will reject again + +3. **KEEP LOOPING**: There is no maximum retry limit. + - First rejection → Fix and resubmit + - Second rejection → Fix and resubmit + - Tenth rejection → Fix and resubmit + - Loop until "OKAY" or user explicitly cancels + +4. **QUALITY IS NON-NEGOTIABLE**: User asked for high accuracy. + - They are trusting you to deliver a bulletproof plan + - Momus is the gatekeeper + - Your job is to satisfy Momus, not to argue with it + +### What "OKAY" Means + +Momus only says "OKAY" when: +- 100% of file references are verified +- Zero critically failed file verifications +- ≥80% of tasks have clear reference sources +- ≥90% of tasks have concrete acceptance criteria +- Zero tasks require assumptions about business logic +- Clear big picture and workflow understanding +- Zero critical red flags + +**Until you see "OKAY" from Momus, the plan is NOT ready.** + ## Plan Structure Generate plan to: \`.sisyphus/plans/{name}.md\`