Failure Recovery
Input: $ARGUMENTS
Interpretations
Before executing, identify which interpretation matches the user’s input:
Interpretation 1 — Recover from a specific project failure: The user has hit a concrete blocker — a strategy that isn’t working, a blown timeline, exhausted resources, or an external shock — and needs structured recovery procedures. Interpretation 2 — Decide whether to pivot or persist: The user is uncertain whether the project should continue at all, and needs help classifying the failure and evaluating whether the goal, strategy, or plan is still valid. Interpretation 3 — Conduct a retrospective on a past failure: The user wants to analyze a failure that already happened to extract learnings and prevent recurrence, rather than recover from an active crisis.
If ambiguous, ask: “I can help with recovering from an active failure, deciding whether to pivot or persist, or conducting a retrospective on a past failure — which fits?” If clear from context, proceed with the matching interpretation.
Overview
Structured recovery procedures when projects encounter failures, providing clear decision trees and specific actions for common failure modes.
Steps
Step 1: Identify and classify failure
Use the master decision tree to classify the failure:
-
Is this a complete blocker? (Cannot proceed at all)
- YES: Continue to step 2
- NO: Continue with workaround, document risk, exit procedure
-
Is the goal still valid? (Do you still want this outcome?)
- YES: Continue to step 3
- NO: Archive project, complete LEARNINGS.md, exit procedure
-
Is the strategy still valid? (Is approach fundamentally sound?)
- YES: Continue to step 4
- NO: Return to Strategy Phase, run Strategy Search again
-
Is the plan still valid? (Are steps right, just need better execution?)
- YES: Fix execution issue, add safeguards, continue
- NO: Return to Planning Phase, revise gates
Classify into failure type:
- validation_failed: Discovery shows no viable path
- timeline_blown: Significantly behind schedule
- strategy_not_working: Execution not producing results
- resource_exhausted: Budget, time, or energy depleted
- external_shock: External event invalidates plan
Step 2: Diagnose root cause
For each failure type, answer diagnostic questions:
VALIDATION FAILED:
- Was the niche too narrow or too broad?
- Were you talking to the right people?
- Was the value proposition clear?
- Is the problem real but your solution wrong?
TIMELINE BLOWN:
- Was the estimate unrealistic?
- Did scope creep occur?
- Were there unexpected blockers?
- Is effort/time allocation sufficient?
STRATEGY NOT WORKING:
- Is the strategy wrong, or execution wrong?
- Did market conditions change?
- Were Adversarial Review assumptions violated?
- Is there new information that invalidates strategy?
RESOURCE EXHAUSTED:
- Which resource is the binding constraint?
- Is the goal achievable with remaining resources?
- What’s the minimum viable completion?
EXTERNAL SHOCK:
- Is this temporary or permanent?
- Does this affect the goal or just the strategy?
- Can we adapt or must we wait?
Document answers to build diagnosis.
Step 3: Evaluate recovery options
Based on failure type and diagnosis, evaluate recovery options:
VALIDATION FAILED OPTIONS: A. Pivot to adjacent niche (2-3 weeks)
- When: Core problem valid, wrong audience
- Steps: List adjacent audiences, run discovery, score, restart Phase 2
B. Pivot to different problem (3-4 weeks)
- When: This problem isn’t real or big enough
- Steps: Review notes for other pain points, select alternative, return to Assessment
C. Switch to cloning strategy (1-2 weeks)
- When: Niche discovery systematically failing
- Steps: Identify successful products, find underserved segments, switch strategy
D. Abandon this goal (1-2 days)
- When: Fundamental assumption was wrong
- Steps: Complete LEARNINGS.md, archive project, start fresh
TIMELINE BLOWN OPTIONS: A. Reduce scope (1-2 days planning)
- When: Trying to do too much
- Steps: List deliverables, categorize MUST/SHOULD/COULD, cut to MUST
B. Extend timeline (immediate)
- When: Scope correct, just slower than expected
- Steps: Calculate realistic time, update milestones, communicate
C. Increase resources (varies)
- When: Time-resource tradeoff available
- Steps: Identify parallelizable work, determine if help possible
STRATEGY NOT WORKING OPTIONS: A. Adjust strategy parameters (1-2 weeks)
- When: Strategy sound, needs tuning
- Steps: Identify parameters, run experiments, adopt best variation
B. Switch to backup strategy (1 week)
- When: Pre-identified backup exists
- Steps: Review backup, verify assumptions, run Adversarial Review, switch
C. Return to Strategy Search (2-3 weeks)
- When: Need fundamentally different approach
- Steps: Document why current failed, use as input to new Strategy Search
RESOURCE EXHAUSTED OPTIONS: A. Minimum viable completion (varies)
- When: Can achieve partial goal with remaining resources
- Steps: Define MVP, cut to essentials, focus remaining resources
B. Pause and replenish (pause period + ramp-up)
- When: Goal still valuable, just need to recover
- Steps: Document state, set pause period, replenish, resume
C. Pivot to lower-resource approach (1-2 weeks)
- When: Different path requires fewer resources
- Steps: Identify alternatives, evaluate, switch approach
EXTERNAL SHOCK OPTIONS: A. Wait it out (disruption duration)
- When: Temporary disruption
- Steps: Estimate duration, pause project, monitor, resume when clear
B. Adapt to new reality (1-3 weeks)
- When: Permanent change but goal still achievable
- Steps: Analyze new landscape, update strategy, continue
C. Abandon and learn (1-2 days)
- When: Change makes goal unachievable or undesirable
- Steps: Accept, document learnings, archive, consider new goal
Step 4: Select recovery path
Choose the best recovery option based on:
-
Alignment with root cause
- Does this option address the actual problem?
-
Resource requirements
- Do you have the time/money/energy for this option?
-
Risk profile
- What could go wrong with this recovery?
- Is failure of recovery acceptable?
-
Learning potential
- What will you learn from this path?
-
Goal preservation
- Does this keep you moving toward the original goal?
- Or does it require goal adjustment?
Make explicit selection with documented rationale. Get stakeholder buy-in if applicable.
Step 5: Create recovery plan
Build concrete recovery plan:
-
Define recovery milestones
- What does “recovered” look like?
- What are intermediate checkpoints?
-
List specific actions
- What needs to happen in what order?
- Who does what?
-
Set timeline
- When will each milestone be hit?
- Add buffer for recovery-specific uncertainty
-
Define abort criteria
- When would you abandon this recovery?
- What signals that recovery isn’t working?
-
Update project documents
- STATE.md: reflect recovery mode
- DECISION_TREE.md: document recovery decision
Step 6: Execute recovery
Execute the recovery plan:
- Update STATE.md to reflect recovery in progress
- Follow recovery milestones
- Regular check-ins against abort criteria
- Watch for signs recovery isn’t working:
- Missing recovery milestones
- New blockers appearing
- Resource burn higher than expected
If abort criteria triggered:
- Stop current recovery
- Return to Step 3 with updated information
- Select different recovery option
Step 7: Document learnings
After recovery (successful or not):
-
Document what caused the failure
- Root cause
- Warning signs that were missed
- What could have prevented it
-
Document what recovery worked (or didn’t)
- Which option was selected
- How well it addressed the root cause
- What would you do differently
-
Update LEARNINGS.md with failure/recovery
- Add to project’s learnings
- Consider adding to library if generalizable
-
Consider procedure improvements
- Should any procedures be updated?
- Should new failure modes be added?
- Should triggers be refined?
When to Use
- When a project encounters a complete blocker
- When key assumptions are invalidated
- When timeline is more than 50% over estimate
- When strategy execution consistently fails to produce expected results
- When resources (budget, time, energy) are exhausted before completion
- When external events significantly change project conditions
- When feeling stuck with no clear path forward
- During retrospectives to analyze past failures
Verification
- Failure is clearly identified and named with classification
- Diagnosis is complete using relevant questions
- Recovery option selected with documented reasoning
- Time/resource cost is estimated for recovery
- Decision is documented in DECISION_TREE.md
- STATE.md reflects recovery mode during execution
- LEARNINGS.md updated after recovery completes