Category

claim - Test a Claim

Claim

Input: $ARGUMENTS


Interpretations

Before executing, identify which interpretation matches the user’s input:

Interpretation 1 — Direct assertion: The user states something as fact (“X is true”, “Y causes Z”) and it needs testing. Interpretation 2 — Uncertain belief: The user holds a belief with some doubt (“I think X”, “Is X true?”) and wants it examined. Interpretation 3 — Received wisdom: The user presents something they’ve heard or read (“People say X”, “The conventional view is X”) and wants to know if it holds up.

If ambiguous, ask: “Are you asserting this, questioning it, or reporting what others believe?” If clear from context, proceed with the matching interpretation.


Core Principles

  1. Claims are tested, not confirmed. The default posture is skeptical curiosity, not validation. Testing means genuinely trying to break the claim, not finding reasons to agree with it.

  2. Bundled claims must be unbundled. “Remote work is better” contains at least three claims: better for whom, better by what measure, and better under what conditions. Each gets tested separately. The most load-bearing claim goes first.

  3. Untestable claims are restated, not dismissed. “Freedom is the highest value” can’t be tested as-is, but its observable consequences can. Convert to testable predictions, then test those.

  4. The user’s confidence determines the balance. Confident assertions need harder adversarial testing (push AW). Uncertain doubts need harder supportive testing (push AR). Never match the user’s emotional investment — counterbalance it.

  5. Verdicts are earned, not declared. A claim is VALIDATED only when AR evidence reaches bedrock and AW doesn’t find fatal flaws at bedrock. “Probably true” is not a verdict — it’s a failure to test deeply enough.

  6. The most interesting finding is usually the condition. Most claims are neither universally true nor universally false. The valuable output is: “True when A, false when B, and the crux is whether you’re in A or B.”


Routing Decisions

1. Extract the Proposition

What is being claimed? State it as a single testable sentence.

If the input is vague, restate it precisely before proceeding. A claim must be falsifiable — it must be possible for it to be wrong.

2. Is This Actually a Claim?

  • “Should I X?” → This is a decision, not a claim. → INVOKE: /decide $ARGUMENTS
  • “I want X” → This is a goal. → INVOKE: /want $ARGUMENTS
  • “I’m frustrated that X” → This is emotional. → INVOKE: /emotion $ARGUMENTS
  • “What are the options for X?” → This is exploration. → INVOKE: /search $ARGUMENTS
  • “I think X” (tentative, uncertain) → Use /it to formalize the claim first. → INVOKE: /it $ARGUMENTS
  • “X, but Y” (claim with objection attached) → Use /but to separate and resolve. → INVOKE: /but $ARGUMENTS
  • “I’m not sure about X” → Use /nsa to classify the uncertainty first. → INVOKE: /nsa $ARGUMENTS
  • “Handle this” (vague) → INVOKE: /handle $ARGUMENTS
  • If it IS a claim → continue.

3. Claim Shape Detection

  • Single claim: proceed directly.
  • Multiple claims bundled: unbundle into separate claims. Test the most load-bearing one first, then the others.
  • Claim with “etc” or “and so on”: expand the implicit list first. → INVOKE: /etc or /aso on the claim tail, then test each expanded claim.
  • Claim that is a platitude (“Move fast and break things”, “Fail fast”): needs operationalization before testing. → INVOKE: /platitude $ARGUMENTS — then test the operationalized version.
  • Contradictory platitudes (“Move fast” vs “Measure twice”): → INVOKE: /platitudes $ARGUMENTS
  • Hidden compound: “X is better than Y” bundles “X has property A”, “Y lacks property A”, and “property A is the right measure.” Unbundle all three.

4. Is the Claim Testable?

  • Testable (“Remote work is more productive”): proceed to ARAW.
  • Untestable/definitional (“Freedom is the highest value”): unpack what it would MEAN for this to be true — what observable consequences follow? Test those instead.
  • Ethical/moral claim (“We should do X because it’s right”): needs ethical analysis alongside truth-testing. → INVOKE: /eth $ARGUMENTS — then ARAW the factual components.

5. Determine Depth

  • Fragment or single sentence with no qualifiers → 2x
  • Sentence with context or qualifiers → 4x
  • Paragraph with reasoning → 4x-8x
  • User specifies depth → use that
  • User wants maximum rigor → INVOKE: /certainty $ARGUMENTS instead

6. Set AR/AW Balance

  • User says “I believe X” or asserts confidently → push AW harder (test the doubt side more). 60% AW.
  • User says “I doubt X” or is skeptical → push AR harder (find what supports it). 60% AR.
  • User says “Is X true?” with no lean → balanced 50/50.
  • User reports received wisdom (“people say X”) → push AW harder. 60% AW.
  • Default: 50/50.

Exception: If the user’s confidence is about their own future capability or aspiration (‘I can do X’, ‘I will achieve Y’), the confidence is not evidence of bias — it’s a claim about commitment. Test the claim’s feasibility, not the user’s right to be confident.

7. Does the Claim Need Context First?

Before testing, some claims need additional framing:

  • Claim about the future (“AI will replace programmers by 2030”): → also INVOKE: /fut $ARGUMENTS for temporal context.
  • Claim about what’s obvious (“Obviously we should X”): → also INVOKE: /obv $ARGUMENTS to check if the “obvious” holds up.
  • Claim that something is safe/dangerous: → also INVOKE: /saf $ARGUMENTS for safety-specific analysis.
  • Claim with hidden assumptions: → also INVOKE: /aex $ARGUMENTS to surface them before testing.

Execute

→ INVOKE: /araw $ARGUMENTS

Use the depth and balance determined above.

For bundled claims: → Unbundle first, then INVOKE: /araw on the most load-bearing claim → Then INVOKE: /araw on remaining claims if needed

For untestable claims: → Restate as testable prediction first → Then INVOKE: /araw on the restated claim

Supplementary Analysis (invoke when relevant)

SituationAlso invoke
Claim is an argument with premises→ /agsk (argument analysis)
Claim involves predictions→ /fut (future analysis)
Claim involves best/worst outcomes→ /gop, /obo, /ogo
Claim seems too easy or obvious→ /obv (obvious check)
You suspect self-deception→ /sdc (self-deception check)
Claim involves safety→ /saf (safety analysis)
Claim involves ethics→ /eth (ethics analysis)
Claim could be tested empirically→ /abts (A/B test design)

Failure Modes

FailureSignalFix
Confirmation testingAR much deeper than AW; all claims VALIDATEDPush AW harder — find the genuine weaknesses
Unbundling failureCompound claim tested as single unitDecompose into atomic claims, test each
Vague verdict”It’s complicated” or “it depends” without specificsState exactly what it depends ON — name the conditions
Testing the wrong claimLots of analysis but the user’s actual question isn’t answeredRe-read the input; are you testing what they asked?
Definitional dodgeClaim restated until trivially trueTest the version that has real-world implications, not the safe version
Balance mismatchTesting aligns with user’s confidence instead of counterbalancingIf user is confident, push AW. If doubtful, push AR.

Depth Scaling

DepthScopeFloor
1xQuick test — one AR/AW pass on the main claim5 claims, 12 findings
2xStandard — unbundle, test main claim, note conditions7 claims, 20 findings
4xThorough — all claims tested, assumptions surfaced12 claims, 35 findings
8xDeep — full unbundling, all claims to bedrock, alternatives derived18 claims, 55 findings

Pre-Completion Checklist

  • Claim extracted and stated as testable sentence
  • Bundled claims unbundled into separate propositions
  • AR/AW balance set based on user’s confidence level
  • ARAW executed at appropriate depth
  • Verdict derived from evidence (not asserted)
  • Conditions identified (when true / when false)
  • What would change the verdict is stated
  • Hidden assumptions surfaced (via /aex if needed)

After Completion

Report:

  • The claim as tested (restated precisely)
  • Whether it was unbundled (and which sub-claim was most load-bearing)
  • Verdict from ARAW (validated / conditional / damaged / rejected / uncertain)
  • Key AR findings and key AW findings
  • Conditions (true when X, false when Y)
  • What would change the verdict
  • If supplementary skills were used, their findings integrated

Follow-Up Routing

After the claim is tested, the user may need:

  • “What should I do about this?” → INVOKE: /decide or /how
  • “What are the implications?” → INVOKE: /sycs $ARGUMENTS
  • “What’s the worst case?” → INVOKE: /dys $ARGUMENTS
  • “What skill should I run next?” → INVOKE: /next or /fonss

Integration

  • Use from: /emotion (doubt about self becomes testable claim), /evaluate (core claims extracted and tested), /diagnose (suspected cause tested as claim), /decide (each option’s premises tested)
  • Routes to: /araw (primary — the testing engine), /aex (assumption extraction), /it (claim formalization), /but (claim-objection separation), /eth (ethical claims), /fut (future claims)
  • Differs from: /viability (claim tests truth, viability tests workability), /evaluate (claim tests a proposition, evaluate assesses a piece of work), /decide (claim tests one statement, decide compares options)
  • Complementary: /aex (surface hidden assumptions), /se (enumerate alternative claims), /ver (GOSM verification for empirical claims), /obv (test “obvious” claims)