Procedure Extraction from Source

Overview

Meta-procedure for extracting implicit procedures from ANY external source, turning tacit knowledge into explicit, reusable GOSM-compatible procedures.

Steps

Step 1: Surface scan

Quick pass to understand scope and structure:

Skim entire source (or sample if very long)
- For videos: watch at 2x, note structure
- For papers: read abstract, intro, conclusion
- For books: read table of contents, chapter summaries
Note structure:
- Chapters, sections, segments
- How content is organized
- Where “how to” content is concentrated
Identify main topics covered
Estimate procedure density:
- High: mostly “how to” content
- Medium: mix of explanation and procedure
- Low: mostly “what is” content

Output: Source map with annotated sections

Step 2: Creator analysis

Understand WHO created this and WHY it matters:

Questions to answer:

What is this person known for?
What unique perspective or access do they have?
What can they do that others can’t?
What’s their track record of results?
Why should I trust their procedures?

Credibility signals:

Verifiable achievements
Peer recognition
Students/followers with results
Published/cited work
Professional credentials

Warning signs:

Only self-reported success
No verifiable outcomes
Contradicted by evidence

Output: Creator profile with credibility assessment

Step 3: Identify procedure signals

Find markers that indicate procedures exist:

EXPLICIT SIGNALS (direct statements):

“Here’s how I do it…”
“The steps are…”
“First… then… finally…”
“My process is…”
“The framework I use…”
Numbered lists
Flowcharts or diagrams

IMPLICIT SIGNALS (patterns in behavior):

Consistent patterns across examples
Repeated decision points
“I always…” or “I never…”
Unstated assumptions that enable success
What they do WITHOUT explaining why
Transitions between states
Error handling / edge cases mentioned

META SIGNALS (about their thinking):

How they explain their own thinking
How they teach others
What they emphasize vs skip
Where they slow down (important) vs speed up (obvious to them)

Create procedure signal map with locations in source.

Step 4: Estimate extraction value

Decide if this source is worth deep extraction:

HIGH VALUE criteria:

Unique procedures not found elsewhere
Proven results from applying these procedures
High procedure density
Transferable to other domains
Creator has exceptional track record

MEDIUM VALUE criteria:

Good procedures but commonly known
Some unique insights
Moderate procedure density

LOW VALUE criteria:

Mostly opinion, little procedure
Procedures are obvious/common
No evidence of results
Very domain-specific, low transfer

Decision:

HIGH: Proceed with full extraction
MEDIUM: Extract selectively (explicit only)
LOW: Skip or defer

Output: Value assessment with go/no-go decision

Step 5: Extract explicit procedures

Extract procedures that are directly stated:

Process:

Find each explicit procedure signal
Copy the stated procedure verbatim
Note any gaps or ambiguities
Mark confidence as HIGH (they said it directly)

For each procedure found, capture:

Name (descriptive)
Source location (timestamp/page/section)
Verbatim quote
Steps as stated
Gaps (what’s missing or unclear)
Notes (additional context)

Do NOT interpret or add - capture what’s actually said.

Step 6: Extract implicit procedures

Extract procedures hidden in behavior and examples:

Techniques:

Pattern Matching
- “Every time X happens, they do Y”
- “They always check Z before proceeding”
- “The order is consistently A then B then C”
Counterfactual Analysis
- “What would happen if they skipped this step?”
- “Why this order and not another?”
- “What constraint does this step satisfy?”
Gap Filling
- “They jumped from A to C - what’s B?”
- “This assumes X - where did X come from?”
- “They didn’t explain why - what’s the reason?”

For each implicit procedure, capture:

Name (descriptive)
Source location (where observed)
Observed pattern (what you saw)
Evidence (examples supporting the inference)
Reconstructed steps
Uncertainty (what you’re not sure about)
Confidence: MEDIUM

These require interpretation - be explicit about inferences.

Step 7: Extract meta-procedures

Extract HOW they think, learn, teach, and improve:

LEARNING PROCEDURES:

How do they acquire new knowledge?
What sources do they use?
How do they validate what they learn?
How do they integrate new with existing?

TEACHING PROCEDURES:

How do they structure explanations?
What examples do they choose?
How do they handle confusion?
What do they emphasize?

THINKING PROCEDURES:

How do they approach problems?
What mental models do they use?
How do they make decisions?

IMPROVEMENT PROCEDURES:

How do they refine their methods?
How do they handle failure?
How do they incorporate feedback?
How do they know when something is “good enough”?

Meta-procedures are often the most transferable.

Step 8: Excavate tacit knowledge

Surface knowledge they have but don’t state:

Techniques:

Assumption Surfacing
- “What must be true for this to work?”
- “What do they take for granted?”
- “What would a beginner miss?”
- “What ‘obvious’ thing isn’t obvious?”
Expert Blind Spot Detection
- “What do they skip because it’s automatic?”
- “Where do they use jargon without explaining?”
- “What intermediate steps are missing?”
- “What do they ‘just know’?”
Failure Mode Inference
- “What could go wrong that they don’t mention?”
- “What warnings would an expert give?”
- “What mistakes did they probably make learning this?”
Context Dependency Mapping
- “When would this NOT work?”
- “What conditions are assumed?”
- “What resources are assumed?”

Confidence: LOW for tacit knowledge (needs validation)

Step 9: Formalize procedures

Convert raw extractions to GOSM format:

For each procedure, create structured YAML:

name: "pefs - Procedure Extraction from Source"
version: "1.0"
source:
  origin: "[Source name]"
  creator: "[Who]"
  location: "[Where in source]"
  extraction_date: "[Date]"
  confidence: "[HIGH/MEDIUM/LOW]"

description: |
  [What this procedure does]

when_to_use:
  - [Trigger condition 1]
  - [Trigger condition 2]

inputs:
  [input_name]:
    description: "[What it is]"
    required: [true/false]

outputs:
  [output_name]:
    description: "[What it produces]"

steps:
  - id: 1
    name: "[Step name]"
    action: "[What to do]"
  - id: 2
    name: "[Step name]"
    action: "[What to do]"

verification:
  - "[How to know it worked]"

failure_modes:
  - mode: "[What can go wrong]"
    resolution: "[What to do]"

notes:
  - "[Additional context]"
  - "[Caveats]"

Fill gaps where possible, mark unknowns explicitly.

Step 10: Assess completeness

Rate how complete and usable each procedure is:

COMPLETE:

Could be used immediately by someone unfamiliar
All steps explicit
All inputs defined
Verification clear
Failure modes covered

USABLE:

Could be used with some domain knowledge
Core steps clear
Some gaps but manageable
Verification possible

PARTIAL:

Needs more work before use
Key steps identified
Significant gaps remain
Would need expert to fill in

SKETCH:

Rough outline only
General shape visible
Most details missing
Starting point for further extraction

Categorize each procedure and note what’s needed to improve.

Step 11: Validate extractions

Verify extraction quality:

Internal Consistency
- Do steps follow logically?
- Are inputs sufficient for outputs?
- Are there circular dependencies?
- Does verification actually verify?
Source Fidelity
- Re-read source with procedure in hand
- Does procedure match observed behavior?
- Did we add anything not in source?
- Did we miss anything important?
Usefulness Test
- Read procedure cold - is it clear?
- Simulate following it - do you get stuck?
- Compare to existing procedures - is it better/different?
- Would you use this yourself?
Uniqueness Check
- Does similar procedure already exist in library?
- If yes, is this better or just different?
- What unique value does this add?

Decision: add / merge / enhance existing / skip

Step 12: Integrate into library

Add validated procedures to GOSM library:

Create procedure file: Path: library/procedures/extracted/[source_type]/[source_name]/[procedure_name].yaml
Update indexes:
- library/procedures/extracted/EXTRACTED_PROCEDURES_INDEX.md
- Any domain-specific indexes
Cross-reference with related procedures:
- Find similar procedures
- Note relationships in both
Create extraction log entry:
- Source name and type
- Creator
- Procedures extracted (count by type)
- Time spent
- Value assessment

When to Use

When encountering high-value content with extractable procedures
When a source creator has unique knowledge or access
When building procedure library from curated sources
When wanting to learn from experts systematically
When source has proven results worth capturing
After prioritizing sources using source_prioritization procedure
When noticed patterns in external content worth codifying
When seeking to transfer knowledge from one domain to another

Verification

Source was assessed for value before deep extraction
Explicit procedures have verbatim source quotes
Implicit procedures have evidence from source
Tacit knowledge is clearly marked as inference
All procedures are in valid GOSM format
Completeness ratings reflect actual usability
Integration decisions are justified
Extraction logged for future reference