The Critical Thinking Framework (FCT) has two modes of use:
Mode 1 — Direct Application: Apply FCT components to evaluate an existing document, response, or analysis. When a user provides a piece of writing and asks you to critique it, find flaws, or improve it using structured reasoning, use FCT as an analytical lens. Read the document, apply the Reasoning Router to select your analysis method (usually GoT for interconnected multi-part documents), use the Metacognitive Monitor for confidence assessment, and the Bias Detector to surface hidden assumptions. Output specific findings and proposed edits.
Mode 2 — Agent Architecture: Design or implement AI agents with FCT components (metacognition, self-correction, reasoning routing). Appropriate when the user is building a system, not analyzing a single document.
When to use Mode 1 (direct application):
When to use Mode 2 (agent architecture):
Triggers: Use this skill when the user requests:
File: references/reasoning_router.md
Detects problem complexity and routes to the optimal reasoning method (CoT/ToT/GoT/Self-Consistency).
Use cases:
File: references/metacognitive_monitor.md
Self-assessment and error detection in the reasoning process. Implements the Producer-Critic pattern for continuous quality control.
Key features:
File: references/self_verification.md
Implementation of Chain-of-Verification (CoVe) and other self-verification techniques to validate outputs before delivery.
Methods covered:
File: references/bias_detector.md
Detection of cognitive biases in the reasoning process, with strategies for mitigating them.
Bias types covered:
File: references/producer_critic_orchestrator.md
Pattern for orchestrating Generate-Critique-Refine cycles in agent workflows.
Architecture:
File: references/memory_curator.md
Management of episodic memory with quality weighting to prevent memory pollution from bad episodes.
Features:
File: references/reasoning_validator.md
Logical consistency checker and structural validation for reasoning chains.
Validation types:
File: references/reflection_trigger.md
Rule-based triggers to activate self-correction loops based on specific conditions.
Trigger conditions:
If the user invokes /think-critically with no arguments and no document attached, immediately ask:
"What would you like me to evaluate? Share the document, analysis, or response, or describe the system/architecture you want analyzed, and I'll apply the FCT framework."
DO NOT explain the two modes. DO NOT describe Mode 1 or Mode 2. Just ask the question above and wait. Default to Mode 1 (direct application) unless the user explicitly mentions building an agent.
User Request → Two entry points:
[Direct Application] User provides a document to evaluate
├─ ⚠️ PRE-FLIGHT (DO THIS BEFORE ANY READ TOOL CALL):
│ CATEGORICAL RULE: If the path contains doc_converter/, ALWAYS use Grep first —
│ NEVER call Read on the full file regardless of extension or apparent size.
│ doc_converter/ files are converted spreadsheets/PDFs and are ALWAYS 200-500KB
│ with extremely long lines that GUARANTEE limit failures. This is not a judgment
│ call — it is an absolute rule with no exceptions.
│ For ALL other files, ask yourself: Is this a converted .xlsx/.pdf/.docx/.csv?
│ Could it exceed 50KB? If YES to EITHER question, STOP — use Grep instead.
│ Full reads of converted documents ALWAYS hit the 256KB byte limit or 25K token
│ limit, wasting turns before any analysis. This guard applies even when the file
│ path looks like a markdown file — converted spreadsheets saved as .md are
│ regularly 300-400KB with extremely long lines.
│ ALSO APPLIES TO: thoughts/shared/plans/ and thoughts/shared/research/ documents.
│ Architecture plans and research documents in thoughts/ are plain markdown but
│ routinely exceed 10K tokens (the per-read limit). ALWAYS use offset/limit or
│ Grep for these files — NEVER attempt a full read without checking size first.
│ NEVER make parallel Read calls on the same large file: if one call would exceed
│ the limit, ALL parallel calls will fail simultaneously, tripling wasted turns.
│ Call Grep first (single call), then Read specific line ranges as follow-up.
│
├─ Step 1: Read the document (targeted reads only per pre-flight above).
│ BATCH YOUR READS: Before calling any Grep or Read, enumerate ALL the sections
│ or fields you need from the document in one pass. Then Grep for them in sequence.
│ Do NOT interleave reads with analysis — gather all evidence first, then synthesize.
│ This avoids returning to the same large file 5-6 times, which wastes turns and
│ context on repeated file overhead.
│ Read ONLY the document the user provided. Do not read other user
│ documents, prior session outputs, or HTML artifacts unless the user explicitly
│ references them or asks for a comparison. (Internal skill references like
│ reasoning_router.md are loaded in later steps — they are not "auxiliary files."
│ EXCEPTION: If the document is a draft produced by another skill (e.g., a demo
│ request, business case, or ROI model), load that skill's SKILL.md and/or
│ references/ files to understand the intended format and quality bar. This
│ context is required to evaluate the draft against the right criteria.)
├─ Step 2: Apply Reasoning Router to select analysis method
│ └─ Multi-part interconnected doc → GoT
│ └─ Single linear argument → CoT
│ └─ Multiple competing hypotheses → ToT
├─ Step 3: Run Metacognitive Monitor — confidence per claim/section
├─ Step 4: Run Bias Detector — flag hidden assumptions
├─ Step 5: Produce findings: evaluate EVERY claim, metric, and assertion in the
│ document — not just the obviously wrong ones. Partial coverage is the most
│ common failure mode. Produce: what to keep, what to cut, P0/P1 edits.
│ DEPENDENCY ORDERING: When evaluating solution sequencing or phase ordering,
│ present findings as hypotheses — the user may have data dependency knowledge
│ (e.g., "B reads from the store that A populates") that overrides FCT logic.
│ If the user corrects your ordering, defer immediately. Do not re-argue.
│ OUTPUT CONSTRAINTS: If the user specifies a format constraint (single slide,
│ one paragraph, 3 bullets), calibrate output to fit that constraint immediately.
│ Do NOT produce a full draft and then compress; ask or infer the constraint first.
│ YES/NO EVALUATION QUESTIONS: When the question is phrased as a binary check
│ ("does this make sense?", "is this the right approach?", "does the plan make sense?",
│ "is this sound?"), default to answer-first format: direct yes/no verdict, 3-5 key
│ supporting reasons, and an offer to expand. Do NOT default to the full GoT writeup
│ for a yes/no question — save the full analysis for when the user asks for depth.
│ CUSTOMER-FACING REVIEW: If the output is a proposal, deck, or vendor response going
│ back to the customer, apply an extra check: (1) Do not use the customer's own internal
│ escalation language against them ("board-flagged," "exec-escalated," "your team said
│ this is a P0"). Reframe discovery call content as business impact, not leverage.
│ (2) Attribute quotes to the business problem, not the individual ("Resolved cases
│ aren't followed up on" not "Robert said you don't call customers back").
│ CITATION REQUIRED (inline, not in a footnote): For every derived calculation
│ (percentage, total, ratio), embed the citation immediately after the number.
│ Format: "[value] (derived from [field name]: $X / $Y = Z%)". Never present a
│ derived number without this inline trace. This is the most common follow-up
│ question — cite proactively, not on request. For cap table reconstructions
│ specifically, add a standalone "Derivations" block immediately after the table
│ showing every percentage's source math explicitly (e.g., "$5M / $20M
│ post-money = 25%"). Users always verify cap table math — surface it upfront.
└─ Step 6: Apply edits if requested. After writing the file, explicitly list ONLY
the changes you made (e.g., "I trimmed VO-03 from 95 words to 41"). Do NOT
comment on or take credit for changes you did not make — if the file was
modified between your read and write, acknowledge that the user may have
made concurrent edits and ask what remains.
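The inline-citation rule in Step 5 can be sketched as a small helper. The function name and signature are illustrative assumptions, not part of FCT; it reproduces the "[value] (derived from [field]: $X / $Y = Z%)" format for the cap-table case described above.

```python
# Hypothetical helper for the inline-citation format; cite_percentage is
# an assumed name, not an FCT-defined API.
def cite_percentage(numerator: float, denominator: float, field: str) -> str:
    """Render a derived percentage with its inline source trace."""
    pct = numerator / denominator * 100
    return (f"{pct:.0f}% (derived from {field}: "
            f"${numerator / 1e6:g}M / ${denominator / 1e6:g}M = {pct:.0f}%)")
```

Emitting the trace alongside the number (rather than in a footnote) is what lets a reader verify cap-table math without a follow-up question.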
[Agent Architecture] Build/improve AI agent with critical thinking
├─ Step 1: Analyze Task Complexity
│ ├─ Simple, single-path → Use CoT (Chain-of-Thought)
│ ├─ Complex, multi-path → Use ToT (Tree-of-Thoughts)
│ ├─ Interconnected → Use GoT (Graph-of-Thoughts)
│ └─ Critical, needs verification → Use Self-Consistency
│
├─ Step 2: Implement Metacognitive Layer
│ ├─ Add confidence scoring
│ ├─ Set up reflection triggers
│ └─ Configure human handoff thresholds
│
├─ Step 3: Add Self-Verification
│ ├─ Implement CoVe for factual claims
│ ├─ Add backward verification for math/logic
│   └─ Set up cross-verification if external sources are available
│
├─ Step 4: Integrate Bias Detection
│ ├─ Check for confirmation bias
│ ├─ Validate assumption diversity
│ └─ Apply mitigation strategies
│
└─ Step 5: Set Up Memory & Learning
    ├─ Configure episodic memory
    ├─ Set up quality weighting
    └─ Implement experience replay
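Step 2's reflection triggers and human-handoff thresholds can be sketched as a simple confidence gate. The numeric cutoffs (0.7 and 0.4) are illustrative assumptions, not values prescribed by FCT.

```python
# Assumed threshold values for illustration only; tune per deployment.
REFLECT_BELOW = 0.7   # below this, trigger a self-correction loop
HANDOFF_BELOW = 0.4   # below this, escalate to a human instead of guessing

def next_action(confidence: float) -> str:
    """Map a per-step confidence score to the metacognitive layer's response."""
    if confidence < HANDOFF_BELOW:
        return "human_handoff"
    if confidence < REFLECT_BELOW:
        return "reflect"
    return "proceed"
```

Checking the handoff threshold first ensures a very low-confidence step escalates rather than looping through reflection indefinitely.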
| Task Characteristic | Recommended Method | Cost | Accuracy |
|---|---|---|---|
| Simple, linear | CoT | Low | Good |
| Complex planning | ToT-BFS | High | Very Good |
| Deep reasoning | ToT-DFS | High | Very Good |
| Interconnected | GoT | Very High | Excellent |
| Critical decisions | Self-Consistency | Very High | Excellent |
| Factual claims | CoVe | Medium | Good |
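The routing table above can be expressed as a lookup, as a minimal sketch. The characteristic labels are assumptions for illustration; a real Reasoning Router (see references/reasoning_router.md) would classify the task itself rather than accept a label.

```python
# Illustrative encoding of the method-selection table; labels are assumed.
ROUTING_TABLE = {
    "simple_linear": "CoT",
    "complex_planning": "ToT-BFS",
    "deep_reasoning": "ToT-DFS",
    "interconnected": "GoT",
    "critical_decision": "Self-Consistency",
    "factual_claims": "CoVe",
}

def select_method(characteristic: str) -> str:
    """Route a task characteristic to its recommended reasoning method."""
    # Default to CoT, the lowest-cost method, when the task is unrecognized.
    return ROUTING_TABLE.get(characteristic, "CoT")
```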
```python
# Pseudo-code for an agent with FCT. Collaborator classes (ReasoningRouter,
# MetacognitiveMonitor, SelfVerification, BiasDetector) are defined in the
# reference files; THRESHOLD and reflect_and_improve are sketched here.
THRESHOLD = 0.7  # minimum per-step confidence before reflection triggers

class CriticalThinkingAgent:
    def __init__(self):
        self.reasoning_router = ReasoningRouter()
        self.metacognitive_monitor = MetacognitiveMonitor()
        self.self_verifier = SelfVerification()
        self.bias_detector = BiasDetector()

    async def solve(self, problem):
        # Step 1: Route to the appropriate reasoning method
        method = self.reasoning_router.select(problem)

        # Step 2: Generate with metacognitive monitoring
        thoughts = []
        for step in method.generate(problem):
            confidence = self.metacognitive_monitor.assess(step)
            if confidence < THRESHOLD:
                step = self.reflect_and_improve(step)
            thoughts.append(step)

        # Step 3: Self-verification of the assembled reasoning chain
        verified = self.self_verifier.verify(thoughts)

        # Step 4: Bias check and mitigation
        if self.bias_detector.detect(verified):
            verified = self.bias_detector.mitigate(verified)
        return verified

    def reflect_and_improve(self, step):
        # Producer-Critic refinement of a low-confidence step (sketch)
        ...
```
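Step 5's quality weighting (detailed in references/memory_curator.md) can be sketched as a bounded, quality-ranked store: low-quality episodes are evicted first, so bad episodes cannot pollute later retrieval. The class name, capacity, and tuple layout are illustrative assumptions.

```python
import heapq

# Hypothetical sketch of quality-weighted episodic memory, not the
# memory_curator.md implementation.
class EpisodicMemory:
    def __init__(self, capacity: int = 100):
        self.capacity = capacity
        self._heap: list[tuple[float, int, dict]] = []  # min-heap by quality
        self._counter = 0  # insertion tie-breaker so dicts are never compared

    def add(self, episode: dict, quality: float) -> None:
        """Insert an episode; evict the lowest-quality one when over capacity."""
        heapq.heappush(self._heap, (quality, self._counter, episode))
        self._counter += 1
        if len(self._heap) > self.capacity:
            heapq.heappop(self._heap)  # drop the worst episode

    def best(self, k: int = 5) -> list[dict]:
        """Return the k highest-quality episodes for experience replay."""
        return [episode for _, _, episode in heapq.nlargest(k, self._heap)]
```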
Complete documentation for each FCT component:

- reasoning_router.md - Reasoning method selection (P0)
- metacognitive_monitor.md - Self-assessment and monitoring (P0)
- self_verification.md - Output verification techniques (P0)
- bias_detector.md - Bias detection and mitigation (P0)
- producer_critic_orchestrator.md - Generate-critique-refine pattern (P1)
- memory_curator.md - Memory management (P1)
- reasoning_validator.md - Logical validation (P1)
- reflection_trigger.md - Trigger conditions (P1)
- uncertainty_quantifier.md - Confidence calibration (P2)
- fallback_handler.md - Graceful degradation (P2)

Sources: