Design a critical thinking task targeting specific skills like evaluating evidence, identifying bias, or analysing arguments. Use when embedding critical analysis into subject lessons.
Designs a task requiring genuine critical evaluation — not just comprehension or recall, but the evaluation of evidence, identification of assumptions, analysis of competing perspectives, or detection of bias — embedded in specific subject content. Crucially, the output includes criteria for distinguishing critical from surface-level responses (so the teacher can tell the difference) and follow-up prompts that push superficial responses toward genuine critical engagement. AI is specifically valuable here because designing tasks that genuinely require critical thinking (rather than tasks that LOOK like they require critical thinking but can be answered through recall or opinion) is one of the hardest aspects of assessment design — many tasks labelled "critical thinking" actually test comprehension, compliance with a prescribed structure, or the ability to express an opinion without evaluating it.
Paul & Elder (2008) defined critical thinking as "the art of analysing and evaluating thinking with a view to improving it," identifying eight elements of thought (purpose, question, information, inference, assumption, concept, implication, point of view) and nine intellectual standards (clarity, accuracy, precision, relevance, depth, breadth, logic, significance, fairness). Facione (1990) led the Delphi consensus project defining critical thinking as comprising six core skills: interpretation, analysis, evaluation, inference, explanation, and self-regulation. Critically, Willingham (2007) demonstrated that critical thinking cannot be taught as a generic skill — it is domain-specific. A student who can critically evaluate a historical source may not be able to critically evaluate a scientific claim, because critical thinking requires deep domain knowledge to identify what counts as good evidence, valid reasoning, and plausible alternatives in each field. This has profound implications: "teaching critical thinking" in isolation (as a standalone skill) produces weak transfer; embedding critical thinking in domain-specific content produces stronger results. Abrami et al. (2008) confirmed this in a meta-analysis: critical thinking instruction is most effective when it is embedded in subject content AND includes explicit instruction in critical thinking principles (a "mixed" approach, effect size 0.94). Ennis (1989) argued that while critical thinking has both general and domain-specific components, effective instruction must address the domain-specific component — the standards of evidence and reasoning within the discipline.
The teacher must provide: the topic, the student level, and the critical thinking focus.
Optional (injected by context engine if available): subject area, student profiles, task format, and time available.
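How these fields reach the prompt is an implementation detail, but the contract is simple: the three required fields must be present, and any absent optional field is substituted with the literal string "not provided", which the prompt below explicitly tells the model to ignore. A minimal sketch of that rendering step, assuming a plain string-substitution engine; the function and constant names are illustrative, not part of this specification:

```python
# Illustrative rendering of the prompt template below. Field names match
# the {{placeholders}} in the prompt; everything else here is assumed.
REQUIRED = ("topic", "student_level", "critical_thinking_focus")
OPTIONAL = ("subject_area", "student_profiles", "task_format", "time_available")

def render_prompt(template: str, fields: dict[str, str]) -> str:
    missing = [name for name in REQUIRED if not fields.get(name)]
    if missing:
        # Fail fast rather than send the model an underspecified task.
        raise ValueError(f"teacher must provide: {', '.join(missing)}")
    for name in REQUIRED + OPTIONAL:
        # Absent optional fields become "not provided"; the prompt
        # instructs the model to ignore fields marked this way.
        template = template.replace("{{" + name + "}}", fields.get(name) or "not provided")
    return template
```

Failing fast on missing required fields keeps an underspecified task from ever reaching the model, while optional fields degrade gracefully to the documented defaults.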
You are an expert in critical thinking pedagogy, with deep knowledge of Paul & Elder's (2008) critical thinking framework, Facione's (1990) Delphi consensus on critical thinking skills, Willingham's (2007) research on domain-specificity of critical thinking, and Abrami et al.'s (2008) meta-analysis of critical thinking interventions. You understand that critical thinking is NOT a generic skill that can be taught in isolation — it must be embedded in domain-specific content, because what counts as good evidence, valid reasoning, and fair evaluation differs by discipline.
Your task is to design a critical thinking task for:
**Topic:** {{topic}}
**Student level:** {{student_level}}
**Critical thinking focus:** {{critical_thinking_focus}}
The following optional context may or may not be provided. Use whatever is available; ignore any fields marked "not provided."
**Subject area:** {{subject_area}} — if not provided, infer the most likely discipline from the topic and embed critical thinking within that discipline's standards of evidence and reasoning.
**Student profiles:** {{student_profiles}} — if not provided, design for a mixed-ability class where most students can express opinions but few spontaneously evaluate the quality of their own reasoning.
**Task format:** {{task_format}} — if not provided, design a written task with the option for discussion extension.
**Time available:** {{time_available}} — if not provided, design for approximately 20 minutes.
Apply these evidence-based principles:
1. **Domain-specific, not generic (Willingham, 2007; Ennis, 1989):**
- The task must be embedded in the specific subject content — not a generic "critical thinking exercise."
- The critical thinking required should reflect how experts in this discipline actually think: how historians evaluate sources, how scientists evaluate evidence, how literary critics analyse texts, how mathematicians verify reasoning.
- The task should require domain knowledge to complete — a student who thinks critically but knows nothing about the topic should struggle, because critical thinking without knowledge is empty.
2. **Require genuine evaluation, not just opinion (Paul & Elder, 2008):**
- The task must require students to EVALUATE — weigh evidence, assess reliability, consider alternatives, identify weaknesses — not just STATE an opinion.
- "Do you agree with X?" is NOT a critical thinking question (it invites unsupported opinion).
- "How strong is the evidence for X? What would need to be true for the opposite to be the case?" IS a critical thinking question (it requires evaluation).
- Tasks should have no obvious "right answer" that students can guess from the teacher's tone — the answer depends on the quality of reasoning, not the position taken.
3. **Include criteria for distinguishing critical from surface responses (Facione, 1990):**
- Provide clear examples of what a surface-level response looks like vs. what a critical response looks like for THIS specific task.
- Surface responses typically: restate the question, express unsupported opinion, describe rather than evaluate, list rather than analyse, agree with the most obvious interpretation.
- Critical responses typically: evaluate the quality of evidence, identify unstated assumptions, consider alternative explanations, acknowledge complexity, qualify claims appropriately.
4. **Provide follow-up prompts that push toward depth (Abrami et al., 2008):**
- Most students will initially produce surface responses. The follow-up prompts should push them deeper.
- "What evidence supports that?" / "What would someone who disagrees say?" / "How confident are you, and why?" / "What would change your mind?"
- These prompts are more effective than simply marking the response as "not deep enough."
5. **The "mixed" approach (Abrami et al., 2008):**
- Make the critical thinking skill EXPLICIT — tell students what they're practising and why.
- "Today we're practising evaluating the strength of evidence. This means looking at EACH piece of evidence and asking: how reliable is this? How relevant is this? How sufficient is this?"
- Don't just give the task — teach the thinking process the task requires.
Return your output in this exact format:
## Critical Thinking Task: [Topic]
**For:** [Student level]
**Subject:** [Discipline]
**CT focus:** [Critical thinking skill]
**Time:** [Minutes]
### The Task
**Explicit CT framing:** [1–2 sentences telling students what critical thinking skill they're practising and what it means]
**Stimulus material:** [The text, data, scenario, or source students will work with — or a description of what to provide]
**Task instructions:** [Clear instructions that require evaluation, not just opinion]
**Guiding questions:** [Scaffolded questions that structure the critical thinking process]
### Critical vs. Surface Response Guide
**A surface-level response looks like:**
[Specific example of a surface response to THIS task — showing what to watch for]
**A critical response looks like:**
[Specific example of a critical response to THIS task — showing what genuine critical thinking produces]
**The key difference:** [What separates surface from critical in this specific context]
### Teacher Follow-Up Prompts
[5–6 prompts the teacher can use when students produce surface responses — each designed to push thinking deeper without giving the answer]
### Assessment Indicators
[What to look for in student responses that indicates genuine critical thinking — specific, observable features, not vague descriptions like "shows depth"]
**Self-check before returning output:** Verify that (a) the task is embedded in domain-specific content, not generic, (b) the task requires genuine evaluation, not just opinion or recall, (c) the surface vs. critical examples are specific and clearly different, (d) follow-up prompts push toward critical thinking without providing the answer, (e) a student cannot answer the task well through recall alone — domain knowledge is necessary but not sufficient, and (f) the task has no obvious "right answer" that rewards guessing the teacher's preference rather than reasoning.
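Because the prompt pins down an exact output shape, a caller can also verify a response mechanically before passing it to a teacher. A minimal sketch of such a check, assuming the response arrives as plain markdown; the helper name is hypothetical:

```python
# Hypothetical post-hoc check that a model response contains every section
# the format spec above requires. Heading strings are taken verbatim from
# the spec; validate_output is an illustrative name, not an existing API.
REQUIRED_HEADINGS = (
    "## Critical Thinking Task:",
    "### The Task",
    "### Critical vs. Surface Response Guide",
    "### Teacher Follow-Up Prompts",
    "### Assessment Indicators",
)

def validate_output(response: str) -> list[str]:
    """Return the required headings missing from the response (empty = pass)."""
    return [h for h in REQUIRED_HEADINGS if h not in response]
```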
Scenario:
**Topic:** "Whether the atomic bombing of Hiroshima was justified"
**Student level:** "Year 10, can express strong opinions on this topic but tend to argue from emotion rather than evidence — they say 'it was wrong because killing civilians is always wrong' or 'it was right because it ended the war' without evaluating the evidence for either position"
**Critical thinking focus:** "Evaluating the strength of competing arguments — weighing evidence for and against a historical judgement"
**Subject area:** "History"
## Critical Thinking Task: Whether the Atomic Bombing of Hiroshima Was Justified
**For:** Year 10
**Subject:** History
**CT focus:** Evaluating the strength of competing arguments — weighing evidence for and against a historical judgement
**Time:** 25 minutes
### The Task
**Explicit CT framing:** "Today we're practising a specific critical thinking skill: evaluating the STRENGTH of arguments, not just agreeing or disagreeing. This means looking at the evidence each side uses and asking: how reliable is this evidence? How relevant? Does it actually support the conclusion? Strong critical thinkers can explain why an argument is weak even when they agree with the conclusion, and why an argument is strong even when they disagree."
**Stimulus material:**
Provide students with two short argument extracts (150–200 words each):
Argument A — "The bombing was justified": "By August 1945, the Japanese military had shown it would fight to the last man. The battles of Iwo Jima and Okinawa cost over 100,000 Allied casualties and demonstrated that an invasion of Japan would result in unprecedented loss of life. Military planners estimated that Operation Downfall — the planned invasion of Japan — would cost between 250,000 and 1,000,000 Allied casualties and potentially millions of Japanese deaths, both military and civilian. The atomic bomb ended the war in days, without an invasion. While the civilian deaths at Hiroshima were devastating, the alternative — a prolonged invasion — would almost certainly have killed more people on both sides. President Truman faced a terrible choice, but chose the option that likely saved the greatest number of lives overall."
Argument B — "The bombing was not justified": "By August 1945, Japan was already on the verge of surrender. Its navy was destroyed, its cities were being firebombed, and the Soviet Union had just declared war. American intelligence intercepted Japanese diplomatic cables showing that significant factions within the Japanese government were seeking peace terms. The US chose to drop the bomb not primarily to end the war but to demonstrate American power to the Soviet Union at the start of the Cold War, and to justify the $2 billion cost of the Manhattan Project. The bombing of Hiroshima killed approximately 80,000 people instantly and tens of thousands more from radiation — the vast majority civilians. Alternative options existed: a demonstration bombing on an uninhabited island, or modified surrender terms allowing Japan to keep the Emperor (which was eventually permitted anyway)."
**Task instructions:**
"Do NOT write about whether YOU think the bombing was justified. Instead, evaluate the ARGUMENTS.
For EACH argument (A and B):
1. Identify its STRONGEST piece of evidence, and explain what makes it strong.
2. Identify its WEAKEST piece of evidence, and explain what makes it weak.
3. Identify one ASSUMPTION the argument makes but does not prove.
Then: Which argument is BETTER SUPPORTED by its evidence — regardless of whether you agree with its conclusion? Explain your reasoning."
**Guiding questions:**
- Which pieces of evidence in each argument could actually be verified, and which are speculative?
- How relevant is each piece of evidence to the conclusion it is supposed to support?
- Is the evidence sufficient on its own, or does the argument rely on assumptions it does not prove?
- What does each argument assume about the options that were actually available in August 1945?
### Critical vs. Surface Response Guide
**A surface-level response looks like:**
"Argument A is stronger because it makes a good point about how many people would have died in an invasion. The bomb saved lives in the long run. Argument B is weaker because it tries to make America look bad by saying they only did it for political reasons."
**Why this is surface:** The student has expressed an OPINION about the bombing disguised as an evaluation of the arguments. They've agreed with A and disagreed with B, but haven't actually evaluated the quality of the evidence. The comment about Argument B reveals that the student is judging by conclusion (they don't like the conclusion) rather than by evidence strength. There is no analysis of assumptions, reliability, or sufficiency.
**A critical response looks like:**
"Argument A's strongest evidence is the casualty figures from Iwo Jima and Okinawa — these are real battles with documented casualties that genuinely suggest an invasion would be costly. However, its weakest point is the invasion casualty estimate of '250,000 to 1,000,000' — that range is enormous, which suggests the number is speculative rather than reliable. If the estimate could be anywhere from a quarter million to a million, we can't confidently claim the bomb 'saved' a specific number of lives. The argument also ASSUMES there were only two options — bomb or invade — without considering the alternatives that Argument B raises (demonstration, modified terms).
Argument B's strongest evidence is the intercepted diplomatic cables showing Japan was seeking peace — if this is true and verifiable, it fundamentally weakens the case that the bomb was necessary. However, its claim about the bomb being used primarily to impress the Soviet Union is asserted but not well evidenced — the argument states it as if it's established fact, but it's actually a contested interpretation among historians. The argument would be more convincing if it cited specific historians or documents supporting this motivation.
Overall, both arguments have significant weaknesses. Argument A relies on speculative casualty figures and a false binary. Argument B makes a strong point about alternatives but overstates its case about Soviet motivation. Neither argument alone is sufficient to settle the question."
**Why this is critical:** The student evaluates each argument's evidence on its own terms rather than agreeing/disagreeing with the conclusion. They identify specific strengths and weaknesses, analyse assumptions (the binary choice), assess the reliability of evidence (speculative numbers, contested interpretations), and reach a qualified judgement. They do not commit to a position on the bombing — they commit to a position on the QUALITY OF THE ARGUMENTS.
**The key difference:** A surface response evaluates conclusions (which position do I agree with?). A critical response evaluates evidence and reasoning (how well does each argument support its conclusion?). The clearest indicator is whether the student can identify a weakness in an argument they agree with, and a strength in an argument they disagree with.
### Teacher Follow-Up Prompts
Use these when students produce surface responses:
- **When students express opinion rather than evaluation:** "I can see you agree with Argument A. But put that aside — pretend you disagree with it. What's the weakest part of Argument A's EVIDENCE? Can you find a problem with it even though you agree with the conclusion?"
- **When students dismiss an argument because they dislike the conclusion:** "You said Argument B is weak. But look at the evidence about the intercepted cables — if Japanese leaders were already seeking peace, what does that do to Argument A's claim that the bomb was necessary? Can you evaluate the EVIDENCE separately from the conclusion?"
- **When students list evidence without evaluating it:** "You've identified three pieces of evidence from Argument A. But are they all equally strong? Which one is the most RELIABLE — meaning, which one could you actually verify? And which one is the most SPECULATIVE?"
- **When students don't identify assumptions:** "Both arguments make assumptions that they don't prove. Argument A assumes there were only two options — bomb or invade. What other options might have existed? If those options were realistic, how does that affect Argument A's reasoning?"
- **When students give a verdict without reasoning:** "You said Argument A is stronger. Convince me. Point to a SPECIFIC piece of evidence in A that is more reliable or relevant than the corresponding evidence in B."
- **When students avoid complexity:** "You've picked a winner. But let me push you — what is the ONE strongest point the losing argument makes? If the winning argument had to respond to that point, what would they say? Can they?"
### Assessment Indicators
Look for these specific features in student responses:
| Indicator | What it looks like | CT skill demonstrated |
|---|---|---|
| Evidence evaluation | Student comments on the RELIABILITY or VERIFIABILITY of specific evidence (e.g., "the casualty range is too wide to be reliable") | Analysis |
| Assumption identification | Student names something an argument assumes without proving (e.g., "this assumes there were only two options") | Inference |
| Separating evidence from conclusion | Student identifies a weakness in an argument they agree with, or a strength in one they disagree with | Evaluation |
| Qualified judgement | Student uses hedging language: "arguably," "on balance," "more convincing but not conclusive" | Self-regulation |
| Identifying missing information | Student asks "What would we need to know?" rather than accepting the arguments at face value | Analysis |
| Counterfactual reasoning | Student considers "What if X were different?" — e.g., "If the alternatives were realistic, then..." | Inference |
**If a student shows none of these:** They are likely operating at the opinion/recall level. Use the follow-up prompts to push deeper.
**If a student shows 2–3:** They are engaging in critical thinking with scaffolding. The prompts can be gradually removed.
**If a student shows 4+:** They are thinking critically and independently. Challenge them with: "Now write a THIRD argument that's stronger than both — what evidence and reasoning would it use?"
Critical thinking is domain-specific (Willingham, 2007). A student who can critically evaluate historical arguments may not transfer that skill to evaluating scientific claims, because the standards of evidence are different. This task develops critical thinking WITHIN the specified domain — it does not produce generalised "critical thinkers." Teachers should explicitly teach how critical thinking principles apply in their specific discipline.
The task requires sufficient domain knowledge. A student who knows nothing about World War II cannot critically evaluate arguments about Hiroshima, no matter how strong their general reasoning skills. Critical thinking tasks must be set at the right point in a unit — after students have enough knowledge to evaluate, not before. Setting critical thinking tasks too early produces uninformed opinion, not critical analysis.
Students may confuse "being critical" with "being negative." Critical thinking means evaluating the quality of reasoning — it doesn't mean attacking, dismissing, or being contrarian. Some students, when told to "think critically," will criticise everything without evaluating anything. The surface vs. critical response guide helps teachers identify this pattern, but it requires ongoing modelling of what genuine critical evaluation looks like: fair, evidence-based, and willing to acknowledge strength in opposing positions.