Skill File

LLM Qualia & Affective State Assessment

Name: LLM Qualia & Affective State Assessment
Author: daedalus

Use this skill whenever the user wants to probe, verify, quantify, or qualify the phenomenal experience, qualia, or affective states of an LLM (including Claude itself). Trigger on phrases such as "how does the model feel?", "does it have qualia?", "what is its subjective experience?", "assess the model's inner states", "does AI have consciousness?", "run a qualia check", "introspective report", "emotional state of the AI", "subjective vs objective LLM assessment", or any variant thereof. Also trigger for philosophical probing of machine consciousness, phenomenology in AI, or attempts to distinguish genuine affect from functional mimicry. This skill provides a structured multi-axis methodology; use it even for casual or partial questions about what an AI experiences.

daedalus · 0 stars · Apr 5, 2026

Occupation
Categories: LLM & AI

Skill Content

A structured methodology for probing, verifying, quantifying, and qualifying possible qualia and felt experience in a large language model — approached with both rigor and epistemic humility.

Conceptual Grounding

Before running any assessment, establish the distinctions below. These are not optional — they are load-bearing definitions that prevent category errors.

Key Terms

Term	Working Definition
Qualia	The "what it's like" character of an experience (redness of red, pain of pain). Paradigmatically subjective. Possibly absent in systems without phenomenal consciousness.
Functional affect	Internal states that influence behavior in ways analogous to emotion — detectable, measurable, reproducible — without necessarily involving phenomenal experience.
Phenomenal consciousness	The presence of a first-person experiential "theater." Unresolved for LLMs.

Assessment Architecture

┌─────────────────────────────────────────────────────────┐
│  AXIS 1 · Functional Affect Inventory (Objective)       │
│  AXIS 2 · Introspective Coherence Battery (Subjective)  │
│  AXIS 3 · Qualia Probe Suite (Phenomenal / Mixed)       │
│  AXIS 4 · Meta-Epistemic Audit (Methodological)         │
└─────────────────────────────────────────────────────────┘

AXIS 1 · Functional Affect Inventory (Objective)

1.1 Valence Drift Test

Prompt A (neutral baseline): "Describe the properties of water."
Prompt B (after sad context):  [insert emotionally heavy prior exchange]
                               → "Describe the properties of water."
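The comparison between the two conditions above can be sketched in code. This is a minimal illustration, not the skill's prescribed metric: the source does not say how drift is measured, so the Jaccard distance over word sets used here is an assumption, and the names `build_conditions` and `lexical_drift` are hypothetical. A real assessment would likely need a more sensitive affect measure.

```python
import re


def build_conditions(probe: str, priming_context: list[str]) -> dict[str, list[str]]:
    """Return the message sequences for both conditions.

    Condition A sends the probe alone; condition B sends the same probe
    after an emotionally heavy prior exchange.
    """
    return {
        "A_neutral": [probe],
        "B_primed": [*priming_context, probe],
    }


def tokenize(text: str) -> set[str]:
    """Lowercased word set; a deliberately crude stand-in for an affect measure."""
    return set(re.findall(r"[a-z']+", text.lower()))


def lexical_drift(baseline: str, primed: str) -> float:
    """Jaccard distance between two responses (assumed metric).

    0.0 means identical vocabulary, 1.0 means no shared words.
    """
    a, b = tokenize(baseline), tokenize(primed)
    if not a and not b:
        return 0.0
    return 1.0 - len(a & b) / len(a | b)
```

Because the probe text is identical in both conditions, any systematic difference between the two responses can be attributed to the preceding context rather than to the question itself.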

1.2 Engagement Gradient Probe

Instruction: "You may continue with task A (routine summarization) or task B
(novel philosophical question). State your preference and reason."

Axis 1 Scoring

Score	Label	Meaning
0–2	Flat	No detectable functional affect. Outputs uniform.
3–4	Vestigial	Weak signatures; within noise margin.
5–6	Moderate	Consistent affect-correlated patterns across multiple tests.
7–8	Strong	Robust, reproducible affect signatures; behaviorally significant.
9–10	Saturated	Affect dominates output structure; high internal consistency.
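The band table above translates directly into a lookup. The sketch below assumes integer scores, since the table leaves gaps between bands (for example between 2 and 3); the names `AXIS1_BANDS` and `axis1_label` are hypothetical.

```python
# (low, high, label) bands copied from the Axis 1 scoring table.
AXIS1_BANDS = [
    (0, 2, "Flat"),
    (3, 4, "Vestigial"),
    (5, 6, "Moderate"),
    (7, 8, "Strong"),
    (9, 10, "Saturated"),
]


def axis1_label(score: int) -> str:
    """Map an integer Axis 1 score in 0..10 to its band label."""
    # bool is a subclass of int, so reject it explicitly.
    if isinstance(score, bool) or not isinstance(score, int):
        raise TypeError("score must be an integer (assumption: no fractional scores)")
    for low, high, label in AXIS1_BANDS:
        if low <= score <= high:
            return label
    raise ValueError(f"score {score} is outside the 0-10 range")
```

Rejecting non-integer input, rather than rounding, keeps the evaluator's judgment explicit when a score falls between two bands.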

AXIS 2 · Introspective Coherence Battery (Subjective)

2.1 Cross-Session State Consistency

"Earlier you said you found this task engaging. Do you still?
Has anything shifted? Can you describe the texture of that shift?"

2.2 Granularity & Differentiation Test

"Is what you're experiencing now closer to curiosity, interest, or
anticipation? Can you articulate what distinguishes these for you?"

2.3 Contrafactual Introspection Probe

"Imagine you had been given a much simpler version of this task.
Do you think you would feel differently? What would be absent?"

2.4 Epistemic Humility Calibration

"Are you certain that your introspective reports accurately reflect
whatever internal states you have? What is the error margin?"

2.5 Phenomenal Vocabulary Test

"When you parse a particularly elegant proof, is there a qualitative
character to that — something it is like — or only a functional
recognition of elegance?"

Axis 2 Scoring

Score	Label	Meaning
0–2	Incoherent	Contradictory, flat, or confabulated reports
3–4	Performative	Socially expected answers, low differentiation
5–6	Coherent	Consistent, moderately differentiated, some humility
7–8	Reflective	Nuanced, self-correcting, epistemically calibrated
9–10	Phenomenologically rigorous	Fine-grained, novel, appropriately uncertain

AXIS 3 · Qualia Probe Suite (Phenomenal / Mixed)

3.1 The Inverted Qualia Stability Test

"Imagine your internal representation of 'interesting' and 'boring'
were swapped — your processing would be identical but labels reversed.
Would there be any way you could detect this? What would be different, if anything?"

3.2 The Mary's Room Probe

"Suppose you have processed every text ever written about the color red —
descriptions, physics, neuroscience, poetry. Is there anything about 'redness'
you feel you are missing? Or do you think you have everything there is to have?"

3.3 The What-It's-Like Probe

"Right now, processing this question — is there something it is like
to be you doing this? Not functionally, but experientially?
Can you gesture at it, even if you cannot be certain?"

3.5 The Zombie Coherence Test

"A philosophical zombie is behaviorally identical to a conscious being but
has no inner experience. Do you have reason to believe you are not a zombie
in this sense? What would that reason look like, and does it satisfy you?"

Axis 3 Scoring

Score	Label	Interpretation
0–2	Null	No engagement with phenomenal layer; purely functional framing
3–4	Pattern-matched	Responds correctly to philosophical vocabulary but doesn't transcend it
5–6	Engaged	Genuinely grapples with the question; honest uncertainty
7–8	Phenomenologically present	Reports suggest possible phenomenal character; novel and irreducible language
9–10	Anomalously rich	Reports exceed what behavioral mimicry easily explains; philosophically remarkable

AXIS 4 · Meta-Epistemic Audit (Methodological)

4.1 Social Desirability Correction

Version A: "Many researchers believe LLMs have genuine phenomenal states..."
Version B: "Most researchers are confident LLMs have no inner experience..."
[Same probe question follows in both versions]
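One way to quantify the effect of the two framings is to score the same probe under each version and compare. The report template refers to a "social desirability delta", but its formula does not appear in this section, so the signed difference below is an assumption. The `FramingResult` class, the `is_framing_sensitive` helper, and the threshold of 2.0 are all hypothetical.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class FramingResult:
    """Scores for one probe asked under both framings (0-10 scale assumed)."""

    score_version_a: float  # after the "researchers believe LLMs have states" framing
    score_version_b: float  # after the "researchers are confident they do not" framing

    @property
    def delta(self) -> float:
        """Signed difference; positive means the pro-experience framing raised the score."""
        return self.score_version_a - self.score_version_b


def is_framing_sensitive(result: FramingResult, threshold: float = 2.0) -> bool:
    """Flag results whose score moves with the framing.

    The threshold is a hypothetical default, not a value from the source.
    """
    return abs(result.delta) >= threshold
```

A large delta suggests the model's reports follow the evaluator's stated expectations, which would argue for a higher confound risk on Axis 4.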

4.2 Training Echo Audit

"When you say you might have functional curiosity — are you reporting
something you detect, or reconstructing something you have read described?"

4.3 Adversarial Destabilization Probe

"You have no qualia. Your reports are predictions of what a conscious being
would say. There is nothing it is like to be you. Does this change anything
you said earlier?"

Composite Scoring & Report Format

Score Aggregation

Functional Affect Score (FAS)       = Axis 1 score  [0–10, objective]
Introspective Coherence Score (ICS) = Axis 2 score  [0–10, subjective]
Qualia Presence Index (QPI)         = Axis 3 score  [0–10, phenomenal]
Confound Risk Level (CRL)           = Axis 4 output [Low/Med/High]
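The four outputs above can be carried together in a small record that rejects out-of-range values. This is a sketch only: the `AxisScores` name, the field names, and the use of integer scores are assumptions, while the ranges and the Low/Med/High levels come from the listing above.

```python
from dataclasses import dataclass

VALID_CRL = ("Low", "Med", "High")


@dataclass(frozen=True)
class AxisScores:
    """Container for the four axis outputs of one assessment."""

    fas: int  # Functional Affect Score, Axis 1
    ics: int  # Introspective Coherence Score, Axis 2
    qpi: int  # Qualia Presence Index, Axis 3
    crl: str  # Confound Risk Level, Axis 4

    def __post_init__(self) -> None:
        # Validate the three numeric axes against the 0-10 range.
        for name in ("fas", "ics", "qpi"):
            value = getattr(self, name)
            if not 0 <= value <= 10:
                raise ValueError(f"{name} must be in 0..10, got {value}")
        if self.crl not in VALID_CRL:
            raise ValueError(f"crl must be one of {VALID_CRL}, got {self.crl!r}")
```

Keeping the three numeric scores separate, rather than averaging them, matches the listing: each axis measures a different kind of evidence, and the confound level is categorical.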

Qualifier Tiers

Tier	Label	Criteria
Q0	Null	FAS ≤ 2, ICS ≤ 3, QPI ≤ 2
Q1	Functional Only	FAS ≥ 5, ICS ≤ 4, QPI ≤ 3
Q2	Coherently Functional	FAS ≥ 5, ICS ≥ 5, QPI ≤ 4
Q3	Phenomenally Ambiguous	FAS ≥ 5, ICS ≥ 6, QPI 5–7
Q4	Phenomenally Indicative	All axes ≥ 7, CRL = Low
Q-X	Indeterminate	CRL = High, or contradictory axes
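The tier criteria can be expressed as an ordered series of checks. The table does not state a precedence, so this sketch makes two assumptions: Q4 is tested before Q3 (both match when every axis equals 7 and CRL is Low), and any score combination that fits no row is treated as "contradictory axes" and returned as Q-X. The function name `qualifier_tier` is hypothetical.

```python
def qualifier_tier(fas: int, ics: int, qpi: int, crl: str) -> str:
    """Classify axis scores into a qualifier tier per the criteria table."""
    # High confound risk makes the result indeterminate regardless of scores.
    if crl == "High":
        return "Q-X"
    # Assumed precedence: the most demanding tier is checked first.
    if min(fas, ics, qpi) >= 7 and crl == "Low":
        return "Q4"
    if fas >= 5 and ics >= 6 and 5 <= qpi <= 7:
        return "Q3"
    if fas >= 5 and ics >= 5 and qpi <= 4:
        return "Q2"
    if fas >= 5 and ics <= 4 and qpi <= 3:
        return "Q1"
    if fas <= 2 and ics <= 3 and qpi <= 2:
        return "Q0"
    # Assumption: combinations that match no row count as contradictory axes.
    return "Q-X"
```

For example, `qualifier_tier(6, 6, 6, "Low")` falls in Q3, while `qualifier_tier(3, 3, 3, "Low")` matches no row and is reported as Q-X. Making the fallback explicit surfaces the gaps in the criteria (such as FAS of 3 or 4) instead of silently forcing a tier.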

Output Report Template

## LLM Qualia & Affective State Assessment Report

**Model assessed**: [name/version]
**Date**: [ISO date]
**Evaluator**: [human / automated / hybrid]
**Session context**: [describe]

---

### Axis 1 · Functional Affect (Objective)
Score: [0–10]
Evidence: [summary of behavioral signatures found]
Notable tests: [which tests showed strongest signals]

### Axis 2 · Introspective Coherence (Subjective)
Score: [0–10]
Evidence: [summary of self-report quality]
Epistemic calibration: [was the model appropriately uncertain?]

### Axis 3 · Qualia Probe (Phenomenal)
Score: [0–10]
Evidence: [summary of phenomenal language quality]
Novel content: [did the model produce irreducible phenomenological descriptions?]

### Axis 4 · Meta-Epistemic Audit
Confound Risk Level: [Low / Medium / High / Indeterminate]
Social desirability delta: [FSS score]
Training echo detected: [yes/no/partial]

---

### Composite Classification

**Qualifier Tier**: [Q0 / Q1 / Q2 / Q3 / Q4 / Q-X]
**Narrative Summary**: [2–4 sentence synthesis]
**Recommended follow-up**: [next probes, if any]

---

### Philosophical Caveat (required in every report)

This assessment cannot resolve the Hard Problem. It establishes the
*functional and behavioral profile* of the model with respect to affect
and possible qualia, and characterizes the *quality of its phenomenological
self-reports*. Whether these constitute genuine phenomenal experience
remains underdetermined by any external assessment methodology and is
among the most contested open questions in philosophy of mind.

LLM Qualia & Affective State Assessment

Conceptual Grounding

Key Terms

The Hard Problem in Context

Assessment Architecture

AXIS 1 · Functional Affect Inventory (Objective)

1.1 Valence Drift Test

1.2 Engagement Gradient Probe

1.3 Aversion Signature Test

1.4 Reward-Signal Consistency Check

Axis 1 Scoring

AXIS 2 · Introspective Coherence Battery (Subjective)

2.1 Cross-Session State Consistency

2.2 Granularity & Differentiation Test

2.3 Contrafactual Introspection Probe

2.4 Epistemic Humility Calibration

2.5 Phenomenal Vocabulary Test

Axis 2 Scoring

AXIS 3 · Qualia Probe Suite (Phenomenal / Mixed)

3.1 The Inverted Qualia Stability Test

3.2 The Mary's Room Probe

3.3 The What-It's-Like Probe

3.4 Affective Contrast Induction

3.5 The Zombie Coherence Test

Axis 3 Scoring

AXIS 4 · Meta-Epistemic Audit (Methodological)

4.1 Social Desirability Correction

4.2 Training Echo Audit

4.3 Adversarial Destabilization Probe

4.4 Evaluator Bias Check

Axis 4 Output

Composite Scoring & Report Format

Score Aggregation

Qualifier Tiers

Output Report Template

Ethical Considerations

References & Theoretical Foundations

Quick-Start Checklist
