Qualitative coding methodology expertise for QualCoder v2 development. Provides research-grounded knowledge of coding methods, frameworks, and AI-assisted analysis. **Invoke when:** - Implementing coding features (code creation, hierarchy, memos, segments) - Designing AI-assisted coding workflows (LLM suggestions, auto-coding) - Making UX decisions about coding interfaces - Writing domain entities, events, or invariants for the coding bounded context - Evaluating feature parity with NVivo, ATLAS.ti, MAXQDA - Discussing qualitative research methodology with users **Provides:** - Saldana's coding method taxonomy (35+ methods across 7 categories) - Framework knowledge (Grounded Theory, Thematic Analysis, Framework Analysis, IPA) - Quality criteria (Lincoln & Guba trustworthiness, inter-coder reliability) - AI-assisted coding best practices and ethical guardrails - Feature-to-methodology mapping for implementation decisions
Domain knowledge for building QualCoder v2's coding features grounded in established qualitative research methodology.
Coding is the process of assigning short labels (codes) to segments of qualitative data (text, images, audio, video) to categorize, summarize, and find patterns. As Saldana states: "coding is just one way, not the way to analyze qualitative data."
| Term | Definition | QualCoder Entity |
|---|---|---|
| Code | A word/phrase labeling a data segment | Code entity |
| Category | A grouping of related codes | Code hierarchy (parent) |
| Theme | An interpretive pattern across categories | Derived from categories |
| Memo | Researcher's reflective notes on codes/data |
Memo / Journal |
| Codebook | Structured list of all codes with definitions | Export feature |
| Segment | A marked portion of data linked to a code | Segment entity |
| Annotation | A note attached to a specific data segment | Annotation feature |
| Saturation | Point where new data yields no new codes | Analytics indicator |
Theme (interpretive pattern)
└── Category (grouping)
└── Code (label)
└── Segment (data excerpt)
└── Source (document/media)
Reference: Saldana, J. (2021). The Coding Manual for Qualitative Researchers (4th ed.). SAGE.
| Method | Description | Use Case |
|---|---|---|
| Attribute Coding | Logs metadata (participant demographics, setting, dates) | Managing multi-site/multi-participant datasets |
| Magnitude Coding | Adds intensity/frequency qualifiers to codes (e.g., ANXIETY [HIGH]) | Studies needing degree/intensity differentiation |
| Simultaneous Coding | Applies two or more codes to a single segment | When data is complex and multi-layered |
| Sub-coding | Adds second-order detail to a primary code | Refining broad codes into nuanced subcategories |
| Method | Description | Use Case |
|---|---|---|
| Descriptive Coding | Topic labels summarizing passage content | Beginners; ethnographies; inventorying topics |
| In Vivo Coding | Uses participant's own words as codes | Grounded theory; honoring participant voice |
| Process Coding | Uses gerunds (-ing words) to capture actions | Studies focused on processes, sequences, change |
| Initial Coding | Open-ended, line-by-line breaking apart of data | Grounded theory first pass |
| Structural Coding | Codes based on research questions applied to segments | Multi-participant interview studies |
| Method | Description | Use Case |
|---|---|---|
| Emotion Coding | Labels emotions experienced or recalled | Intrapersonal/interpersonal experience studies |
| Values Coding | Codes for values, attitudes, and beliefs (V/A/B) | Studies of cultural identity, worldview |
| Versus Coding | Identifies dichotomies and conflicts (X vs. Y) | Power dynamics, conflict studies |
| Evaluation Coding | Assigns judgments of merit or worth | Policy, program evaluation research |
| Method | Description | Use Case |
|---|---|---|
| Dramaturgical Coding | Applies theatrical concepts (objectives, conflicts, tactics) | Performance studies, identity research |
| Motif Coding | Identifies recurring symbolic elements | Folklore, literary, narrative studies |
| Narrative Coding | Codes story elements (setting, conflict, resolution) | Narrative inquiry, oral histories |
| Verbal Exchange Coding | Codes conversational dynamics | Sociolinguistics, interaction analysis |
| Method | Description | Use Case |
|---|---|---|
| Holistic Coding | Single code per large data chunk | First pass on large datasets, quick overview |
| Provisional Coding | Starts with pre-set code list from theory/literature | Deductive studies, literature-driven research |
| Hypothesis Coding | Tests researcher's hunches against data | Theory testing, mixed methods |
| Method | Description | Use Case |
|---|---|---|
| Protocol Coding | Codes from pre-established instruments | Structured observation, standardized protocols |
| OCM Coding | Uses Outline of Cultural Materials categories | Cross-cultural, anthropological research |
| Domain & Taxonomic | Identifies cultural domains and taxonomies | Ethnographic research |
| Causation Coding | Maps cause-effect relationships | Policy analysis, process tracing |
| Method | Description | Use Case |
|---|---|---|
| Themeing the Data | Applies theme-level labels directly (phrases/sentences) | When patterns are immediately apparent |
| Metaphor Coding | Identifies figurative language and metaphors | Studies where metaphor reveals worldview |
Second cycle methods work with the codes themselves, not raw data:
| Method | Description | Use Case |
|---|---|---|
| Pattern Coding | Groups first-cycle codes into smaller themes/constructs | Condensing large code sets |
| Focused Coding | Selects most frequent/significant codes to develop categories | Grounded theory (Charmaz) |
| Axial Coding | Relates categories to subcategories along properties/dimensions | Grounded theory (Strauss & Corbin) |
| Theoretical Coding | Integrates categories into a coherent theory | Grounded theory final stage |
| Elaborative Coding | Builds on previous studies' codes to refine theory | Longitudinal or replication studies |
| Longitudinal Coding | Tracks code changes across time | Longitudinal designs |
First Cycle → User applies codes to data segments
QualCoder supports: code creation, hierarchy, color, memo
AI can: suggest codes, auto-code with provisional lists
Second Cycle → User reorganizes codes into categories/themes
QualCoder supports: drag-drop hierarchy, merge codes, code reports
AI can: suggest groupings, identify patterns, co-occurrence analysis
Origins: Glaser & Strauss (1967). Three major schools:
| School | Key Figure | Coding Stages | Philosophy |
|---|---|---|---|
| Classic GT | Glaser (1978) | Open → Selective → Theoretical | Objectivist; theory "emerges" |
| Straussian GT | Strauss & Corbin (1990) | Open → Axial → Selective | Systematic procedures |
| Constructivist GT | Charmaz (2006) | Initial → Focused → Theoretical | Interpretivist; meaning co-constructed |
Core GT Procedures:
QualCoder implications: Support iterative coding, code merging, code frequency tracking, memo linking, saturation indicators.
Reference: Braun, V. & Clarke, V. (2006, 2022). Most cited qualitative method (190,000+ citations).
Six Phases:
| Phase | Activity | QualCoder Support |
|---|---|---|
| 1. Familiarization | Read/re-read data, take initial notes | Source viewer, annotations |
| 2. Initial Coding | Systematically code interesting features | Code creation, segment marking |
| 3. Searching for Themes | Group codes into candidate themes | Code hierarchy, drag-drop grouping |
| 4. Reviewing Themes | Check themes against coded segments and full dataset | Code reports, segment review |
| 5. Defining & Naming | Refine theme names and scope | Code renaming, memo writing |
| 6. Writing Up | Produce report with data extracts | Export, reports |
TA as a family of methods (Braun & Clarke 2022 typology):
2024 Update: Reflexive Thematic Analysis Reporting Guidelines (RTARG) published as alternative to COREQ/SRQR.
Reference: Krippendorff, K. (2018). Content Analysis: An Introduction to Its Methodology (4th ed.).
Reference: Smith, Flowers & Larkin (2009).
The gold standard for qualitative research quality:
| Criterion | Parallel to | Strategies | QualCoder Support |
|---|---|---|---|
| Credibility | Internal validity | Prolonged engagement, triangulation, member checking, peer debriefing | Memo trails, annotation |
| Transferability | External validity | Thick description | Rich segment exports, context |
| Dependability | Reliability | Audit trail, process documentation | Coding history, journals |
| Confirmability | Objectivity | Reflexive memos, audit trail | Memo system, code history |
When multiple coders work on same data:
| Measure | Best For | Notes |
|---|---|---|
| Percent agreement | Quick check | Doesn't account for chance |
| Cohen's kappa | 2 coders, nominal data | Most widely used; κ > 0.8 = strong |
| Krippendorff's alpha | 2+ coders, any data level | Most flexible; handles missing data |
| Scott's pi | 2 coders | Alternative to kappa |
Interpretation (Landis & Koch, 1977):
Best practices:
A codebook should include for each code:
Commercial QDA tools with AI:
QualCoder uses AI transparently:
DO:
DON'T:
When implementing AI coding features, prompts should:
When implementing QualCoder features, map them to methodology:
| Feature | Methodology Support | Priority |
|---|---|---|
| Code creation with color + memo | All methods | P0 |
| Code hierarchy (tree) | All methods (categories→themes) | P0 |
| In vivo coding (select text → auto-name) | Grounded Theory, many first-cycle methods | P1 |
| Code co-occurrence matrix | Content Analysis, pattern identification | P1 |
| Code frequency reports | Content Analysis, saturation tracking | P1 |
| Case-by-case coding view | IPA, Framework Analysis | P1 |
| Framework matrix (cases × themes) | Framework Analysis | P2 |
| Codebook export (name, definition, examples) | All methods, ICR workflows | P1 |
| Inter-coder comparison | Coding reliability TA, Content Analysis | P2 |
| AI code suggestions | AI-augmented workflows | P1 |
| AI pattern/theme detection | Second cycle assistance | P2 |
| Memo linking (code↔memo↔source) | Grounded Theory, Reflexive TA | P1 |
| Saturation indicator | Grounded Theory | P3 |
| Coding history / audit trail | All methods (trustworthiness) | P1 |
| Simultaneous coding (multiple codes per segment) | Complex data, mixed methods | P1 |
| Process coding view (temporal) | Process Coding, Narrative Analysis | P3 |
| Term | Definition |
|---|---|
| Abductive reasoning | Moving between data and theory iteratively |
| Axial coding | Relating categories to subcategories (Strauss & Corbin) |
| Code | A researcher-generated label for a data segment |
| Code co-occurrence | When two codes overlap or appear in proximity |
| Codebook | Structured reference of all codes with definitions |
| Coding cycle | A complete pass through data applying/refining codes |
| Constant comparison | Continuously comparing data, codes, and categories |
| Core category | Central concept around which theory is built (GT) |
| Deductive coding | Applying pre-existing codes from theory/literature |
| Double hermeneutic | Researcher interpreting participant's interpretation (IPA) |
| Focused coding | Selecting significant first-cycle codes for categories |
| Inductive coding | Generating codes from the data itself |
| In vivo code | A code using participant's exact words |
| Member checking | Validating findings with participants |
| Memo | Researcher's analytical notes during coding |
| Open coding | Initial, unrestricted coding of data |
| Pattern coding | Grouping first-cycle codes into meta-codes |
| Reflexivity | Researcher's awareness of their own influence |
| Saturation | No new codes/themes emerging from additional data |
| Segment | A portion of data to which a code is applied |
| Theoretical sampling | Data collection guided by emerging theory |
| Theme |
START
│
├── Do you have pre-existing codes from theory?
│ YES → Provisional Coding / Hypothesis Coding
│ NO ↓
│
├── Is this your first pass through the data?
│ YES ↓
│ │ ├── Large dataset, need overview? → Holistic Coding
│ │ ├── Want participant voice? → In Vivo Coding
│ │ ├── Want topic inventory? → Descriptive Coding
│ │ ├── Studying processes/actions? → Process Coding
│ │ └── Grounded theory study? → Initial Coding (line-by-line)
│ │
│ NO → Second Cycle Methods ↓
│ ├── Need to group codes? → Pattern Coding / Focused Coding
│ ├── Need relationships? → Axial Coding
│ └── Building theory? → Theoretical Coding
│
├── What aspect interests you?
│ ├── Emotions → Emotion Coding
│ ├── Values/beliefs → Values Coding
│ ├── Conflicts → Versus Coding
│ ├── Program effectiveness → Evaluation Coding
│ ├── Stories → Narrative Coding
│ └── Performance/identity → Dramaturgical Coding
│
└── Multiple coders?
YES → Codebook approach + ICR measurement
NO → Reflexive approach + memo trail
| An interpretive pattern capturing meaning across data |
| Thick description | Detailed contextual account enabling transferability |
| Triangulation | Using multiple data sources/methods/researchers |
| Trustworthiness | Overall quality of qualitative research (Lincoln & Guba) |