This skill encodes expert knowledge for designing self-paced reading (SPR) experiments in psycholinguistics. SPR is the most widely used behavioral method for studying real-time sentence comprehension during reading (Jegerski, 2014). A competent programmer without psycholinguistics training will reliably make errors in region segmentation, spillover design, and comprehension question construction -- any of which can invalidate the resulting data.

For detailed region segmentation strategies, see references/region-segmentation.md. For statistical analysis guidance, see references/analysis-guide.md.

Why SPR Design Requires Domain Expertise

Self-paced reading appears deceptively simple: participants press a button to reveal successive words. But the scientific value of an SPR experiment depends entirely on decisions that require psycholinguistic training:

Region boundaries determine what you can measure. A critical region that spans a clause boundary conflates syntactic processing with wrap-up effects (Just & Carpenter, 1980). A non-specialist would not know this.
Spillover is not a bug -- it is the primary data pattern. In SPR, processing difficulty at word N often appears in reading times at words N+1 and N+2, not at word N itself (Mitchell, 2004; Rayner, 1998). Failing to include and analyze spillover regions means missing the effect entirely.
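
As a minimal sketch of the spillover point above, the helper below pulls the reading times for the critical word (word N) and the words that follow it (N+1, N+2) out of a single trial, so that all three regions can be analyzed. The function name `region_rts`, the list-of-milliseconds input, and the default two-word spillover window are illustrative assumptions, not part of any existing SPR package.

```python
from typing import Dict, List, Optional


def region_rts(rts_ms: List[float], critical_idx: int,
               n_spillover: int = 2) -> Dict[str, Optional[float]]:
    """Return RTs for the critical word and the following spillover words.

    rts_ms: per-word reading times for one trial, in presentation order.
    critical_idx: 0-based position of the critical word (word N).
    Spillover positions past the end of the sentence are reported as None.
    """
    if not 0 <= critical_idx < len(rts_ms):
        raise ValueError("critical_idx is outside the sentence")
    out: Dict[str, Optional[float]] = {"critical": rts_ms[critical_idx]}
    for k in range(1, n_spillover + 1):
        pos = critical_idx + k
        out[f"spillover_{k}"] = rts_ms[pos] if pos < len(rts_ms) else None
    return out


# Hypothetical trial: the slowdown shows up after the critical word (index 2).
trial = [310.0, 295.0, 330.0, 420.0, 480.0, 350.0]
regions = region_rts(trial, critical_idx=2)
```

A `None` spillover value signals that the critical word sits too close to the end of the sentence, which is a design problem worth catching before data collection.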


Step 2: Configure Timing Parameters

Parameter	Recommended Value	Rationale
Response timeout	None (self-paced) or 3000-5000 ms per region	No timeout is standard for in-lab SPR; timeout prevents excessively slow responses in web-based studies (Boyce et al., 2020)
Inter-stimulus interval (ISI)	0 ms for non-cumulative moving window	Standard practice; the next word appears immediately when the previous is masked (Just et al., 1982)
ISI for phrase-by-phrase	0 ms (typical)	Any nonzero ISI introduces a blank that disrupts reading and may introduce strategic pausing
Pre-sentence fixation	Fixation marker (e.g., +) for 500-1000 ms	Orients attention to display location; standard in SPR (Jegerski, 2014)
Post-sentence delay	0-500 ms before comprehension question	Brief delay prevents motor interference between last word button-press and question response
Practice trials	6-10 items minimum	Familiarizes participants with button-press rhythm and comprehension questions; use different sentences than experimental items (Jegerski, 2014; Keating & Jegerski, 2015)
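
The timing recommendations in the table above could be captured in a small configuration object that flags out-of-range settings. This is only a sketch: the class name `SPRTiming`, its field names, and the warning messages are hypothetical, and the defaults are simply one choice within the recommended ranges.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class SPRTiming:
    """Timing settings for one SPR experiment (all durations in ms)."""
    response_timeout_ms: Optional[int] = None  # None = fully self-paced
    isi_ms: int = 0
    fixation_ms: int = 500
    post_sentence_delay_ms: int = 0
    n_practice_trials: int = 6

    def warnings(self) -> List[str]:
        """List settings that fall outside the ranges in the table above."""
        msgs = []
        timeout = self.response_timeout_ms
        if timeout is not None and not 3000 <= timeout <= 5000:
            msgs.append("response timeout outside 3000-5000 ms")
        if self.isi_ms != 0:
            msgs.append("nonzero ISI inserts a blank between regions")
        if not 500 <= self.fixation_ms <= 1000:
            msgs.append("fixation outside 500-1000 ms")
        if not 0 <= self.post_sentence_delay_ms <= 500:
            msgs.append("post-sentence delay outside 0-500 ms")
        if self.n_practice_trials < 6:
            msgs.append("fewer than 6 practice trials")
        return msgs


default_warnings = SPRTiming().warnings()
web_warnings = SPRTiming(response_timeout_ms=2000, isi_ms=100).warnings()
```

Returning warnings rather than raising errors is a deliberate choice here: a study may have good reasons to depart from a recommendation, but the departure should be visible.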

Step 4: Design Comprehension Questions

Parameter	Recommendation	Rationale
Proportion of trials with questions	1/3 to 1/2 of all trials (experimental + filler)	Fewer than 1/3: participants may stop reading carefully; more than 1/2: task becomes tedious, and participants may shift to a question-anticipation strategy (Just et al., 1982; Jegerski, 2014)
Answer balance	50% yes / 50% no for yes/no questions	Prevents response bias toward one answer
Question content	Target semantic content of the sentence, NOT the critical manipulation	Questions about the manipulation teach participants what you are studying, inducing strategic reading (Jegerski, 2014)
Accuracy exclusion threshold	>80% correct to retain participant	Standard criterion; lower accuracy suggests the participant was not reading for comprehension (Jegerski, 2014; common practice across SPR studies)
Question timing	Immediately after the sentence (or after the final button press)	Delayed questions test memory, not comprehension
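
The numeric guidelines in this table (question rate, yes/no balance, accuracy threshold) lend themselves to an automated check. The sketch below assumes a presentation list is represented as one entry per trial, with `"yes"` or `"no"` for trials followed by a question and `None` otherwise; the function names and this representation are illustrative assumptions.

```python
from typing import List, Optional


def check_question_design(answers: List[Optional[str]]) -> List[str]:
    """Check question rate and yes/no balance for one presentation list.

    answers: one entry per trial (experimental + filler); "yes" or "no"
    for trials followed by a question, None for trials without one.
    """
    if not answers:
        raise ValueError("the presentation list is empty")
    asked = [a for a in answers if a is not None]
    problems = []
    rate = len(asked) / len(answers)
    if not 1 / 3 <= rate <= 1 / 2:
        problems.append(f"question rate {rate:.2f} outside 1/3 to 1/2")
    if asked.count("yes") != asked.count("no"):
        problems.append("yes/no answers are not balanced 50/50")
    return problems


def retain_participant(n_correct: int, n_questions: int,
                       threshold: float = 0.80) -> bool:
    """Apply the >80% accuracy criterion from the table above."""
    return n_correct / n_questions > threshold


# Hypothetical list: 10 trials, 4 questions, 2 "yes" and 2 "no".
answers = ["yes", None, "no", None, None, "no", None, "yes", None, None]
design_problems = check_question_design(answers)
```

Whether a question targets the critical manipulation cannot be checked mechanically; that part of the guidelines still needs a human reviewer.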

Items Per Condition

Population	Minimum Items per Condition	Rationale
L1 speakers, robust effect (e.g., garden-path)	24 items per condition	Sufficient for medium-to-large effects in mixed models (Keating & Jegerski, 2015)
L1 speakers, subtle effect (e.g., pragmatic inference)	32-40 items per condition	Smaller effects require more items for adequate power (Keating & Jegerski, 2015; Brysbaert & Stevens, 2018)
L2 speakers	32-40 items per condition	Higher variability in L2 populations requires more observations (Marsden, Thompson, & Plonsky, 2018)
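
To make the item counts concrete, the sketch below builds Latin square presentation lists, in which each participant sees every item in exactly one condition and conditions rotate across lists. It assumes "items per condition" means observations per condition per participant, so a two-condition design with 24 items per condition needs 48 item sets. The function name `latin_square_lists` and the condition labels are hypothetical.

```python
from typing import Dict, List, Tuple


def latin_square_lists(n_items: int,
                       conditions: List[str]) -> Dict[int, List[Tuple[int, str]]]:
    """Assign each item to one condition per list, rotating across lists.

    Returns {list_number: [(item_id, condition), ...]} with one list per
    condition, so every item appears in every condition across lists
    but only once within a list.
    """
    k = len(conditions)
    if n_items % k != 0:
        raise ValueError("n_items must be a multiple of the number of conditions")
    return {
        lst: [(item, conditions[(item + lst) % k]) for item in range(n_items)]
        for lst in range(k)
    }


# 2 conditions x 24 items per condition = 48 item sets in total.
lists = latin_square_lists(n_items=48, conditions=["ambiguous", "unambiguous"])
per_condition = sum(1 for _, cond in lists[0] if cond == "ambiguous")
```

Trial order within each list would still need to be randomized and interleaved with fillers before presentation.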

Filler Items

Parameter	Recommendation	Rationale
Filler-to-experimental ratio	2:1 or 3:1 (fillers : experimental items)	Prevents participants from identifying the experimental pattern; higher ratios reduce strategic processing (Keating & Jegerski, 2015)
Filler variety	Include multiple sentence types, lengths, and structures	Monotonous fillers fail to mask the experimental manipulation
Filler complexity	Include some fillers of similar complexity to experimental items	If only experimental items are complex, participants learn to attend differently to them
Comprehension questions on fillers	Yes -- at least the same rate as on experimental items	If questions only follow experimental items, participants learn that complex sentences predict questions
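
The filler recommendations translate into simple arithmetic for sizing a presentation list. The helper below is a sketch under the assumption that the same question rate is applied to fillers and experimental items; the function name `filler_plan` and its parameters are illustrative.

```python
from typing import Dict


def filler_plan(n_experimental: int, ratio: int = 2,
                question_rate: float = 0.5) -> Dict[str, int]:
    """Compute filler and question counts for one presentation list.

    ratio: fillers per experimental item (2 or 3 in the table above).
    question_rate: proportion of trials followed by a question, applied
    equally to fillers and experimental items.
    """
    n_fillers = ratio * n_experimental
    return {
        "fillers": n_fillers,
        "total_trials": n_fillers + n_experimental,
        "questions_on_experimental": round(question_rate * n_experimental),
        "questions_on_fillers": round(question_rate * n_fillers),
    }


# Hypothetical study: 48 experimental items at a 2:1 filler ratio.
plan = filler_plan(n_experimental=48, ratio=2)
```

Running the numbers early is useful because a 3:1 ratio with 48 experimental items already yields 192 trials, which affects session length and fatigue.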

Step 6: Decide Between SPR and Eye-Tracking

Criterion	SPR	Eye-Tracking
Equipment cost	Low (any computer)	High (dedicated eye-tracker, ~$20,000-$50,000)
Online data collection	Yes (web-based SPR and Maze work well)	No (requires in-lab calibration)
Temporal resolution	Word-by-word, with substantial spillover	Multiple fixation measures (first fixation, gaze duration, go-past, total time, regressions)
Regressions	Not measurable (non-cumulative display prevents rereading)	Yes -- regressions are a primary measure of reanalysis
Ecological validity	Moderate (button-press is unnatural, but spatial layout preserved)	Higher (closer to natural reading)
Sensitivity to early/late processing stages	Low (only a single RT per region, which blends all processing stages)	High (first-pass vs. second-pass measures separate early from late processing; Rayner, 1998)
Best for	Robust syntactic/semantic effects, web-based or underfunded studies, L2 populations without lab access	Nuanced temporal dynamics, distinguishing processing stages, studying regressions, garden-path recovery

Self Paced Reading Designer

Research Planning Protocol

⚠️ Verification Notice

Core Workflow

Step 1: Select a Presentation Method

1A. Non-Cumulative Moving Window (Standard)

1B. Cumulative Moving Window

1C. Phrase-by-Phrase Presentation

1D. Centered (RSVP-style) Presentation

1E. Maze Task (Modern Alternative)

Step 2: Configure Timing Parameters

Step 3: Design Critical Regions

Core Principles

Step 4: Design Comprehension Questions

Guidelines

Example of Good vs. Bad Comprehension Questions

Step 5: Design Item and Condition Structure

Latin Square Design

Items Per Condition

Filler Items

Step 6: Decide Between SPR and Eye-Tracking

Common Pitfalls

Quick Reference: SPR Design Checklist

References
