Comprehensive AI writing detection patterns and methodology. Provides vocabulary lists, structural patterns, model-specific fingerprints, and false positive prevention guidance. Use when analyzing text for AI authorship or understanding detection patterns.
Expert-level knowledge base for detecting AI-generated text, compiled from academic research, commercial detection tools, and empirical analysis.
These indicators strongly suggest AI authorship when found together:
High-signal words (50-700x more common in AI text):
Overused phrases:
See reference/vocabulary-patterns.md for complete lists.
See reference/structural-patterns.md for details.
See reference/content-patterns.md for details.
**Bold Header**: description patternSee reference/formatting-patterns.md for details.
See reference/markup-artifacts.md for details.
url=URL, date=2025-XX-XXSee reference/citation-patterns.md for details.
Layer 1: Technical Artifact Scan (Definitive)
Layer 2: Vocabulary Pattern Matching
Layer 3: Structural Analysis
Layer 4: Content Pattern Analysis
Layer 5: Citation Verification
Layer 6: Formatting Analysis
Layer 7: Stylometric Observation
Layer 8: Coherence Check
Layer 9: Confidence Scoring
Different AI models have distinct "fingerprints":
| Model | Key Tells | Technical Artifacts |
|---|---|---|
| ChatGPT/GPT-4 | "delve" (pre-2025), "tapestry", tricolons, em dashes, curly quotes | turn0search, oaicite, utm_source=chatgpt.com |
| Claude | Analytical structure, extended analogies, cautious qualifications | None (uses straight quotes, no tracking) |
| Gemini | Conversational synthesis, fact-dense paragraphs | None (uses straight quotes, no tracking) |
| DeepSeek | Similar to ChatGPT, curly quotes | Curly quotation marks |
| Grok | X/Twitter integration | <grok_card> XML tags |
| Perplexity | Source-focused output | [attached_file:1], [web:1] tags |
Important dates:
See reference/model-fingerprints.md for detailed model patterns.
Critical requirements:
High false-positive risk groups:
Ineffective indicators (do NOT rely on these):
Signs of human writing:
See reference/false-positive-prevention.md for detailed guidance.
Structure findings as:
**Overall Assessment**: [Likely AI / Possibly AI / Likely Human / Inconclusive]
**Confidence**: [Low / Medium / High]
**Summary**: 2-3 sentence overview
**Evidence Found**:
- [Category]: [Specific indicator] - "[Quote from text]"
- [Category]: [Specific indicator] - "[Quote from text]"
**Mitigating Factors**: [Elements suggesting human authorship]
**Caveats**: [Limitations, alternative explanations]
This knowledge base synthesizes research from: