Name: Fact Checker
Author: kipeum86

Buscar habilidades.../

Fact Checker | Skills Pool

System:

{ "skipped": true, "reason": "Quick Mode / KR single-jurisdiction confirmed" }

Type	Examples	Priority
`statute_article`	"GDPR Art. 17", "상법 제342조"	HIGH
`case_citation`	"Case C-131/12", "대법원 2020다12345"	HIGH
`effective_date`	"in force 25 May 2018", "2023년 1월 개정"	HIGH
`numerical_threshold`	"72-hour notification", "14-day cooling-off"	HIGH
`official_doc_title`	"Regulation (EU) 2016/679", "개인정보 보호법"	MEDIUM
`regulatory_body_name`	"ICO", "EDPB", "금융위원회"	MEDIUM
`penalty_figure`	"up to €20 million or 4% global turnover"	HIGH
`operative_language`	"not encrypted or redacted", safe harbor 조건, 정의 조항의 핵심 문구를 법문에서 직접 인용한 경우	HIGH

{
  "anchors": [
    {
      "id": "A001",
      "type": "statute_article",
      "claim": "GDPR Article 17 grants data subjects a right to erasure",
      "jurisdiction": "EU",
      "source_code": "[P1]",
      "source_url": "https://eur-lex.europa.eu/...",
      "pre_verified": false
    },
    {
      "id": "A008",
      "type": "operative_language",
      "claim": "Cal. Civ. Code §1798.82(h)(1) defines PI to include data 'not encrypted or redacted'",
      "quoted_text": "not encrypted or redacted",
      "parent_statute": "Cal. Civ. Code §1798.82(h)(1)",
      "similar_statutes": ["Cal. Civ. Code §1798.81.5(d)(1)(A)"],
      "jurisdiction": "US-CA",
      "source_code": "[P3]",
      "source_url": "https://leginfo.legislature.ca.gov/...",
      "pre_verified": false
    }
  ]
}


---

## Phase 2 — Prioritization

Sort anchors for verification in this order:

1. HIGH priority + no source URL → verify first (highest hallucination risk)
2. HIGH priority + source URL but no direct quote → verify next
3. MEDIUM priority + no source URL → verify if budget permits
4. Any anchor already directly quoted from a confirmed fetch → mark `Verified`, skip verification

---

## Phase 3 — Spot-Check (Budget-Constrained)

### Token budget

| Jurisdictions | Max fetches per jurisdiction | Rationale |
|---|---|---|
| 1 | 3 | Baseline |
| 2–3 | 3 per jurisdiction | Standard multi-jurisdiction |
| 4+ | 2 per jurisdiction | Deep-researcher mode; budget tighter |

### Verification procedure per anchor

1. **Source URL available**: fetch the specific article/section; confirm the claim against retrieved text
2. **No source URL**: use one search query to locate the primary source, then fetch; if not found in one attempt → mark `Unverified`
3. **Korean law**: fetch from law.go.kr first; confirm article number and text
4. **EU law**: fetch from eur-lex.europa.eu first
5. **Record result**: `Verified` / `Unverified` / `Contradicted`

### PDF/DOCX Source Verification

When a source URL or file path points to a PDF or DOCX document:

1. **Before verification**, convert the document to Markdown using `mcp__markitdown__convert_to_markdown` with the source URI
2. **Search the converted text** for the specific claim being verified:
   - For `statute_article` anchors: search for the article number (e.g., "제17조", "Article 17")
   - For `case_citation` anchors: search for the case number
   - For `numerical_threshold` anchors: search for the specific number
   - For `effective_date` anchors: search for date patterns
3. **Text matching rules:**
   - Exact article number match required for `statute_article` anchors
   - Date must match within the same document section for `effective_date` anchors
   - Numerical values must match exactly for `numerical_threshold` anchors
4. **If the PDF is from `library/`:**
   - Check `knowledge/library-converted/` for an existing Markdown version first
   - If available, use the pre-converted version (saves a conversion call)
   - If not available, convert inline using markitdown
5. **Failure handling:**
   - If markitdown fails to convert the PDF → mark the anchor as `Unverified — PDF text extraction failed`
   - Do not count a failed PDF conversion against the token budget

### Status definitions

| Status | Meaning |
|---|---|
| `Verified` | Source text confirms the claim within acceptable paraphrase |
| `Unverified` | Could not confirm within budget — not necessarily wrong |
| `Contradicted` | Source text materially conflicts with the stated claim |

---

## Phase 3.3 — Similar-Statute Cross-Check (Operative Language)

**Trigger:** `operative_language` 앵커가 1개 이상 존재하고, 해당 앵커의 `similar_statutes` 목록이 비어 있지 않을 때.

### Procedure

1. 각 `operative_language` 앵커에 대해:
   a. `parent_statute`로 지정된 subsection 텍스트를 1차 소스에서 fetch (이미 Phase 3에서 fetch된 법령 텍스트 재사용 — 추가 API 호출 불필요)
   b. `quoted_text`가 해당 subsection에 **verbatim으로** 존재하는지 word-level 대조 (trivial formatting variance — 공백, 구두점 — 는 허용)
   c. 미발견 시:
      - 동일 법령의 인접 subsection에서 검색
      - `similar_statutes` 목록의 각 법령에서 검색
      - 다른 위치에서 발견 → `Contradicted` + 정확한 출처 기록 (예: "quoted text found in §1798.81.5(d)(1)(A), not in §1798.82(h)(1)")
      - 어디에서도 미발견 → `Contradicted — quoted language not located in primary source`
   d. 이 검증은 per-jurisdiction token budget에 산입하지 않음

2. **Similar-Statute Disambiguation Table** 작성 (내부 working note — claim registry에 첨부):

| Quoted Phrase | Attributed To | Actually Found In | Match? |
|---|---|---|---|
| "not encrypted or redacted" | §1798.82(h)(1) | §1798.81.5(d)(1)(A) | MISMATCH |
| "first name or first initial and last name" | §1798.82(h)(1) | §1798.82(h)(1) | OK |

3. **MISMATCH 행이 있으면:**
   - 해당 앵커를 자동 `Contradicted`로 설정
   - 정확한 위치를 claim registry `note` 필드에 기록
   - Step 3 Statute Boundary Table이 있으면 해당 표와 대조하여 일관성 확인

### Phase 3.3 Output (claim registry에 추가)

```json
{
  "similar_statute_check": {
    "anchors_checked": 2,
    "mismatches": [
      {
        "anchor_id": "A008",
        "quoted_text": "not encrypted or redacted",
        "attributed_to": "Cal. Civ. Code §1798.82(h)(1)",
        "actually_found_in": "Cal. Civ. Code §1798.81.5(d)(1)(A)",
        "status": "Contradicted"
      }
    ]
  }
}

Pattern	ID	Description	Action
Interpretation presented as fact	`interpretation_unverified`	A secondary source's interpretation of a statute/regulation is stated as the law itself, without the primary source having been directly fetched and confirmed	Flag; fetch primary source to verify
Phantom citation	`phantom_citation`	A source cites a specific article/section number but the cited primary source was never actually fetched or confirmed in Step 3	Flag; fetch the cited primary source
Source laundering	`laundering_risk`	A secondary source paraphrases primary source content without pinpoint citation, and the analysis relies on this paraphrase as if it were the original text	Flag; either fetch primary or re-attribute to secondary

{
  "source_laundering_check": {
    "sources_scanned": 8,
    "flags": [
      {
        "source_code": "[S2]",
        "pattern": "laundering_risk",
        "description": "Law firm newsletter paraphrases GDPR Art. 17 requirements without pinpoint; primary text not fetched",
        "resolution": "Fetched GDPR Art. 17 directly from EUR-Lex; confirmed substance",
        "resolved": true
      },
      {
        "source_code": "[S4]",
        "pattern": "interpretation_unverified",
        "description": "Blog post claims '개인정보 보호법 제39조의3' requires X; article not directly verified",
        "resolution": "Marked [Unverified] — budget exhausted",
        "resolved": false
      }
    ],
    "summary": "2 flags detected, 1 resolved, 1 marked Unverified"
  }
}

{
  "generated_at": "YYYY-MM-DD",
  "step": "4",
  "jurisdictions_covered": ["EU", "US", "KR"],
  "budget_used": { "EU": 3, "US": 2, "KR": 1 },
  "summary": {
    "total": 12,
    "verified": 9,
    "unverified": 2,
    "contradicted": 1
  },
  "anchors": [
    {
      "id": "A001",
      "type": "statute_article",
      "claim": "GDPR Article 17 grants data subjects a right to erasure",
      "jurisdiction": "EU",
      "status": "Verified",
      "verified_source": "https://eur-lex.europa.eu/legal-content/EN/TXT/?uri=CELEX:32016R0679",
      "note": "Confirmed: Art. 17(1) text retrieved directly"
    },
    {
      "id": "A002",
      "type": "numerical_threshold",
      "claim": "72-hour breach notification to supervisory authority",
      "jurisdiction": "EU",
      "status": "Verified",
      "verified_source": "https://eur-lex.europa.eu/...",
      "note": "Confirmed: GDPR Art. 33(1)"
    },
    {
      "id": "A003",
      "type": "case_citation",
      "claim": "Schrems II — Case C-312/18",
      "jurisdiction": "EU",
      "status": "Contradicted",
      "verified_source": "https://curia.europa.eu/...",
      "note": "CORRECTION: Actual case number is C-311/18, not C-312/18. Prior output had wrong number."
    }
  ]
}

[Step 4 — Fact-Check Complete]
Anchors checked: 12 | Verified: 9 | Unverified: 2 | Contradicted: 1
Source laundering flags: 2 (1 resolved, 1 unresolved)
Registry: output/claim-registry.json

⚠ Contradicted (1): A003 — Schrems II case number corrected (C-312/18 → C-311/18)
Unverified (2): A007 (US — CCPA §1798.100 text unconfirmed), A011 (JP — Act No. 57 date unconfirmed)
⚠ Laundering (1 unresolved): [S4] — 개인정보 보호법 제39조의3 interpretation from blog, primary not fetched
Similar-statute cross-check: 2 anchors checked, 0 mismatches

Condition	Response
Budget exhausted, HIGH-priority anchors still unverified	Mark remaining as `Unverified`, proceed, flag in Step 8
Source unreachable	Try one alternative URL; if still unreachable → `Unverified`
Search returns no primary source result	Mark `Unverified`, do not guess
No anchors found in Step 3 output	Pass-through with empty registry; proceed to Step 5
Step 3 output entirely from confirmed primary fetches	Mark all as pre-Verified; registry populated, Step 5 proceeds

Fact Checker

Fact-Checker (Step 4)

Trust Boundary (MANDATORY)

Fact Checker

Fact-Checker (Step 4)

Trust Boundary (MANDATORY)

Purpose

Trigger Conditions

Phase 1 — Anchor Extraction

What IS an anchor (extract these)

What is NOT an anchor (do not extract)

Output format (internal working list)

Phase 3.5 — Source Laundering Detection

Detection Patterns

Procedure

Output (appended to claim registry)

Phase 4 — Claim Registry Output

Phase 5 — Inline Summary

Contradicted Anchor Handling

Downstream Usage (Steps 5–8)

Failure Handling

Session Logs

OpenClaw Test Heap Leaks

Node Connect

Openclaw Qa Testing

Openclaw Secret Scanning Maintainer

Flags