Name: Repo Qa
Author: VNU-VJU


Keep user-facing updates in Japanese.
Keep repository-facing names, branch names, PR titles/bodies, and commit messages in English.
Never add AI co-author trailers such as Co-authored-by.
Prefer deterministic checks and scripts before model judgment.
Run structural checks and table-integrity checks before model judgment whenever PDF-derived tables are involved.
External tools: Python must not be used for document processing, QA checks, or transcription. Use existing Node.js scripts and shell commands. External translation APIs (DeepL, Google Translate, etc.) are prohibited.
Gemini model selection:
- For initial PDF transcription / first-pass generation: prefer the -preview variants of the latest Flash Lite, then Flash; fall back to non-preview Flash Lite → Flash → Pro only if the preview models fail or are unavailable.
- For QA re-checks, second-pass verification, or quality audits of transcriptions: prefer Gemini Pro (current generation). Use Flash only if Pro is unavailable or rate-limited.
- Do not guess model IDs; confirm available models for the session before invoking Gemini.
After deterministic checks, the currently running generative AI model performs the QA review pass. Do not call external AI services, external APIs, or separate AI applications for QA review.
AI translation is performed by the currently running generative AI model (the AI that is executing this skill). No external translation APIs, libraries, or third-party AI services are permitted.
- The following conditions are mandatory for all translations:
  1. The VI transcription must be complete (no placeholders, no truncation) before translation begins.
  2. All glossary terms from the VJU Glossary must be applied: ĐHQGHN→VNU/ベトナム国家大学ハノイ校, Giám đốc ĐHQGHN→President/総長 (NOT "Director"/"ディレクター"), 副学長→Vice President/副総長 (NOT "副社長"), KT.→Acting/代理署名, ベトナム日本大学→日越大学, etc. The correct English name for ĐHQGHN is "Vietnam National University, Hanoi" (NOT "Hanoi National University").
  3. The translation must preserve the full structural layout of the VI source: all articles, appendices, tables, and signature blocks.
  4. After translation, run check_structure.js and check_disclaimer_issuer_link.js.
  5. Update tmp/qa_status.json to record translation method as claude-ai (or the appropriate model identifier).
- Token budget rule: Monitor remaining context tokens throughout the run. If the remaining context drops below 30%, immediately stop translation work, report progress to the user, and await further instruction. Do not start a new translation that cannot be completed within the remaining budget.
- Parallel translation limit: Do not run more than 2 translation or large-content-generation tasks in parallel at the same time. Queue additional work and start it only after one of the active tasks completes.
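
The token budget rule and the parallel limit can be expressed as two small guard functions. This is a minimal sketch: the 30% threshold and the limit of 2 come from the rules above, while the function names and the idea that the caller can supply token counts are assumptions for illustration.

```javascript
// Hypothetical guards; only the two constants are taken from the rules above.
const MIN_REMAINING_CONTEXT_RATIO = 0.30;
const MAX_PARALLEL_TASKS = 2;

// True when remaining context has dropped below 30% and translation must stop.
function mustStopTranslation(remainingTokens, totalTokens) {
  if (totalTokens <= 0) {
    throw new RangeError("totalTokens must be positive");
  }
  return remainingTokens / totalTokens < MIN_REMAINING_CONTEXT_RATIO;
}

// True when another translation or large-generation task may start now.
function canStartTask(activeTaskCount) {
  return activeTaskCount < MAX_PARALLEL_TASKS;
}
```

When `canStartTask` returns false, the new task stays queued until one of the active tasks completes.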
Transient API error retry policy: When an API call fails with a transient error (network timeout, 503, 429 rate-limit, or similar recoverable errors), retry using this backoff schedule:
1. Wait 1 minute, then retry (first retry).
2. Wait 5 minutes, then retry (second retry).
3. Wait 10 minutes before each subsequent retry.
- Do not retry immediately. Do not use shorter intervals than those listed above.
- If the error persists after 3 retries, record it as a blocker and move on to the next safe task.
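
The backoff schedule above can be written as a lookup, which keeps the intervals in one place. This is a sketch under assumptions: the function name is hypothetical, and returning `null` to mean "record a blocker and move on" is an illustrative convention, not something the skill defines.

```javascript
// Wait times in minutes before the 1st, 2nd, and 3rd retry (from the policy above).
const RETRY_WAIT_MINUTES = [1, 5, 10];
const MAX_RETRIES = 3;

// retryNumber is 1-based. Returns the wait in minutes before that retry,
// or null once the retries are exhausted and the error should be a blocker.
function waitBeforeRetry(retryNumber) {
  if (!Number.isInteger(retryNumber) || retryNumber < 1) {
    throw new RangeError("retryNumber must be a positive integer");
  }
  if (retryNumber > MAX_RETRIES) {
    return null;
  }
  return RETRY_WAIT_MINUTES[retryNumber - 1];
}
```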
Post-document break: After each document set reaches complete status, wait 5 minutes before beginning work on the next document set. This cooldown applies after every completed document regardless of whether API errors occurred.
If the VI transcription is incomplete or blocked, do not proceed to translation — fix transcription first.
Treat the public glossary spreadsheet as the primary terminology reference: VJU Glossary.
Normalize ベトナム日本大学 to 日越大学, and use the glossary for other organization names, titles, abbreviations, and recurring legal terms.
If a term is ambiguous, check the glossary before making a local editorial choice.
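
A deterministic scan for known wrong renderings can run before any model judgment. The sketch below uses only the unambiguous pairs named in this skill; the function name and the list structure are assumptions, and the real glossary spreadsheet remains the authority. Context-dependent terms such as "Director" are left out on purpose because they need a human or model check.

```javascript
// Wrong renderings and their glossary replacements, taken from the rules above.
const FORBIDDEN_RENDERINGS = [
  { wrong: "Hanoi National University", right: "Vietnam National University, Hanoi" },
  { wrong: "ベトナム日本大学", right: "日越大学" },
  { wrong: "副社長", right: "副総長" },
];

// Returns the forbidden renderings that appear in the given translation text.
function findGlossaryViolations(text) {
  return FORBIDDEN_RENDERINGS.filter(({ wrong }) => text.includes(wrong));
}
```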
When translation introduces a glossary gap, add the term to the glossary sheet through Google Sheets API with Category left blank and fill the remaining fields through the normal spreadsheet update workflow.
During long runs, briefly re-open this skill and the relevant checklist section about every 5 minutes or after each major phase boundary.

git fetch origin
git status        # confirm the local checkout is up to date
git pull --ff-only  # pull if needed

# Untracked non-compliant PDFs (highest priority — not yet committed)
git ls-files --others --exclude-standard data/ | grep "\.pdf$" | grep -v "_source\.pdf$"

# Tracked non-compliant PDFs (already committed)
git ls-files data/ | grep "\.pdf$" | grep -v "_source\.pdf$"

# All source PDFs missing their base transcription
ls data/public/*_source.pdf | sed 's/_source\.pdf$//' | while IFS= read -r base; do
  [ ! -f "${base}_transcription.md" ] && echo "UNPROCESSED: $(basename "$base")"
done
ls data/confidential/*_source.pdf 2>/dev/null | sed 's/_source\.pdf$//' | while IFS= read -r base; do
  [ ! -f "${base}_transcription.md" ] && echo "UNPROCESSED: $(basename "$base")"
done
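
The same two audits can be sketched in Node.js over a plain list of file names, which makes the logic easy to test without touching the repository. The function names are hypothetical; only the `_source.pdf` and `_transcription.md` suffixes come from the commands above.

```javascript
const SOURCE_SUFFIX = "_source.pdf";

// PDFs whose names do not follow the *_source.pdf convention.
function findNonCompliantPdfs(fileNames) {
  return fileNames.filter(
    (name) => name.endsWith(".pdf") && !name.endsWith(SOURCE_SUFFIX)
  );
}

// Base names of source PDFs that have no matching *_transcription.md file.
function findUnprocessed(fileNames) {
  const known = new Set(fileNames);
  return fileNames
    .filter((name) => name.endsWith(SOURCE_SUFFIX))
    .map((name) => name.slice(0, -SOURCE_SUFFIX.length))
    .filter((base) => !known.has(`${base}_transcription.md`));
}
```

Both functions expect names from a single directory listing, for example the result of `fs.readdirSync` on `data/public`.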

The QA checklist defines pass/fail criteria. Do not weaken it during execution.
If a document fails a check and the issue is safely fixable, fix it and rerun the relevant checks.
Do not change checklist criteria or downgrade a failure to make the run pass.
If the issue cannot be fixed safely, keep the item open and report why.
Public reader validation must include the actual browser-rendered output, not only file-level markdown checks, whenever the document uses HTML blocks (<p align=...>, tables, embedded divs) or mixed markdown lists.
A document does not pass if the browser reader still shows literal markdown markers such as - , * , **bold**, or raw heading/list syntax where structured HTML should appear.
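
A rough first filter for this rule is to scan the visible text of the rendered page for leftover markers. The sketch below assumes the caller has already extracted that text from the browser (for example via `innerText`); the function name and patterns are illustrative. A hit is only a signal, since a source document can legitimately contain lines that begin with a dash.

```javascript
// Patterns for markdown syntax that should not survive into rendered text.
const LITERAL_MARKDOWN_PATTERNS = [
  { name: "list marker", pattern: /^\s*[-*] \S/m },
  { name: "bold marker", pattern: /\*\*[^*\n]+\*\*/ },
  { name: "heading marker", pattern: /^\s*#{1,6} \S/m },
];

// Returns the names of the marker kinds found in the rendered text.
function findLiteralMarkdown(renderedText) {
  return LITERAL_MARKDOWN_PATTERNS
    .filter(({ pattern }) => pattern.test(renderedText))
    .map(({ name }) => name);
}
```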
For any confidential document change, confidential metadata change, or reader/deployment change that can affect restricted docs, run node scripts/check_confidential_readiness.js after node scripts/build-search-index.js.
check_confidential_readiness.js must finish with Errors: 0. If it reports warnings (for example a missing Drive map entry), mention them explicitly in the user report and keep the affected document open when the warning is user-visible.
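
If the run is scripted, the `Errors: 0` requirement can be checked mechanically. This sketch assumes the script prints a line of the form `Errors: N` to standard output, which is inferred from the rule above rather than confirmed against the script itself; the function names are hypothetical.

```javascript
// Extracts N from a line like "Errors: N"; returns null if no such line exists.
function parseErrorCount(stdout) {
  const match = /^\s*Errors:\s*(\d+)\s*$/m.exec(stdout);
  return match ? Number(match[1]) : null;
}

// Passes only when an error count is present and equals zero.
function readinessPassed(stdout) {
  return parseErrorCount(stdout) === 0;
}
```

A missing `Errors:` line counts as a failure here, so a crashed or truncated run is not mistaken for a pass.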
Confidential reader QA must verify that backend content-not-found / document-not-found failures are surfaced as explicit availability problems, not misreported as generic browser-side Firestore permission errors.
Confidential reader QA must verify that one missing language variant does not blank the whole reader when other variants are available; available panes must still render and unavailable languages must be clearly indicated.
Source PDF preview validation must confirm that the right pane renders actual PDF canvases/pages, not only that the _source.pdf URL returns HTTP 200.
When diagnosing source preview issues, check for client-side rerender failures caused by detached ArrayBuffer reuse, ResizeObserver-triggered rerenders, or other PDF.js lifecycle errors in the browser console.
Preserve layout fidelity in official headers, centered blocks, appendices, form structures, and signature sections.
When reviewing or regenerating PDF-derived transcriptions/translations, keep the PDF's structural layout intact in Markdown as far as the format permits: page breaks, heading hierarchy, table columns/cells, merged cells, appendices, footnotes, and signature blocks must remain traceable.
Treat non-PDF helper artifacts as temporary recovery inputs only; remove them after the transcription QA for that document set is complete.
Heading-count mismatches are only signals. Confirm semantic structure before editing.
List-item mismatches are only signals. Sub-bullet formula notation in VI source may inflate list counts relative to EN/JA translations. Always verify semantic completeness before treating a list-item count difference as a failure.
Do not treat placeholders, TODO text, or "translation will be provided later" notes as completed restoration.
DISCLAIMER must be present in all language variants independently. VI, EN, and JA transcription files each require their own DISCLAIMER block. Missing DISCLAIMER in VI while EN/JA have it is a defect — fix it.
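
The per-language requirement can be illustrated with a check that treats each variant independently. This sketch assumes the block contains the literal string `DISCLAIMER`; the exact marker format and the function name are assumptions, and the repository's own check scripts remain the authority.

```javascript
// variants maps a language code to that variant's file content.
// Returns the language codes whose content has no DISCLAIMER marker.
function findMissingDisclaimers(variants) {
  return Object.entries(variants)
    .filter(([, text]) => !text.includes("DISCLAIMER"))
    .map(([lang]) => lang);
}
```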
SOURCE_NOTE format: Both > **[SOURCE_NOTE]** (blockquote) and <div class="source-note"> (HTML div) are acceptable per the QA checklist. Do not convert one to the other.
Re-check: always update last_processed_at. When re-checking a previously passed document set (priority 5), update last_processed_at in tmp/qa_status.json regardless of whether fixes are applied.
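
A re-check update might look like the following. The shape of `tmp/qa_status.json` is not described here, so this sketch assumes an object keyed by document ID and an ISO 8601 timestamp; only the `last_processed_at` field name comes from the rule above.

```javascript
// Returns a copy of the status object with last_processed_at refreshed for docId.
// Other fields on the entry are preserved; the input object is not mutated.
function touchLastProcessed(status, docId, now = new Date()) {
  return {
    ...status,
    [docId]: {
      ...(status[docId] ?? {}),
      last_processed_at: now.toISOString(),
    },
  };
}
```

The caller is expected to read the JSON file, apply this function, and write the result back whether or not any fixes were made.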
Partial translations: A > **[PARTIAL TRANSCRIPTION]** / > **[部分転記]** notice is acceptable for very large technical documents where complete translation is impractical. The partial scope must be explicitly documented in the file.

Repo Qa

Values

Trigger Conditions

First-Invocation Briefing

Load Order

Runtime Routing

Reference Load Set

Core Rules

Work Window

Required Files

Workflow

Precheck (Step 1 — always first)

1a. Remote Sync

1b. Non-Compliant Filename Audit (highest priority)

Inventory And Priority

Doc ID Review (Step 5 — after transcription and translation complete)

Review criteria

Review process

qa_status.json field

Checks And Fixes

Count Reporting

Gemini Availability

Strict Reviewer Pass

Self-Check Gate

Stop Conditions

Expected Deliverables

PR Rule
