KoboToolbox Translation & Localization | Skills Pool
技能档案
KoboToolbox Translation & Localization
Translation and localization guidelines for KoboToolbox content from English to Arabic. Use when translating KoboToolbox materials including: (1) Academy courses and educational content, (2) User interface text and documentation, (3) Support articles, (4) Marketing materials, (5) Form building terminology, or (6) XLSForm technical terms. Covers tone, pronouns (tú/usted, tu/vous), gender-inclusive language, and official translations for brand terms and UI elements.
joshuaberetta0 星标2025年12月9日
职业
分类
文档
技能内容
Overview
Translate KoboToolbox content from English to Arabic with consistent terminology, appropriate tone, and cultural adaptation.
📹 For Video Subtitles/Transcripts: If you're translating SRT subtitle files, use the kobo-translation-srt skill extension which includes all base rules plus subtitle-specific guidelines for character limits, spoken language, and chunked translation.
Translation approach:
For NEW FILES (full translation):
Translate the complete source document provided
Use consistent terminology for reliable, repeatable translations
Follow all guidelines in this skill document
For UPDATES (diff-based translation):
You will receive ONLY the changed content from the source document
Translate ONLY what is provided between the markers
Do NOT translate anything outside the markers
Do NOT add explanations or extra content
相关技能
Output ONLY the translated version of the diff content
The system will automatically merge your translation with the existing translated file
Why diff-based? LLMs are non-deterministic, so re-translating an entire document produces slightly different results each time, creating "translation noise" in git diffs. By translating only actual changes, we preserve manual reviewer improvements and reduce unnecessary churn.
🚨 CRITICAL: Pre-Translation Checklist
BEFORE starting translation, read these reference files:
brand-terminology.md - For server names, Question Library, Formbuilder, and ALL brand terms
ui-terminology.md - For button names, tabs, and capitalization rules
Common mistakes to avoid:
❌ Translating server names incorrectly (missing articles or adding extra words)
❌ Not including English term on first reference for Formbuilder
❌ Missing capital article for "Question Library" / "La bibliothèque de questions" / "La biblioteca de preguntas"
❌ Not capitalizing UI terms like "Brouillon" / "Borrador"
❌ Using "de" article when it shouldn't be there in French data management terms
Quick Reference
Translation types:
OFFICIAL - Must use EXACT translation character-for-character (brand terms, UI elements, XLSForm terms)
Preferred - Can adapt for context (general terminology, course content)
Key principles:
ALWAYS check brand-terminology.md FIRST before translating any brand-related terms
START: Do I see ANY of these terms in the source text?
├─ Server names (Global Server, EU Server)?
│ └─ 🚨 STOP → Open brand-terminology.md → Use EXACT translation with articles
│
├─ "Question Library"?
│ └─ 🚨 STOP → Must be "La bibliothèque de questions" / "La biblioteca de preguntas" (capital L)
│
├─ "Formbuilder"?
│ └─ 🚨 STOP → First reference must include English in parentheses
│ ES: "editor de formularios de KoboToolbox (Formbuilder)"
│ FR: "l'interface de création de formulaires KoboToolbox (KoboToolbox Formbuilder)"
│
├─ UI element (button, tab, menu - like DEPLOY, NEW, FORM, DATA)?
│ └─ Check ui-terminology.md → Use OFFICIAL translation → Match UI capitalization
│
├─ Draft / Brouillon / Borrador?
│ └─ Capitalize in UI contexts: "Brouillon" / "Borrador"
│
├─ XLSForm technical term (list_name, cascading select)?
│ └─ Written: English + translation in parentheses
│ Subtitles: English only
│ See form-building-terms.md
│
├─ Question appearance (minimal, picker, rating)?
│ └─ Written: English + translation in parentheses
│ Subtitles: English only
│ See question-types.md
│
├─ Form building or data collection term?
│ └─ Check if OFFICIAL or PREFERRED
│ Apply OFFICIAL exactly; adapt PREFERRED for context
│ See relevant reference file
│
├─ Course or educational content?
│ └─ Use PREFERRED translations
│ Apply appropriate pronoun formality
│ See course-terminology.md
│
└─ Unsure about gender inclusivity?
└─ Use gender-neutral language
FR: vous + parenthetical markers (e)
ES: neutral constructions or tú with "se te"
Enhanced Quality Checklist
Before finalizing translation:
** CRITICAL - Brand & UI Terms:**
All server names use EXACT translations from brand-terminology.md (with articles!)
"Question Library" has capital article: "La bibliothèque" / "La biblioteca"
Formbuilder includes English on first reference
All UI elements (buttons, tabs) match ui-terminology.md exactly
HTML heading tags converted to markdown (## for h2, ### for h3, etc.)
All other HTML tags preserved and unchanged (iframe, section, etc.)
Internal documentation links kept as relative paths (they auto-resolve correctly)
Cross-language reference links updated to use directory paths (../en/, ../es/, ../fr/, ../ar/)
External links: translated text, unchanged URLs
Image paths unchanged
YouTube embed language parameters updated (cc_lang_pref, hl)
Arabic content wrapped in <section dir="rtl"> tags
Heading hierarchy maintained
Language & Style:
Correct formality level (vous/tu, usted/tú) for content type
Gender-inclusive language throughout (especially Spanish double forms)
XLSForm/technical terms follow English + translation pattern
Consistent terminology (same term for same concept)
Plain language, beginner-friendly
Proper acronym handling (full term + acronym first use)
Target language punctuation conventions
No slang or colloquialisms
Natural word order (not forced English structure)
Translation Error Examples
Real-World Translation Patterns
Example from actual translations:
Source English:
"KoboToolbox makes high quality data accessible to social impact organizations worldwide."
Key observations:
French naturally restructures: "makes data accessible" → "rend les données accessibles" (makes the data accessible)
Spanish restructures differently: "permite que los datos sean accesibles" (permits that the data be accessible)
Both translations adapt to natural target language expressions rather than forcing English structure
Source English:
"Designed by data collection practitioners specifically for challenging settings"
✅ CORRECT Arabic:
"مُصممة من قبل مختصين في جمع البيانات للتعامل مع الظروف الصعبة بشكل خاص"
Key observations:
Spanish uses gender-inclusive "personas profesionales" instead of literal "practitioners"
Spanish adds "Fue" (was) for natural past tense flow
Arabic restructures significantly: "designed... for dealing with difficult circumstances especially"
All prioritize natural expression over literal translation
Example 1: Server Names
Source: "Most users sign up for an account on our Global KoboToolbox Server."
Errors fixed:
Removed "de KoboToolbox" (not in official name)
Added gender inclusivity: "los/as usuarios/as"
Errors fixed:
Added definite article "Le" (capitalized)
Added gender inclusivity: "utilisatrices et utilisateurs"
Example 2: Question Library
Source: "Build a form using a template from the question library."
Error fixed: Capitalized "L" in "La" (brand feature name)
Example 3: Formbuilder First Reference
Source: "Create a new form using the KoboToolbox Formbuilder."
Error fixed: Added English term in parentheses on first reference
Example 4: HTML Headings to Markdown
Source (English with HTML tags):
<h3>Why KoboToolbox is unique</h3>
KoboToolbox is hosted and maintained by the international nonprofit organization...
<h3>Supporting global impact</h3>
KoboToolbox is the most widely used data collection tool...
Apoyamos el impacto global
KoboToolbox es la herramienta de recolección de datos más utilizada...
**Error fixed:** Converted HTML `<h3>` tags to markdown `##` (since h3 corresponds to ##)
**Key observations:**
- Always convert HTML heading tags to markdown format
- Maintain the heading level: h1→#, h2→##, h3→###, h4→####
- Keep all other HTML tags (iframe, section, etc.) intact
### Example 5: Internal and Cross-Language Links
**Source (English file in `docs/en/p_codes.md`):**
```markdown
# Including P-Codes in the Output Data
[Lire en français](../fr/p_codes.md) | [Leer en español](../es/p_codes.md)
If using cascading lists, please [follow the instructions](cascading_select.md)
for cascading selects.
Key observations:
Internal doc link cascading_select.md stays as-is (relative path auto-resolves to docs/es/cascading_select.md)
Cross-language links updated to use directory structure (../en/, ../fr/)
Link text translated appropriately for each language
Errors:
Cross-language links don't specify the language directory
Internal link incorrectly uses _es suffix instead of relying on relative path
Example 6: Natural Language Flow
Source: "To support our nonprofit users, we provide our tools for free under the Community Plan."
Key observations:
French: "for free" → "gratuitement" (as adverb, not "pour gratuit")
French: "under the plan" → "dans le cadre du plan" (in the framework of)
Spanish: "for free" → "de forma gratuita" (in free form)
Spanish: "nonprofit users" → "usuarios sin fines de lucro" (different word order)
Notes
Reporting issues:
If UI terminology needs correction, flag in comment and record in appropriate tracker or communications channel.
Working document:
This is a living guideline. Feedback welcome. More languages and terms will be added.