Grades HKDSE elective subject exam papers for a specified subject. Use this when asked to grade typed student answer PDFs against an official rubric. Handles rubric extraction from PDFs (including split-part PDFs), reference calibration from prior-year labeled data, per-student grading with per-question mark allocation, total-score compilation, and level division (1–5). Subject-agnostic: works for ICT, Music, Religion, Tourism, Visual Arts, Biology, Economics, BAFS, and similar subjects with typed written responses.
This skill grades HKDSE elective subject exam papers by extracting rubrics from PDFs, calibrating against prior-year reference data with known levels, then grading each student's typed/printed answer PDF on a per-question basis. It produces per-student JSON results and compiles them into final scores with level assignments (1–5).
Subjects supported: ICT, Music, Ethics and Religious Studies, Tourism and Hospitality Studies, Visual Arts, Biology, Economics, BAFS — and any similar subjects with typed written responses.
Key differences from Chinese Writing grading:
Sub-agents are mandatory. Each student MUST be graded by a dedicated sub-agent. No batch grading of multiple students in a single sub-agent call.
Model restrictions:
Do NOT pass a model parameter override to sub-agents — let them inherit the main agent's model (kimi-k2.5) exclusively. NO OTHER VLM MODELS ARE ALLOWED.
Gemini VLM for PDF extraction: Use gemini-3.1-pro via the Google Vertex AI
endpoint as the primary VLM for PDF-to-text extraction. Authenticate using the
service account JSON (project-f154aafa-a809-44c8-89f-70e8abc5e53a.json) in the
workspace root. A ready-to-use client is provided at scripts/vertex_client.py:
from scripts.vertex_client import get_openai_client
client = get_openai_client()
The module handles service-account authentication, token refresh, and
base-URL construction automatically. Use model name google/gemini-3.1-pro.
If the Gemini endpoint is unavailable, rate-limited, or returns errors, fall back
to Kimi (KIMI_BASE_URL, KIMI_API_KEY, KIMI_MODEL from env.txt).
ALIYUN as secondary fallback: If both Gemini and Kimi are unavailable or
rate-limited, switch to the ALIYUN endpoint using ALIYUN_BASE_URL,
ALIYUN_API_KEY, and ALIYUN_MODEL from env.txt.
You may also use ALIYUN in parallel with Gemini/Kimi when running multiple
extraction sub-agents simultaneously to avoid rate limits.
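The endpoint preference order above can be sketched as a small pure helper. `pick_endpoint` and its return shape are illustrative, not part of the provided scripts; only the env variable names come from env.txt.

```python
import os

# Preference order from this skill: Gemini first, then Kimi, then ALIYUN.
FALLBACK_ORDER = ["gemini", "kimi", "aliyun"]

def pick_endpoint(unavailable: set) -> dict:
    """Return the first usable endpoint config, skipping names in `unavailable`
    (endpoints that errored or hit rate limits). Env variable names match
    env.txt; the Gemini entry is served via scripts/vertex_client.py rather
    than raw env vars.
    """
    configs = {
        "gemini": {"model": "google/gemini-3.1-pro", "via": "vertex_client"},
        "kimi": {
            "model": os.getenv("KIMI_MODEL", ""),
            "base_url": os.getenv("KIMI_BASE_URL", ""),
            "api_key": os.getenv("KIMI_API_KEY", ""),
        },
        "aliyun": {
            "model": os.getenv("ALIYUN_MODEL", ""),
            "base_url": os.getenv("ALIYUN_BASE_URL", ""),
            "api_key": os.getenv("ALIYUN_API_KEY", ""),
        },
    }
    for name in FALLBACK_ORDER:
        if name not in unavailable:
            return {"name": name, **configs[name]}
    raise RuntimeError("All VLM endpoints are unavailable")
```

For parallel extraction, separate sub-agents could simply be handed different entries from `FALLBACK_ORDER` to spread load.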
PDF rendering DPI: When converting PDF pages to images (e.g., for VLM extraction), default to 150 DPI to ensure text is legible. If processing speed is a concern, lowering DPI is acceptable as a trade-off, but do not go below 100 DPI.
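A minimal sketch of the DPI rule, assuming the PyMuPDF convention that PDF user space is 72 points per inch (so the render matrix zoom is dpi / 72):

```python
def render_zoom(requested_dpi: int = 150) -> tuple:
    """Clamp the requested DPI to this skill's floor of 100 and return
    (dpi, zoom). PDF user space is 72 points per inch, so the PyMuPDF
    render matrix is fitz.Matrix(zoom, zoom) with zoom = dpi / 72.
    """
    dpi = max(100, requested_dpi)  # never render below 100 DPI
    return dpi, dpi / 72.0

# Usage with PyMuPDF (assumed installed):
#   import fitz
#   dpi, zoom = render_zoom(150)
#   pix = page.get_pixmap(matrix=fitz.Matrix(zoom, zoom))
```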
No LLM/VLM API calls for generating feedback or analysis text. See WARNING.md.
VLM is permitted only for tasks requiring genuine visual inspection — text
extraction from scanned PDFs where PyMuPDF fails, and grading inherently visual
student work (e.g., art creation, handwriting quality, diagrams). VLM must NOT be
used for any dimension assessable from extracted text.
Read split-part PDFs in numerical order. Files named _part1.pdf, _part2.pdf,
…, _partN.pdf MUST be read in sequence. Concatenate the extracted content in order.
Ignore _original.pdf files (these are the unsplit source).
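A sketch of numerically ordered part discovery; `ordered_parts` is a hypothetical helper showing why numeric sorting matters (plain string sorting would place _part10 before _part2):

```python
import re
from pathlib import Path

def ordered_parts(directory: str, stem: str) -> list:
    """Return {stem}_part1.pdf ... {stem}_partN.pdf in numerical order,
    ignoring {stem}_original.pdf. The part number is sorted as an integer,
    not as a string, so part10 correctly follows part2.
    """
    pattern = re.compile(rf"{re.escape(stem)}_part(\d+)\.pdf$")
    parts = []
    for path in Path(directory).iterdir():
        m = pattern.match(path.name)
        if m:
            parts.append((int(m.group(1)), path))
    return [p for _, p in sorted(parts)]
```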
Reference data informs level assignment. Grade reference students to compute empirical score ranges per level. Agents may use these ranges, along with rubric criteria and official level descriptors, to define level boundaries. There are no hard restrictions on the level division method.
Always set the year before running any operation. Either pass --year YEAR to
scripts or export GRADING_YEAR=YYYY at the start.
All report and commentary text must be written in Traditional Chinese (繁體中文). This applies to all narrative, feedback, analysis, and section content in generated DOCX reports. English is only permitted for: variable names, file paths, technical identifiers, chart axis labels, and JSON field names. Exception: if the subject being assessed is an English-language subject, English is used throughout.
{subject}-grading-workspaces/grading_workspace/
├── .github/skills/hkdse-{subject}-grading/SKILL.md
├── START.md
├── WARNING.md
├── env.txt # Local credentials (never commit)
├── env.txt.example # Template
├── pyproject.toml
├── uv.lock
├── start.sh
├── data/ # Symlink or mount to top-level data/
│ ├── reference/{subject}/{ref_year}/
│ │ ├── student_answers/level{L}_student{M}.pdf
│ │ ├── rubrics/ OR rubric_and_question/
│ │ ├── question/ (if separate)
│ │ └── reference_mapping.json
│ ├── masked_data/{subject}/{grade_year}/
│ │ ├── student_answers/student{N}.pdf
│ │ ├── rubrics/ OR rubric_and_question/
│ │ └── question/ (if separate)
│ └── groundtruth/{subject}/{grade_year}/groundtruth_mapping.json
├── rubric/{grade_year}/ # Generated rubric artifacts
│ ├── grading_guide.md
│ ├── reference_calibration.md
│ ├── reference_scores.json
│ ├── level_division.json
│ └── calibration/ # Intermediate calibration artifacts
│ └── (draft rubrics, reference grading outputs, score calculations)
├── rubric/reference_data_analysis/ # Insights from reference data (Phase 3)
│ ├── per_level_analysis.md # Observed patterns per level 1–5
│ ├── rubric_gaps.md # Gaps/ambiguities in official rubric
│ └── rubric_refinements.md # Supplementary rubric guidance
├── scripts/ # Python helper scripts
│ ├── generate_class_report.py
│ ├── generate_student_reports.py
│ ├── validate_extraction.py
│ ├── validate_grading_output.py
│ └── validate_reports.py
├── extracted/{grade_year}/ # Extracted student text
│ └── students/student{N}.txt
└── output/{grade_year}/ # Grading output
├── student{N}.json
└── final_scores.json
source env.txt
uv sync
export GRADING_YEAR={grade_year} # Set from env.txt or manually
Confirm the following exist:
data/masked_data/{subject}/{grade_year}/student_answers/
Rubric PDFs (rubrics/ and rubric_and_question/ directories)
data/reference/{subject}/{ref_year}/ with reference_mapping.json
Read BATCH_SIZE from env.txt (default: 5) for parallel sub-agent control.
Read ALL rubric PDFs from the masked data directory. Handle these patterns:
rubrics/rubrics.pdf — extract directly
rubrics/rubrics_part1.pdf … rubrics_partN.pdf — read ALL parts in numerical order, concatenate content
rubric_and_question/paper_part1.pdf … paper_partN.pdf — read ALL parts in order; both rubric criteria and question text are interleaved in these files
If rubrics/level_descriptors.pdf exists, ALWAYS read it — it contains official level-band boundaries critical for level assignment
Use PyMuPDF (fitz) for text extraction. Fall back to VLM only if text extraction yields empty or garbled content.
Note on rubric extraction granularity: For rubric/criteria PDFs (which are typically typeset/printed), extracting multiple pages at a time is acceptable. However, for extremely long rubric files (e.g., 17+ split parts), consider extracting in manageable batches of a few pages each to avoid truncation or quality degradation.
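The "empty or garbled" trigger for falling back to VLM can be approximated with a simple heuristic; the thresholds below are illustrative defaults, not rules from this skill:

```python
def needs_vlm_fallback(text: str, min_chars: int = 30,
                       min_printable_ratio: float = 0.85) -> bool:
    """Heuristic check on PyMuPDF output: fall back to VLM extraction when
    the text layer is effectively empty or dominated by replacement and
    control characters. Thresholds are illustrative, not mandated.
    """
    stripped = text.strip()
    if len(stripped) < min_chars:
        return True  # empty or near-empty text layer
    bad = sum(1 for ch in stripped
              if ch == "\ufffd" or (ord(ch) < 32 and ch not in "\n\t\r"))
    printable_ratio = 1 - bad / len(stripped)
    return printable_ratio < min_printable_ratio
```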
If the question paper is separate from rubrics:
question/paper.pdf — extract directly
question/paper_part1.pdf … paper_partN.pdf — read in order
Other PDFs under question/ — read ALL of them
Create rubric/{grade_year}/grading_guide.md with:
Read data/reference/{subject}/{ref_year}/reference_mapping.json to get the mapping
of student IDs to known levels (1–5).
Format:
{
"mappings": [
{"student_id": 1, "filename": "level3_student1.pdf", "level": 3, ...},
...
]
}
Note: Reference files keep their original level-based names (level{L}_student{M}.pdf).
The filename field in the mapping matches the actual file on disk. The level is also
encoded in the filename, making it easy for agents to identify which level each student
belongs to without parsing the JSON.
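Since the level is encoded in the filename, it can be parsed directly as a cross-check against the JSON; `parse_reference_filename` is a hypothetical helper:

```python
import re

def parse_reference_filename(filename: str) -> tuple:
    """Extract (level, student_number) from names like level3_student1.pdf.
    Useful for verifying the `level` field in reference_mapping.json agrees
    with the file on disk.
    """
    m = re.fullmatch(r"level([1-5])_student(\d+)\.pdf", filename)
    if m is None:
        raise ValueError(f"Unexpected reference filename: {filename}")
    return int(m.group(1)), int(m.group(2))
```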
Extract text from ALL reference student answer PDFs in
data/reference/{subject}/{ref_year}/student_answers/.
Read rubric PDFs from the REFERENCE year directory (structure may differ from the grading year). Apply the same extraction rules as Phase 2.
Grade ALL reference students using the same rubric and grading guide (from Phase 2).
For each reference student, apply the per-question mark allocation to produce a total
raw score and percentage — exactly as you would for a masked student. Record these
scores alongside their known levels from reference_mapping.json.
This step is critical: the resulting scores provide the empirical score-to-level mapping that replaces hard-coded boundaries. Launch sub-agents for reference students following the same rules as Phase 5 (one sub-agent per student, same model restriction).
Save reference grading results to rubric/{grade_year}/reference_scores.json:
{
"reference_year": "{ref_year}",
"scores": [
{"student_id": 1, "level": 3, "total_raw_score": 42, "total_max_score": 80, "percentage": 52.5},
{"student_id": 2, "level": 5, "total_raw_score": 72, "total_max_score": 80, "percentage": 90.0}
],
"level_score_ranges": {
"1": {"min_pct": 12.5, "max_pct": 22.0, "mean_pct": 17.3, "count": 2},
"2": {"min_pct": 28.0, "max_pct": 38.5, "mean_pct": 33.3, "count": 2},
"3": {"min_pct": 45.0, "max_pct": 55.0, "mean_pct": 50.0, "count": 2},
"4": {"min_pct": 62.0, "max_pct": 74.0, "mean_pct": 68.0, "count": 2},
"5": {"min_pct": 82.0, "max_pct": 95.0, "mean_pct": 88.5, "count": 2}
}
}
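The level_score_ranges block can be derived mechanically from the per-student scores; a sketch (rounding means to one decimal matches the example above but is otherwise an assumption):

```python
def level_score_ranges(scores: list) -> dict:
    """Aggregate graded reference students into per-level percentage ranges,
    matching the level_score_ranges shape in reference_scores.json.
    Each entry in `scores` needs "level" and "percentage" keys.
    """
    by_level = {}
    for s in scores:
        by_level.setdefault(s["level"], []).append(s["percentage"])
    return {
        str(level): {
            "min_pct": min(pcts),
            "max_pct": max(pcts),
            "mean_pct": round(sum(pcts) / len(pcts), 1),
            "count": len(pcts),
        }
        for level, pcts in sorted(by_level.items())
    }
```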
The rubric must not be finalised from the official PDF alone. The agent must learn from the reference student data to derive an effective grading rubric — one that actually discriminates between student levels as observed in practice. This is an iterative loop that continues until the rubric reliably separates levels.
This is especially important for Visual Arts and Chinese / English Writing, where the official rubric provides only broad bands and subjective judgment is required. For these subjects, the reference data analysis must be particularly thorough: document what distinguishes each level in observable, concrete terms.
All analytical outputs from this step are saved to rubric/reference_data_analysis/.
Iterative loop:
rubric/reference_data_analysis/per_level_analysis.md documenting
observed patterns for each level 1–5:
rubric/reference_data_analysis/rubric_gaps.md — gaps or
ambiguities found in the official rubric when applied to reference students
(e.g., criteria that fail to distinguish adjacent levels, missing guidance for
common student behaviours, mark schemes that reward or penalise inconsistently)
b. Derive refinements in rubric/reference_data_analysis/rubric_refinements.md
— supplementary rubric interpretations and practical guidance that resolve the
gaps above (e.g., clarifying what "adequate discussion" means at each level,
adding concrete examples, specifying how to handle partial credit)
c. Revise rubric/{grade_year}/grading_guide.md to incorporate these
refinements — the grading guide is not simply extracted from the official rubric
PDF; it must incorporate insights from the reference data analysis
d. Re-grade affected reference students with the improved rubric
e. Repeat from step 2 until the discrimination check passes
The final rubric/{grade_year}/grading_guide.md must incorporate insights from rubric/reference_data_analysis/ — it is not simply a transcription of the official rubric PDF.
Discrimination check criteria — verify ALL of the following:
Only proceed to Phase 4 (actual student grading) once the discrimination check
passes. Document the outcome in rubric/{grade_year}/reference_calibration.md.
Temp file organization: All intermediate artifacts from calibration — including
reference student grading outputs, draft rubrics, and intermediate score calculations
— must be saved under rubric/{grade_year}/calibration/ (a dedicated subdirectory).
Final artifacts (grading_guide.md, reference_calibration.md, reference_scores.json,
level_division.json) remain directly under rubric/{grade_year}/ as before.
Using the reference scores from Step 3.4, compute level boundaries to guide level assignment. One common approach is finding midpoints between adjacent levels' score ranges, but agents may use any reasonable method.
These boundaries are guidelines derived from reference data, not hard restrictions. They serve as a starting point for level assignment in Phase 7, where agents may refine them using additional information from the rubric and level descriptors.
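The midpoint approach mentioned above can be sketched as follows; this is one reasonable method, not a mandated formula, and the output follows the boundaries shape of level_division.json:

```python
def midpoint_boundaries(ranges: dict) -> dict:
    """Derive level boundaries as midpoints between adjacent levels'
    observed score ranges. `ranges` follows the level_score_ranges shape
    (string keys "1".."5", each with min_pct/max_pct).
    """
    boundaries = {}
    lower = 0.0
    for level in range(1, 6):
        if level < 5:
            # Boundary sits halfway between this level's max and the next level's min.
            upper = (ranges[str(level)]["max_pct"] + ranges[str(level + 1)]["min_pct"]) / 2
        else:
            upper = 100.0
        boundaries[f"level_{level}"] = {"min_percentage": lower, "max_percentage": upper}
        lower = upper
    return boundaries
```

Agents would then refine these raw midpoints using the rubric and level descriptors, as described above.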
Create rubric/{grade_year}/reference_calibration.md with TWO sections:
Section 1 — Qualitative Calibration: For each level (1–5), summarise what rubric criteria reference students at that level demonstrated:
Focus on the main marking dimensions for the subject. Include specific examples of how reference students at each level addressed key questions.
Section 2 — Quantitative Score Ranges: Include the empirical score-to-level mapping from Step 3.4:
This quantitative mapping provides useful guidance for level assignment in Phases 5 and 7. Sub-agents and level division may use these ranges alongside rubric information and level descriptors to determine levels from scores.
CRITICAL: Student answer extraction MUST be done 1 page at a time to ensure extraction quality. Do NOT batch multiple pages into a single extraction call.
For each student in data/masked_data/{subject}/{grade_year}/student_answers/:
Save the concatenated page extractions to extracted/{grade_year}/students/student{N}.txt
Note: This 1-page-at-a-time rule applies to student answer PDFs. For rubric/criteria PDFs (Phase 2), multi-page extraction is acceptable since those are typeset documents with cleaner formatting.
Confirm all student text files exist and have non-trivial content. Flag any students whose extraction may be incomplete.
Write a script scripts/validate_extraction.py that:
Checks that all extracted/{grade_year}/students/student{N}.txt files exist and have non-trivial content
Run the script and confirm all students pass before proceeding to Phase 5. If any fail, re-run extraction for those students.
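A minimal sketch of what validate_extraction.py might check; the 50-character threshold is an illustrative stand-in for "non-trivial content":

```python
from pathlib import Path

def validate_extraction(extracted_dir: str, student_ids: list,
                        min_chars: int = 50) -> list:
    """Return the IDs of students whose extracted text file is missing or
    suspiciously short. An empty return list means all students pass.
    """
    failing = []
    for n in student_ids:
        path = Path(extracted_dir) / f"student{n}.txt"
        if not path.exists() or len(path.read_text(encoding="utf-8").strip()) < min_chars:
            failing.append(n)
    return failing
```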
For each student, launch a dedicated sub-agent that:
Reads rubric/{grade_year}/grading_guide.md
Reads rubric/{grade_year}/reference_calibration.md (both qualitative and
quantitative sections — including score-to-level ranges)
Anti-bias note: The reference_calibration.md score ranges are empirical data from one prior year. Do NOT treat them as fixed cutoffs. The preliminary_level should reflect the quality of the student's work against rubric criteria. If the student's score falls slightly outside a reference range but clearly matches a level's qualitative description, assign that level and note the reasoning.
Reads the student's extracted text (extracted/{grade_year}/students/student{N}.txt)
Grades EVERY question/sub-question against the rubric — focus on accurate per-question mark allocation first, without constraining scores to fit a predetermined level
Computes total score and percentage from per-question marks
Derives preliminary_level using the reference score ranges and any other
available level information (rubric, level descriptors, reference calibration)
Outputs a JSON file to output/{grade_year}/student{N}.json
IMPORTANT: Sub-agents must NOT use hard score restrictions to constrain their grading. The correct approach is: grade each question purely on rubric merit → sum the scores → map the total to a level using reference score ranges. Do NOT adjust per-question marks to force a particular level outcome.
Control parallelism with BATCH_SIZE — launch up to BATCH_SIZE sub-agents at a time.
Each output/{grade_year}/student{N}.json MUST follow this schema:
{
"student_id": N,
"questions": [
{
"question_id": "Q1a",
"max_marks": 4,
"awarded_marks": 3.0,
"reasoning": "Brief explanation of mark allocation",
"evidence": "Relevant quote or reference from student answer"
}
],
"total_raw_score": 45,
"total_max_score": 60,
"percentage": 75.0,
"preliminary_level": 4,
"level_reasoning": "Based on reference calibration, this student's performance aligns with Level 4 characteristics: ..."
}
Fields:
student_id: Integer student number
questions: Array of per-question grading objects
  question_id: String identifier (e.g., "Q1a", "Q2bii")
  max_marks: Maximum marks for this question
  awarded_marks: Marks awarded (float, can be 0.5 increments if rubric allows)
  reasoning: Why these marks were awarded
  evidence: Supporting evidence from the student's answer
total_raw_score: Sum of all awarded_marks
total_max_score: Sum of all max_marks
percentage: (total_raw_score / total_max_score) × 100
preliminary_level: Level (1–5) derived using the reference score ranges and available level information. This is a best-fit estimate
level_reasoning: Explanation for the level assignment, referencing the score range it falls into (e.g., "Score 52.5% falls within Level 3 reference range 45.0%–55.0%")
After each sub-agent completes:
Write a script scripts/validate_grading_output.py that:
Validates every output/{grade_year}/student{N}.json file
Checks required fields (student_id, questions, total_raw_score, total_max_score, percentage, preliminary_level), value ranges (preliminary_level ∈ [1,5], percentage ∈ [0,100]), and that total_raw_score equals the sum of all awarded_marks
Run the script. Re-run the sub-agent for any failing students. Re-run the script until all pass.
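The per-record checks could look like this; `validate_student_result` is a hypothetical core for validate_grading_output.py, using a small float tolerance for the awarded-marks sum:

```python
def validate_student_result(result: dict) -> list:
    """Return a list of problems found in one student{N}.json payload;
    an empty list means the record passes.
    """
    required = ("student_id", "questions", "total_raw_score",
                "total_max_score", "percentage", "preliminary_level")
    missing = [f for f in required if f not in result]
    if missing:
        return [f"missing fields: {missing}"]
    problems = []
    if not 1 <= result["preliminary_level"] <= 5:
        problems.append("preliminary_level out of range [1, 5]")
    if not 0 <= result["percentage"] <= 100:
        problems.append("percentage out of range [0, 100]")
    # Float tolerance avoids spurious failures from 0.5-mark increments.
    awarded = sum(q["awarded_marks"] for q in result["questions"])
    if abs(awarded - result["total_raw_score"]) > 1e-6:
        problems.append("total_raw_score != sum of awarded_marks")
    return problems
```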
Read all output/{grade_year}/student{N}.json files and compile into
output/{grade_year}/final_scores.json:
{
"subject": "{subject}",
"year": "{grade_year}",
"total_students": N,
"max_possible_score": 60,
"students": [
{
"student_id": 1,
"total_raw_score": 45,
"total_max_score": 60,
"percentage": 75.0,
"preliminary_level": 4
}
],
"statistics": {
"mean_score": 42.5,
"median_score": 43.0,
"std_dev": 8.2,
"min_score": 20,
"max_score": 58,
"mean_percentage": 70.8
}
}
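The statistics block can be computed with the standard library; whether std_dev is sample or population is not specified by this skill, so the sample version (statistics.stdev) is an assumption here:

```python
import statistics

def score_statistics(raw_scores: list, max_score: float) -> dict:
    """Compute the statistics block of final_scores.json from per-student
    raw scores. Rounding to one decimal matches the example schema.
    """
    return {
        "mean_score": round(statistics.mean(raw_scores), 1),
        "median_score": round(statistics.median(raw_scores), 1),
        "std_dev": round(statistics.stdev(raw_scores), 1) if len(raw_scores) > 1 else 0.0,
        "min_score": min(raw_scores),
        "max_score": max(raw_scores),
        "mean_percentage": round(statistics.mean(raw_scores) / max_score * 100, 1),
    }
```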
Agents may use the extracted reference data (from Phase 3) to inform level division. The following sources are available — agents should consider all relevant information and exercise judgment:
Reference scores (rubric/{grade_year}/reference_scores.json) — the empirical score ranges from grading reference students in Phase 3. These provide an empirical basis for mapping scores to levels.
Official level descriptors (level_descriptors.pdf if available) — these describe the qualitative characteristics expected at each level
Reference calibration (rubric/{grade_year}/reference_calibration.md) — contains both qualitative descriptions and quantitative score ranges per level
There are no hard restrictions on how level boundaries must be computed. Agents should use the available reference data, rubric information, and level descriptors to define reasonable level boundaries that reflect the subject's scoring patterns.
⚠️ Reference Year Bias Prevention:
The reference year's score ranges are a starting point for calibration only, not fixed thresholds to be carried forward. Before finalising level boundaries, the agent must:
level_division.json must include an "adjustment_notes" field. It must either explain any deviation from raw reference boundaries and the reasoning behind the adjustment, or explicitly state "No adjustment; current cohort distribution aligns with reference year ranges."
For each student, map their total score to a level (1–5) using the reference-derived
boundaries from Step 7.1. Save to rubric/{grade_year}/level_division.json:
{
"method": "reference_score_based",
"reference_year": "{ref_year}",
"boundaries": {
"level_1": {"min_percentage": 0, "max_percentage": 27.5},
"level_2": {"min_percentage": 27.5, "max_percentage": 41.75},
"level_3": {"min_percentage": 41.75, "max_percentage": 60.0},
"level_4": {"min_percentage": 60.0, "max_percentage": 78.0},
"level_5": {"min_percentage": 78.0, "max_percentage": 100}
},
"reference_score_ranges": {
"level_1": {"min_pct": 12.5, "max_pct": 22.0, "mean_pct": 17.3},
"level_2": {"min_pct": 28.0, "max_pct": 38.5, "mean_pct": 33.3},
"level_3": {"min_pct": 45.0, "max_pct": 55.0, "mean_pct": 50.0},
"level_4": {"min_pct": 62.0, "max_pct": 74.0, "mean_pct": 68.0},
"level_5": {"min_pct": 82.0, "max_pct": 95.0, "mean_pct": 88.5}
},
"adjustment_notes": "No adjustment; current cohort distribution aligns with reference year ranges.",
"students": [
{
"student_id": 1,
"total_raw_score": 45,
"percentage": 75.0,
"preliminary_level": 4,
"final_level": 4,
"level_reasoning": "Score 75.0% falls within Level 4 reference range (62.0%–74.0%), closest to Level 4 mean (68.0%)"
}
]
}
Note: The boundary values above are examples. Agents should derive appropriate boundaries from the available reference data and rubric information.
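Mapping a percentage onto such boundaries can be sketched as a lookup over half-open intervals; `assign_level` is illustrative and treats exactly 100% as Level 5:

```python
def assign_level(percentage: float, boundaries: dict) -> int:
    """Map a percentage to a level 1-5 using boundaries shaped like
    level_division.json ("level_1".."level_5" with min/max_percentage).
    Intervals are half-open [min, max), except Level 5 also includes
    its maximum so a perfect score is covered.
    """
    for level in range(1, 6):
        b = boundaries[f"level_{level}"]
        if b["min_percentage"] <= percentage < b["max_percentage"]:
            return level
        if level == 5 and percentage == b["max_percentage"]:
            return 5
    raise ValueError(f"percentage {percentage} not covered by boundaries")
```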
Compare the level distribution of graded students against the reference year's distribution. Flag any significant discrepancies (e.g., if the reference year had students at all 5 levels but the graded year has none at Level 1).
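A coarse version of this comparison, flagging only presence/absence mismatches per level (`distribution_gaps` is a hypothetical helper):

```python
from collections import Counter

def distribution_gaps(reference_levels: list, graded_levels: list) -> list:
    """Flag levels present in the reference year but absent in the graded
    cohort, and vice versa. A coarse discrepancy check, not a hard gate.
    """
    ref, cur = Counter(reference_levels), Counter(graded_levels)
    flags = []
    for level in range(1, 6):
        if ref[level] > 0 and cur[level] == 0:
            flags.append(f"Level {level}: present in reference year but absent in graded cohort")
        if cur[level] > 0 and ref[level] == 0:
            flags.append(f"Level {level}: present in graded cohort but absent in reference year")
    return flags
```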
data/masked_data/ict/2025/
  rubrics/rubrics.pdf (single file)
  question/paper_part1.pdf, question/paper_part2.pdf
data/reference/ict/2024/
  rubrics/rubrics_part1.pdf … rubrics_part17.pdf (17 split parts)
  question/paper.pdf
data/masked_data/music/2024/
  rubric_and_question/paper_part1.pdf … paper_part21.pdf
  No rubrics/ or question/ directories
data/reference/music/2025/
  rubric_and_question/paper_part1.pdf … paper_part24.pdf
data/masked_data/religion/2025/
  rubrics/rubrics_part1.pdf … rubrics_part7.pdf (7 split parts)
  question/paper.pdf
data/reference/religion/2024/
  rubrics/rubrics_part1.pdf … rubrics_part7.pdf (7 split parts)
  question/paper.pdf
data/masked_data/tourism/2025/
  rubrics/rubrics.pdf (main mark scheme) AND rubrics/level_descriptors.pdf (official level band descriptors)
  ALWAYS read level_descriptors.pdf — it contains the official level boundaries
  question/paper.pdf
data/reference/tourism/2024/
  rubrics/rubrics.pdf AND rubrics/level_descriptors.pdf
  question/paper.pdf
data/masked_data/visual-arts/2020/
  rubrics/rubrics.pdf (single file)
  question/ directory — files may have Chinese filenames (e.g., 2020_視藝_P1QAB.pdf, 2020_視藝_P2QAB.pdf); read ALL PDFs in this directory
data/reference/visual-arts/2025/
  rubrics/rubrics_part1.pdf … rubrics_part5.pdf (5 split parts)
  question/paper_part1.pdf … paper_part5.pdf (5 split parts)
data/masked_data/biology/2025/
  rubrics/rubrics.pdf (single file)
  question/paper.pdf
data/reference/biology/2024/
  rubrics/rubrics.pdf
  question/paper.pdf
  reference_mapping.json available
data/masked_data/economics/2024/
  rubrics/marking_scheme.pdf and rubrics/general_rubrics.pdf (two files)
  question/paper1.pdf and question/paper2.pdf (two papers)
data/reference/economics/2025/
  rubrics/general_rubrics.pdf
  question/paper1.pdf and question/paper2.pdf (two papers)
  reference_mapping.json available
After level division, review the overall score distribution:
Randomly select 2–3 students and manually verify their grading:
Compare students at the same level:
Confirm all output files are complete and valid:
All student{N}.json files exist with valid JSON
final_scores.json contains all students with correct statistics
rubric/{grade_year}/level_division.json has valid boundaries and all student assignments
GRADING_YEAR env var set
Dependencies installed (uv sync)
Reference scores recorded (reference_scores.json)
Final scores compiled (final_scores.json)
Level division saved (rubric/{grade_year}/level_division.json)