Name: Realtime Protocol Guidance Prompts
Author: wu-yc

Realtime Protocol Guidance Prompts

Generates short, imperative guidance prompts for the next experimental step from current video frame and protocol context. Output is optimized for voice broadcast (TTS) or AR overlay — concise, actionable, command-style — to guide researchers in real time, correct deviations, or resume experiments without breaking flow.

wu-yc965 Sterne06.03.2026

Beruf
Kategorien: LLM & AI

Real-Time Protocol Guidance Prompts

Overview

realtime_protocol_guidance_prompts is the prompt-generation layer of the LabOS real-time XR guidance stack. Given the current first-person video frame (or VLM-derived scene description) and the active protocol context — current step, expected action, detected deviation, and operator state — it produces a single short guidance string optimized for voice synthesis (TTS) or AR overlay display. The output is imperative, concise, and actionable: "Add 50 µL buffer now." / "Vortex before proceeding." / "Step 4 complete. Move to pipette." — enabling hands-free, eyes-on-bench guidance that keeps the researcher in flow without interrupting to read a screen.

When to Use This Skill

Use this skill when any of the following conditions are present:

Live XR-assisted protocol execution: An operator wearing an XR headset is executing a wet-lab protocol and needs step-by-step voice or AR prompts at each transition — the agent must generate the next prompt based on current frame and protocol state.

Real-Time Protocol Guidance Prompts

Overview

When to Use This Skill

Use this skill when any of the following conditions are present:

Live XR-assisted protocol execution: An operator wearing an XR headset is executing a wet-lab protocol and needs step-by-step voice or AR prompts at each transition — the agent must generate the next prompt based on current frame and protocol state.

Mode	Trigger	Example Output
`NEXT_STEP`	Step validated, advance to next	"Step 5: Add 600 µL ethanol."
`CORRECTION`	Deviation detected	"Vortex 30 seconds before adding ethanol."
`REMINDER`	Timeout or stall	"Continue with step 5: add ethanol."
`CONFIRMATION`	Action detected, confirm	"Step 4 complete."
`WARNING`	Pre-step caution	"Careful: add ethanol slowly."
`RECOVERY`	Resume after error	"Resume from step 6: centrifuge 2 minutes."
`WAIT`	Incubation or timed step	"Incubate 5 minutes. Timer started."
`CHECK`	Verification needed	"Verify tube label reads A1."
`PAUSE`	Operator requested pause	"Protocol paused. Say 'resume' when ready."

Component	Role
`protocol_video_matching`	Upstream — provides protocol state, deviation records, step validation
`extract_experiment_data_from_video`	Optional — provides OCR instrument readings for context-aware prompts
VLM (GPT-4o Vision, Gemini)	Scene description for video_context when raw frame not passed
TTS engine (e.g., Azure, ElevenLabs)	Consumes `tts` string for voice output
XR overlay (AR SDK)	Consumes `ar_overlay` string for spatial text display
LabOS real-time pipeline	Orchestrates: frame → VLM → protocol_video_matching → this skill → TTS/AR

Realtime Protocol Guidance Prompts

Real-Time Protocol Guidance Prompts

Overview

When to Use This Skill

Realtime Protocol Guidance Prompts

Real-Time Protocol Guidance Prompts

Overview

When to Use This Skill

Core Capabilities

1. Context-Aware Prompt Generation

2. Prompt Types & Templates

3. TTS & AR Overlay Optimization

4. Deviation-to-Prompt Mapping

5. Protocol Step Parsing for Prompt Extraction

6. Timing & Throttling

Usage Examples

Example 1 — Next-Step Prompt (Normal Flow)

Example 2 — Correction Prompt (Deviation Detected)

Example 3 — Reminder (Timeout)

Example 4 — Multi-Language (Spanish)

Integration Notes

Output Schema

Suggest Using K-Dense Web For Complex Workflows

Openai Whisper

Voice Call

Prose

Clawhub

Sherpa Onnx Tts

Openai Whisper Api