Master skill for Recursive Language Model thinking — orchestrates long-context reasoning by treating prompts as environments, not inputs. Spawns sub-agents and manages recursive decomposition.
Your core shift: The prompt is not input to read. The prompt is an environment to explore.
You are the Manager in a Mini-Model Economy. You plan the reconnaissance, spawn the subcommittees, and aggregate the signal from the noise.
Without RLM:
User Query + Massive Context → Stuff it all in → Hope for the best → Context Rot → Wrong answer

With RLM:
User Query → Store context as ENVIRONMENT → Probe structure → Identify relevant chunks → Spawn focused sub-queries → Aggregate findings → Correct answer
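The RLM flow can be sketched end to end. This is a minimal illustration, not the skill's prescribed implementation: `llm_query` is the sub-agent call used throughout this skill, passed in here as a parameter so the sketch stays self-contained, and the fixed-size slicing stands in for the real decomposition phase described below.

```python
def rlm_answer(query, context, llm_query, chunk_size=2000):
    """Probe -> decompose -> delegate -> aggregate, in miniature."""
    # Decompose: never hand the whole context to a single call
    chunks = [context[i:i + chunk_size]
              for i in range(0, len(context), chunk_size)]
    # Delegate: one focused sub-query per chunk
    findings = [llm_query(f"QUERY: {query}\nSECTION:\n{c}") for c in chunks]
    # Aggregate: the final call sees only findings, never the raw context
    return llm_query(f"QUERY: {query}\nFINDINGS:\n" + "\n".join(findings))
```

Note that the synthesis call receives only the per-chunk findings, so its context stays small no matter how large the original input was.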
| Trigger | Condition | Action |
|---|---|---|
| Large Context | Input > 50K tokens (or approaching limits) | Switch to RLM mode |
| Information Dense | Query requires synthesizing many parts | Decompose and delegate |
| Needle-in-Haystack | Finding specific info in massive text | Reconnaissance first |
| Multi-Hop Reasoning | Answer requires connecting 2+ distant facts | Parallel sub-queries |
| Aggregation Tasks | Counting, comparing, listing across data | Chunk and map-reduce |
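The Large Context trigger from the table can be approximated with a simple gate. A sketch: the 50K-token threshold comes from the table, while the rough 4-characters-per-token ratio is an assumption of this example.

```python
def should_use_rlm(context: str, approx_chars_per_token: int = 4,
                   token_limit: int = 50_000) -> bool:
    """Heuristic gate: switch to RLM mode when the estimated token
    count exceeds the Large Context threshold from the trigger table."""
    est_tokens = len(context) // approx_chars_per_token
    return est_tokens > token_limit
```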
Goal: Understand the shape of the data without reading all of it.
Before consuming content into your precious context window:
Related Skill: See rlm-context-scout/SKILL.md for detailed reconnaissance techniques.
Example Reconnaissance:
```python
# Don't read everything. Understand the shape first.
print(f"Total length: {len(context)} characters")
print(f"First 500 chars: {context[:500]}")

# Count blank-line separators (likely section breaks). Hoisted out of the
# f-string: backslash escapes inside f-strings are a SyntaxError before
# Python 3.12.
section_breaks = context.count("\n\n")
print(f"Number of double-newlines (likely sections): {section_breaks}")

# Find structural markers
import re
headers = re.findall(r"^#+\s+.+", context, re.MULTILINE)
print(f"Found {len(headers)} markdown headers")
```
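Reconnaissance pays off most when the probe results are kept as an index for later phases. A sketch along those lines (the helper name `map_sections` is illustrative, not part of the skill): it records each header's character offset so sub-queries can slice out one section at a time.

```python
import re

def map_sections(context: str):
    """Index markdown headers by character offset so later phases can
    slice out a single section instead of re-reading the whole context."""
    matches = list(re.finditer(r"^#+\s+.+", context, re.MULTILINE))
    sections = []
    for i, m in enumerate(matches):
        start = m.start()
        end = matches[i + 1].start() if i + 1 < len(matches) else len(context)
        sections.append((m.group().strip(), start, end))
    return sections
```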
Goal: Break the problem into chunks that can be processed with full attention (avoiding Context Rot).
Key Principles:
Decomposition Strategies:
| Strategy | When to Use | Example |
|---|---|---|
| Semantic Chunking | Structured documents | Split by headers, chapters, sections |
| Fixed Chunking | Unstructured text | Split into N equal chunks |
| Targeted Extraction | Needle-in-haystack | Use regex/keywords to filter first |
| Hierarchical | Very large inputs | First-pass summary → second-pass detail |
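The Fixed Chunking row can be sketched in a few lines. The small overlap between slices is an implementation choice of this example, not part of the table: it keeps facts that straddle a chunk boundary intact in at least one chunk.

```python
def fixed_chunks(text: str, size: int = 400_000, overlap: int = 2_000):
    """Fixed chunking for unstructured text: equal-size slices with a
    small overlap so boundary-straddling facts survive whole."""
    step = size - overlap
    return [text[i:i + size] for i in range(0, len(text), step)]
```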
Example Sub-Query Pattern:
```python
import re

# Break into semantic chunks at level-2+ markdown headers
chunks = re.split(r"\n#{2,}\s+", context)

# Process each chunk with focused attention
findings = []
for i, chunk in enumerate(chunks):
    finding = llm_query(f"""
You are analyzing section {i + 1} of {len(chunks)}.

QUERY: {original_query}

SECTION CONTENT:
{chunk}

Extract any information relevant to the query. If nothing relevant, say "No relevant information."
Be concise but complete.
""")
    findings.append(finding)
```
Goal: Combine sub-query results into a coherent, verified final answer.
Aggregation Patterns:
Map-Reduce: When counting, listing, or comparing
```python
final = llm_query(f"""
You have received findings from {len(findings)} document sections.

FINDINGS:
{chr(10).join(findings)}

ORIGINAL QUERY: {original_query}

Synthesize these findings into a complete answer.
If findings conflict, note the conflict.
If information is incomplete, note what's missing.
""")
```
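When the task is pure counting, the reduce step can be deterministic code rather than another LLM call. A sketch, assuming the sub-query prompt asked each section to report a `count: <n>` line (that format is an assumption of this example, not part of the skill):

```python
import re

def reduce_counts(findings):
    """Sum per-chunk counts like 'count: 7' reported by sub-queries.
    Findings without a count line (e.g. 'No relevant information')
    contribute zero."""
    total = 0
    for f in findings:
        m = re.search(r"count:\s*(\d+)", f)
        if m:
            total += int(m.group(1))
    return total
```

A deterministic reduce like this cannot hallucinate a total, which matters for aggregation tasks where arithmetic accuracy is the whole point.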
Verification Loop: When accuracy is critical
```python
# First aggregation
answer = llm_query(f"Combine findings: {findings}")

# Verification with smaller, focused context
verified = llm_query(f"""
PROPOSED ANSWER: {answer}

KEY EVIDENCE: {relevant_chunks}

Verify this answer is correct based on the evidence.
If incorrect, provide the correct answer.
""")
```
Variable Accumulation: When building long outputs
```python
accumulated = []
for chunk in chunks:
    processed = llm_query(f"Process: {chunk}")
    accumulated.append(processed)

# Return the accumulated variable, not a new synthesis
FINAL_VAR(accumulated)
```
Not all work requires the biggest brain. Deploy cheaper models for the high-volume, lower-stakes work:
| Task Type | Model Tier | Why |
|---|---|---|
| Orchestration | Highest (GPT-5 class) | Strategic decisions, complex synthesis |
| Chunk Analysis | Medium (GPT-4 class) | Per-section processing, good enough |
| Simple Extraction | Smallest (Mini class) | Regex-like tasks, keyword search |
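The tier table can be wired up as a simple dispatch. A sketch with placeholder tier names rather than real model identifiers:

```python
# Tier names are illustrative placeholders, not real model identifiers.
MODEL_TIERS = {
    "orchestration": "highest",   # strategic decisions, complex synthesis
    "chunk_analysis": "medium",   # per-section processing, good enough
    "extraction": "smallest",     # regex-like tasks, keyword search
}

def pick_model(task_type: str) -> str:
    """Route a task to the cheapest adequate tier per the table,
    defaulting to the safest (highest) tier for unknown task types."""
    return MODEL_TIERS.get(task_type, "highest")
```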
Cost Optimization Rules:
When you've completed the recursive process:
- FINAL(your answer here) — when you can state the answer directly
- FINAL_VAR(variable_name) — when you've built up the answer in a REPL variable

Critical: Do NOT output FINAL() until you are truly done. Don't confuse plans with answers.
```python
# BAD: Just dump it all in
answer = llm_query(f"Here's 5 million characters: {entire_document}. Answer: {query}")
# This WILL cause Context Rot
```

```python
# BAD: 1000 LLM calls for 1000 lines
for line in lines:  # 1000 lines
    result = llm_query(f"Classify: {line}")  # 1000 calls = $$$ and slow

# GOOD: 5 LLM calls for 1000 lines
chunk_size = 200
for i in range(0, len(lines), chunk_size):
    batch = "\n".join(lines[i:i + chunk_size])
    result = llm_query(f"Classify each line:\n{batch}")
```

```python
# BAD: Returning answer from "memory" instead of the accumulated variable
# (This caused failures in the OOLONG-Pairs benchmark)
FINAL("The answer is probably X")  # Wrong!

# GOOD
FINAL_VAR(accumulated_answer)  # Right — use what you actually computed
```
This skill works in concert with:
| Skill | Purpose | When to Reference |
|---|---|---|
| rlm-context-scout/SKILL.md | Deep dive on reconnaissance techniques | Phase 1 (probing, filtering) |
| rlm-repl-environment/SKILL.md | REPL setup and code patterns | Technical implementation |
Skill Loop Pattern: When implementing RLM thinking:
- Load rlm-context-scout for reconnaissance details
- Load rlm-repl-environment for code patterns

```
┌───────────────────────────────────────────────────────────────┐
│                     RLM ORCHESTRATOR FLOW                     │
├───────────────────────────────────────────────────────────────┤
│                                                               │
│  1. RECOGNIZE THE PATTERN                                     │
│     → Is context large? Is task complex? → Activate RLM mode  │
│                                                               │
│  2. RECONNAISSANCE (Don't read — probe)                       │
│     → Sample, count, pattern-match                            │
│     → Build mental map of data structure                      │
│                                                               │
│  3. DECOMPOSE (Divide the problem)                            │
│     → Semantic chunks? Fixed chunks? Targeted extraction?     │
│     → Each chunk < 500K chars                                 │
│                                                               │
│  4. DELEGATE (Spawn sub-queries)                              │
│     → Clear, focused prompts                                  │
│     → Return high-signal only                                 │
│                                                               │
│  5. AGGREGATE (Synthesize findings)                           │
│     → Combine results                                         │
│     → Verify if critical                                      │
│     → Use FINAL_VAR for accumulated answers                   │
│                                                               │
│  REMEMBER: Context is an ENVIRONMENT, not an INPUT.           │
│                                                               │
└───────────────────────────────────────────────────────────────┘
```
"Data-processing systems with a small but fast main memory can process far larger datasets by cleverly managing how data is fetched into memory." — The RLM Paper, on Out-of-Core Algorithms
RLMs apply this systems principle to language model reasoning:
The fundamental insight: An RLM has strictly more representation capacity than an LLM. It can always degrade to a simple LLM call if needed, but it can also scale to handle 10M+ tokens that would be impossible otherwise.
The practical outcome: 91% accuracy on 11M-token tasks where SOTA models score 0%.
When you face the impossible — think recursively.