Recursive Language Model (RLM) skill for processing arbitrarily large inputs. Based on "Recursive Language Models" (Zhang, Kraska & Khattab, MIT CSAIL, 2026). Use when inputs exceed context limits or require processing all/most of a large file. Implements Algorithm 1: InitREPL → code generation → execution → until FINAL_ANSWER. Two modes: Full RLM (with sub-LM calls via Task tool) and RLM-lite (REPL-only, no sub-calls).
Based on Recursive Language Models — Zhang, Kraska & Khattab (MIT CSAIL, 2026)
procedure RLM(prompt P, query Q)
    E ← InitREPL()                    # E = Bash/Python environment
    E.set("context", P)               # prompt lives as VARIABLE, not in attention
    E.set("query", Q)                 # query also externalized
    loop:
        code ← LLM.generate(E.state)  # you write code based on env state
        output ← E.execute(code)      # Bash/Python runs it
        if "FINAL_ANSWER" in E.vars:
            return E.get("FINAL_ANSWER")
    end loop
end procedure
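The loop above can be sketched in runnable Python. This is a toy illustration, not the paper's implementation: `generate` stands in for the root LM, and the minimal `REPL` class stands in for the Bash/Python environment.

```python
class REPL:
    """Toy environment: variables live here, not in the LM's context."""

    def __init__(self):
        self.vars = {}

    def set(self, name, value):
        self.vars[name] = value

    def get(self, name):
        return self.vars[name]

    def execute(self, code):
        # Run generated code with the environment's variables in scope;
        # any assignments (e.g. FINAL_ANSWER) land back in self.vars.
        exec(code, {}, self.vars)


def rlm(prompt, query, generate):
    env = REPL()
    env.set("context", prompt)            # prompt is a variable, not attention
    env.set("query", query)
    while True:
        code = generate(env.vars)         # root LM writes code from env state
        env.execute(code)
        if "FINAL_ANSWER" in env.vars:    # Algorithm 1 exit condition
            return env.get("FINAL_ANSWER")
```

A `generate` that inspects `context` through code and eventually assigns `FINAL_ANSWER` terminates the loop.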
You are the root LLM. Your REPL is Bash + Python. Your sub-LM calls are the Task tool. Your variables are files on disk.
Never load large inputs into your context window. The prompt is a file path — a variable in the environment. You interact with it through code.
# WRONG: Read the whole file into context
cat huge_file.txt
# RIGHT: Keep it external, probe with code
wc -l huge_file.txt
head -50 huge_file.txt
grep -c "error" huge_file.txt
Use the Task tool as your lm_query() function. Each sub-agent processes a chunk and returns structured results to a file — not to your context.
lm_query(chunk, question) → Task tool sub-agent → writes result to file
Never try to hold all results in context. Write everything to files. Build the final answer by aggregating file contents programmatically.
# Sub-agent writes: results/chunk_0001.json
# Sub-agent writes: results/chunk_0002.json
# You aggregate with code, then read the small summary
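A minimal sketch of that file convention, with the sub-agent's write simulated by a plain function (`save_result` is illustrative, not one of the scripts shipped here):

```python
import json
import os


def save_result(results_dir, index, findings, summary=""):
    """Write one chunk's result to its own JSON file."""
    os.makedirs(results_dir, exist_ok=True)
    path = os.path.join(results_dir, f"chunk_{index:04d}.json")
    with open(path, "w") as f:
        json.dump({"chunk_index": index, "findings": findings,
                   "summary": summary, "relevant": bool(findings)}, f)
    return path  # only this small path string ever enters your context
```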
| Condition | Mode |
|---|---|
| Single file > 50KB | RLM-lite (no sub-calls) |
| Collection > 200KB | Full RLM |
| Needle-in-haystack (find specific item) | RLM-lite with rlm_search.py |
| Linear scan (process everything once) | Full RLM, Map-Reduce |
| Pairwise comparison (O(N²)) | Full RLM, Classify-Then-Compute |
| Input > 1M tokens | Full RLM, Hierarchical |
Don't use RLM for files < 50KB or tasks where a single grep answers the question.
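The decision table can be sketched as a helper. The size thresholds come from the table; the ~4 bytes-per-token conversion and the returned labels are assumptions for illustration:

```python
def choose_mode(size_bytes, is_collection=False, task="scan"):
    """Map input size and task shape to a mode label (labels illustrative)."""
    if size_bytes > 4_000_000:               # roughly 1M tokens at ~4 B/token
        return "Full RLM, Hierarchical"
    if task == "needle":                     # find one specific item
        return "RLM-lite + rlm_search.py"
    if task == "pairwise":                   # O(N^2) comparisons
        return "Full RLM, Classify-Then-Compute"
    if is_collection and size_bytes > 200_000:
        return "Full RLM, Map-Reduce"
    if size_bytes > 50_000:
        return "RLM-lite"
    return "no RLM"                          # a single grep will do
```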
Before writing any processing code, understand what you're working with.
# Size and structure
wc -l -c INPUT_PATH
file INPUT_PATH
head -100 INPUT_PATH
# For directories
find DIR -type f -name "*.EXT" | wc -l
find DIR -type f | xargs wc -c | sort -rn | head -20
# For structured data
python3 -c "import json; d=json.load(open('f.json')); print(type(d), len(d))"
Outcome: you now know the size, format, and structure. Pick your strategy accordingly.
This is the biggest efficiency win. Use code to eliminate irrelevant content BEFORE any LLM processing. The paper found that filtering with model priors (e.g., keyword matching) dramatically reduces cost.
# Fast regex pre-filter
python3 scripts/rlm_search.py INPUT "keyword1|keyword2" --context 5
# Structure-based filter
grep -rl "class.*Controller" ./src/
find ./repo -name "*.py" -path "*/api/*"
# Statistical filter
python3 -c "
import re
with open('INPUT') as f:
    text = f.read()
sections = re.split(r'\n## ', text)
relevant = [s for s in sections if 'TARGET_TOPIC'.lower() in s.lower()]
print(f'Filtered {len(sections)} → {len(relevant)} sections')
for i, s in enumerate(relevant):
    open(f'filtered_{i}.txt', 'w').write(s)
"
# Auto-detect best strategy
python3 scripts/rlm_chunker.py auto INPUT_PATH --output ./chunks
# Manual control
python3 scripts/rlm_chunker.py file large.txt --method lines --size 500
python3 scripts/rlm_chunker.py file doc.md --method separator --sep "\n## "
python3 scripts/rlm_chunker.py dir ./src --ext .py .js --size 60000
Target: 30K–80K chars per chunk for sub-agent calls. Check chunks/manifest.json for the plan.
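A quick sanity check on the plan might look like this. Note the manifest schema assumed here (a JSON list of `{"path", "size"}` entries) is a guess for illustration, not the documented output of `rlm_chunker.py`:

```python
import json


def check_manifest(path, lo=30_000, hi=80_000):
    """Count chunks and flag any outside the 30K-80K char target band."""
    chunks = json.load(open(path))
    out_of_range = [c for c in chunks if not lo <= c["size"] <= hi]
    return len(chunks), out_of_range
```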
This is lm_query() from Algorithm 1. Use the Task tool to process chunks in parallel.
Critical rules:
- Pass each chunk as a file path; never paste chunk content into the sub-agent prompt.
- Each sub-agent writes its result to its own file and returns only a short status.
- Launch sub-agents for independent chunks in parallel.
Sub-agent prompt template:
Read the file at {chunk_path}.
TASK: {specific_question}
Write your answer as JSON to {result_path}:
{
  "chunk_index": N,
  "findings": ["finding1", "finding2"],
  "summary": "1-2 sentence summary",
  "relevant": true/false
}
If nothing relevant, set "relevant": false and "findings": [].
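A small validator for that schema catches malformed sub-agent output before aggregation (a sketch, assuming exactly the fields shown above):

```python
def valid_result(r):
    """Check one sub-agent result dict against the expected schema."""
    return (isinstance(r.get("chunk_index"), int)
            and isinstance(r.get("findings"), list)
            and isinstance(r.get("summary"), str)
            and isinstance(r.get("relevant"), bool))
```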
For RLM-lite (no sub-calls): Skip this step. Use rlm_search.py regex results directly.
# Collect all results
python3 scripts/rlm_aggregate.py ./results --output final_answer.json
# Or manually
python3 -c "
import json, glob
results = []
for p in sorted(glob.glob('./results/*.json')):
    results.append(json.load(open(p)))
relevant = [r for r in results if r.get('relevant')]
findings = [f for r in relevant for f in r.get('findings', [])]
json.dump(findings, open('aggregated.json', 'w'), indent=2)
print(f'{len(relevant)}/{len(results)} chunks relevant, {len(findings)} findings')
"
Then do a final synthesis (as sub-agent if aggregated results > 30K chars, or in your own context if small enough):
ORIGINAL QUERY: {query}
AGGREGATED FINDINGS (from {N} chunks):
{findings}
Synthesize a comprehensive answer.
Write the final answer to a file — this is FINAL_ANSWER from Algorithm 1.
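The routing decision above can be sketched as a one-liner (the 30K threshold matches the text; the labels are illustrative):

```python
import os


def synthesis_route(aggregated_path, threshold=30_000):
    """Decide where the final synthesis runs based on aggregate size."""
    if os.path.getsize(aggregated_path) > threshold:
        return "sub-agent"      # too big to read directly
    return "own context"        # small enough to synthesize in place
```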
Probe → Chunk → Map (parallel sub-agents) → Reduce (aggregate) → FINAL_ANSWER
Best for: summarization, classification, extraction across all content.
Probe → Filter (regex/grep) → Process (sub-agent on filtered content) → FINAL_ANSWER
Best for: finding specific information. RLM-lite often sufficient here.
Probe → Chunk → Classify (sub-agents label each) → Compute pairs (code) → FINAL_ANSWER
Best for: cross-referencing, relationship finding, comparisons.
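A sketch of the pattern, with a plain function standing in for the classifying sub-agent: N semantic calls assign labels, then cheap code does the O(N²) pairing.

```python
from itertools import combinations


def classify_then_compute(chunks, classify, compare):
    labels = [classify(c) for c in chunks]    # N semantic (sub-agent) calls
    return [(i, j)
            for (i, li), (j, lj) in combinations(enumerate(labels), 2)
            if compare(li, lj)]               # N^2 cheap label comparisons
```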
Probe → Mega-chunk (~10 groups) → Sub-agent per mega-chunk (each further chunks internally) → Aggregate → FINAL_ANSWER
Best for: inputs so large that chunking alone produces 100+ pieces.
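The grouping step can be sketched as follows; round-robin assignment is one arbitrary choice, and each group would go to one sub-agent that chunks further internally.

```python
def make_mega_chunks(paths, n_groups=10):
    """Fold many chunk paths into at most n_groups mega-chunks."""
    groups = [[] for _ in range(min(n_groups, len(paths)))]
    for i, p in enumerate(paths):
        groups[i % len(groups)].append(p)   # round-robin assignment
    return groups
```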
RLM costs are comparable to the base model's at the median, but show high variance at the tail because some trajectories run long.
Add this to any project to activate RLM:
## Large Input Processing (RLM)
When inputs exceed 50KB or tasks require processing all/most of a large file:
1. Prompt is a VARIABLE (file path), not in context
2. Probe first: wc, head, file, find
3. Filter with grep/regex before any LLM processing
4. Chunk into 30K-80K char pieces (scripts/rlm_chunker.py)
5. Delegate semantic work to sub-agents (Task tool = lm_query)
6. Store all intermediate results in files, never context
7. Aggregate programmatically → FINAL_ANSWER in a file
8. Code for deterministic work, sub-agents for semantic work only