Extract memory candidates from past conversation sessions and let the user choose what to save
Scan past Claude Code sessions for this project, extract memory-worthy information, and present candidates for the user to choose from.
/distill-sessions # today's sessions (default)
/distill-sessions --all # all sessions for this project
/distill-sessions --today # today's sessions only
/distill-sessions --session <id> # specific session by ID
Before doing anything else, run the setup script:
bash ~/.claude/skills/distill-sessions/scripts/setup-hook.sh
This script will check whether ~/.claude/settings.json already has the memory-gate hook registered, and register it if not.

If the script reports "status":"newly_registered":
Display this message:
"Memory-gate hook has been newly registered in settings.json. A session restart is required. Please exit with
/exit, then relaunch and re-run/distill-sessions. (Without the hook, memory writes cannot be validated.)"
Then STOP. Do not continue.
If the script reports "status":"already_registered", proceed normally.
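The status check can be scripted; a minimal sketch, assuming the setup script prints a single JSON object with a status field on stdout:

```shell
# Run setup and branch on the reported status (assumes output of the
# form {"status": "..."}).
STATUS=$(bash ~/.claude/skills/distill-sessions/scripts/setup-hook.sh \
  | python3 -c 'import json, sys; print(json.load(sys.stdin)["status"])')

if [[ "$STATUS" == "newly_registered" ]]; then
  echo "Hook newly registered; restart the session before continuing." >&2
  exit 0
fi
```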
Find session files for the current project directory.
# Project session directory pattern:
# ~/.claude/projects/{encoded-cwd}/*.jsonl
PROJECT_DIR=$(echo "$PWD" | sed 's|/|-|g; s|^-||')
SESSION_DIR="$HOME/.claude/projects/-${PROJECT_DIR}"
List all .jsonl files in that directory (excluding /subagents/).
Also cross-reference with ~/.claude/sessions/*.json to get metadata (pid, startedAt, name).
Filtering:
- --today (default): only sessions from today
- --all: all sessions found
- --session <id>: match the specific session ID

If no sessions are found, inform the user and stop.
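The discovery step and the default today-only filter can be sketched as follows (the find -newermt predicate is one way to do the date cut; -maxdepth 1 keeps subagent transcripts out):

```shell
# Resolve this project's session directory and list today's .jsonl files,
# skipping the subagents/ subdirectory.
PROJECT_DIR=$(echo "$PWD" | sed 's|/|-|g; s|^-||')
SESSION_DIR="$HOME/.claude/projects/-${PROJECT_DIR}"

SESSIONS=$(find "$SESSION_DIR" -maxdepth 1 -name '*.jsonl' \
  -newermt "$(date +%Y-%m-%d)" 2>/dev/null)

if [[ -z "$SESSIONS" ]]; then
  echo "No sessions found for this project today." >&2
  exit 0
fi
```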
For each discovered session, try the fast --resume path first, and fall
back to lossless chunking if the session is too large to load.
Pass 1 — --resume (fast path). Works for small/medium sessions where
the entire conversation fits in the model's context window. If this fails
with Prompt is too long (or similar size error), proceed to Pass 2.
# Pass 1: try --resume with sonnet 5 attempts → haiku 5 attempts on transient errors
MODEL="sonnet"
MAX_RETRIES=5
RESULT=""
TOO_LONG=0
for attempt in $(seq 1 $MAX_RETRIES); do
RESULT=$(claude -p --resume <session-id> \
--permission-mode default \
--allowedTools "Read Grep Glob" \
--model $MODEL \
"<extraction prompt below>" 2>&1) && break
if [[ "$RESULT" == *"Prompt is too long"* ]]; then
TOO_LONG=1
break
fi
echo "sonnet attempt $attempt failed, retrying..." >&2
sleep 2
done
if [[ $TOO_LONG -eq 0 ]] && [[ -z "$RESULT" || "$RESULT" == *"overloaded"* || "$RESULT" == *"Error"* ]]; then
MODEL="haiku"
for attempt in $(seq 1 $MAX_RETRIES); do
RESULT=$(claude -p --resume <session-id> \
--permission-mode default \
--allowedTools "Read Grep Glob" \
--model $MODEL \
"<extraction prompt below>" 2>&1) && break
if [[ "$RESULT" == *"Prompt is too long"* ]]; then
TOO_LONG=1
break
fi
echo "haiku attempt $attempt failed, retrying..." >&2
sleep 2
done
fi
Pass 2 — chunking (lossless fallback). When TOO_LONG=1, chunk the
raw .jsonl losslessly and extract per chunk with a sliding-window summary:
if [[ $TOO_LONG -eq 1 ]]; then
  # 1) Chunk the session (lossless: keeps everything except file-history-snapshot,
  #    splits oversized single messages with [LARGE MESSAGE k/N] markers).
  CHUNKS_DIR=$(mktemp -d)
  python3 ~/.claude/skills/distill-sessions/scripts/chunk_and_extract.py \
    "$SESSION_JSONL" --out-dir "$CHUNKS_DIR" --max-chars 80000 --overlap 2

  # 2) Extract candidates per chunk with cumulative summary.
  RESULT=$(bash ~/.claude/skills/distill-sessions/scripts/extract-from-chunks.sh \
    "$CHUNKS_DIR" sonnet)
  rm -rf "$CHUNKS_DIR"
fi
The chunker enforces lossless splitting: every message is kept except file-history-snapshot entries, and oversized single messages are split into [LARGE MESSAGE k/N] parts (no truncation).

extract-from-chunks.sh calls claude -p per chunk in order, prepending a
running 2K-char summary of prior chunks so the model keeps cross-chunk
continuity without re-reading earlier content. Output is a merged JSON
array of candidates from all chunks.
Extraction prompt:
Analyze this conversation and extract ONLY information worth remembering
for future sessions. Focus on:
1. **user**: Role, preferences, knowledge level, work style
2. **feedback**: Corrections ("don't do X"), confirmations ("yes, exactly like that")
3. **project**: Non-obvious context about goals, deadlines, decisions, stakeholders
4. **reference**: Pointers to external systems (Linear projects, Slack channels, dashboards)
Do NOT extract:
- Code changes, file paths, or architecture (derivable from code)
- Git history or debugging solutions (derivable from git)
- Anything already in CLAUDE.md
- Ephemeral task details
For each candidate, output as JSON array:
[
{
"type": "user|feedback|project|reference",
"title": "short title",
"content": "the memory content",
"why": "why this is worth remembering"
}
]
If nothing is worth remembering, return: []
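Model output can be malformed, so it's worth validating each per-session result against the expected shape before merging. A minimal sketch, not part of the skill's scripts, assuming $RESULT holds one session's output:

```shell
# Fail unless stdin is a JSON array of objects with exactly the keys
# type/title/content/why.
validate_candidates() {
  python3 -c '
import json, sys
data = json.load(sys.stdin)
assert isinstance(data, list)
for c in data:
    assert set(c) == {"type", "title", "content", "why"}, c
'
}

if ! echo "$RESULT" | validate_candidates 2>/dev/null; then
  echo "Skipping session: malformed extraction output" >&2
fi
```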
Run sessions in parallel where possible (up to 3 concurrent).
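The concurrency cap can be sketched with background jobs; extract_session here is a hypothetical wrapper around the Pass 1/Pass 2 logic above:

```shell
# Run at most 3 extractions at a time; each session's candidates land
# in their own results file.
MAX_JOBS=3
mkdir -p results
for sid in "${SESSION_IDS[@]}"; do
  # Wait until a job slot frees up before launching the next one.
  while (( $(jobs -rp | wc -l) >= MAX_JOBS )); do
    sleep 1
  done
  extract_session "$sid" > "results/${sid}.json" &
done
wait  # block until every background extraction finishes
```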
Chunked extraction (Pass 2) often produces many near-duplicate candidates because each chunk is processed independently and the same fact may surface across several chunks. Run an LLM-based dedup pass to collapse them before presenting to the user:
# Write all candidates from every session into a single JSON array file,
# then run the dedup pass. The script returns only the merged result and
# never calls the LLM when the input is empty.
ALL_CANDS=$(mktemp --suffix=.json)
DEDUPED=$(mktemp --suffix=.json)
echo "$COMBINED_CANDIDATES_JSON" > "$ALL_CANDS"
python3 ~/.claude/skills/distill-sessions/scripts/dedup_candidates.py \
  --input "$ALL_CANDS" \
  --output "$DEDUPED" \
  --model sonnet
RESULT=$(cat "$DEDUPED")
rm -f "$ALL_CANDS" "$DEDUPED"
After the LLM dedup pass, also drop candidates that duplicate information already saved in memory (memory/MEMORY.md and ~/.claude/CLAUDE.md) — this check stays on the Claude side, not the dedup script, because it requires reading local memory files.

The dedup script merges semantically duplicate candidates, keeps the richest phrasing for each field, and outputs a JSON array of objects with exactly the keys type, title, content, why. It's a pure post-processing step — the input file is never mutated.
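A crude literal pre-filter can catch exact repeats before the semantic check; this is only an illustrative sketch (the candidate title shown is hypothetical), since the real comparison is semantic and done by Claude after reading the memory files:

```shell
# Return success if the title string appears verbatim (case-insensitive)
# in any of the given memory files. Missing files are ignored.
already_saved() {
  local title=$1
  shift
  grep -qiF -- "$title" "$@" 2>/dev/null
}

if already_saved "prefers exhaustive search" memory/MEMORY.md ~/.claude/CLAUDE.md; then
  echo "candidate already saved; dropping" >&2
fi
```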
Display the merged candidate list, grouped by type, numbered for selection:
## Memory candidates from N sessions
### user
1. [title] — one-line summary
2. [title] — one-line summary
### feedback
3. [title] — one-line summary
4. [title] — one-line summary
### project
5. [title] — one-line summary
(N candidates total)
Then use AskUserQuestion to ask how the user wants to proceed.
If the user picks "Pick", ask them to list the numbers (e.g., "1, 3, 5").
CRITICAL: After the user selects candidates AND before any Write/Edit to memory, write an approval file so the memory-gate hook can verify user consent.
# Build approval file with selected filenames
# This file is checked by validate-memory.sh — without it, all memory writes are blocked.
cat > /tmp/memory-approved.json <<EOF
{
  "approved_at": "$(date -u +%Y-%m-%dT%H:%M:%SZ)",
  "session_id": "<current-session-id>",
  "files": [
    "feedback_exhaustive_search.md",
    "user_work_style.md"
  ]
}
EOF
The files array must contain the exact filenames (basename only) that will be written.
Also include "CLAUDE.md" if global saves are selected.
After all writes are complete, clean up:
rm -f /tmp/memory-approved.json
For each selected candidate, decide where to save:
- Global (~/.claude/CLAUDE.md) — if the candidate applies across all projects (e.g., general work style preferences, universal feedback)
- Project memory (memory/) — if the candidate is specific to this project (e.g., project decisions, project-specific conventions)

Use AskUserQuestion to confirm the split.

For global saves (~/.claude/CLAUDE.md):
For project memory saves:
Write .md files to the project memory directory with frontmatter:
---