Name: Converting Books To Skills
Author: LeoDPraetorian

# Navigate to repository root
ROOT="$(git rev-parse --show-superproject-working-tree --show-toplevel | head -1)" && cd "$ROOT"

# Verify book file exists (user provides the path)
ls {path-to-book-file}.md
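
The `ls` check above only confirms that the path exists. A slightly stricter check, sketched below, also rejects empty files and files with no Markdown headings. The function name `validate_book` is hypothetical, and "has at least one ATX heading" is only an assumed rough proxy for "valid markdown", not a rule from the skill itself.

```shell
# Hypothetical helper: fail early if the book file is missing, empty,
# or contains no Markdown (ATX) headings.
validate_book() {
  local book="$1"
  if [ ! -f "$book" ]; then
    echo "missing: $book" >&2
    return 1
  fi
  if [ ! -s "$book" ]; then
    echo "empty: $book" >&2
    return 1
  fi
  # Assumption: one or more "#"-style headings is a good-enough validity signal.
  if ! grep -q '^#\{1,6\} ' "$book"; then
    echo "no markdown headings found: $book" >&2
    return 1
  fi
  echo "ok: $book"
}
```

Usage: `validate_book {path-to-book-file}.md || exit 1`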

# If the book is still an EPUB, load the conversion skill first to produce the markdown file
Read(".claude/skill-library/claude/skill-management/converting-epub-to-markdown/SKILL.md")

# Check file size (OCR books are typically 10K-30K lines)
wc -l {path-to-book-file}.md

# Preview first 200 lines to identify structure
head -200 {path-to-book-file}.md

# Find TOC section (usually marked with "Table of Contents" or "Contents")
grep -n -i "table of contents\|^contents$\|^## BRIEF CONTENTS" {path-to-book-file}.md

# Extract chapter titles from TOC (adjust line range based on grep results)
sed -n '100,300p' {path-to-book-file}.md | grep -i "chapter"
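
The two commands above can be chained so the line range does not have to be adjusted by hand. This is a minimal sketch: `toc_chapters` is a hypothetical name, and the 200-line window after the TOC heading is an assumed default that may be too small or too large for a given book.

```shell
# Hypothetical helper: find the first TOC heading, then print the
# "chapter" lines from the window that follows it.
toc_chapters() {
  local book="$1" span="${2:-200}" start
  # Line number of the first TOC marker (empty if none is found).
  start=$(grep -n -i -m1 -E 'table of contents|^contents$' "$book" | cut -d: -f1) || true
  if [ -z "$start" ]; then
    echo "no TOC heading found in $book" >&2
    return 1
  fi
  # Print only the chapter entries inside the assumed TOC window.
  sed -n "${start},$((start + span))p" "$book" | grep -i 'chapter'
}
```

Usage: `toc_chapters {path-to-book-file}.md 300`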

# Search for common chapter patterns
grep -n "^##\s\+CHAPTER\|^##\s\+[0-9]" {path-to-book-file}.md | head -20

# List all detected chapters in order
grep -n "^##\s\+CHAPTER\|^##\s\+[0-9]" {path-to-book-file}.md

# Check the sequence for gaps (e.g., 1,2,3,8,9,10 - missing 4-7)
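
The gap check can be automated rather than read off by eye. The sketch below assumes chapter headings look like `## CHAPTER 8 ...` or `## 8 ...`, matching the grep patterns above. `chapter_gaps` is a hypothetical name.

```shell
# Hypothetical helper: print one line for every break in the chapter
# number sequence; print nothing when the sequence is contiguous.
chapter_gaps() {
  local book="$1"
  grep -E '^##[[:space:]]+(CHAPTER[[:space:]]+)?[0-9]+' "$book" \
    | sed -E 's/^##[[:space:]]+(CHAPTER[[:space:]]+)?([0-9]+).*/\2/' \
    | awk 'NR > 1 && $1 != prev + 1 { printf "gap: %d -> %d\n", prev, $1 }
           { prev = $1 }'
}
```

For headings numbered 1, 2, 3, 8, 9 this prints `gap: 3 -> 8`, which points at the missing chapters 4-7.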

# After splitting, check chunk sizes (both lines and tokens)
wc -l references/chapters/*.md

# Check token counts (character count / 4 = approximate tokens)
for f in references/chapters/*.md; do
  chars=$(wc -c < "$f")
  tokens=$((chars / 4))
  echo "$f: $tokens tokens (approx)"
done

# Check for violations:
# - Any file >25,000 tokens (~100,000 characters)? → Need semantic splitting
# - Total count <5? → Need finer granularity (H2 sections)
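
Both violation checks can be run in one pass. This is a sketch under the same chars / 4 approximation used above. The name `check_chunks` and its argument order are hypothetical. The defaults of 25,000 tokens and 5 files come from the limits listed above.

```shell
# Hypothetical helper: flag chapter files over the token limit and
# flag a chapter directory that contains too few files.
check_chunks() {
  local dir="$1" limit="${2:-25000}" min_files="${3:-5}"
  local count=0 status=0 f chars tokens
  for f in "$dir"/*.md; do
    [ -e "$f" ] || continue            # skip the literal glob when no files match
    count=$((count + 1))
    chars=$(wc -c < "$f")
    tokens=$((chars / 4))              # approximate tokens, as above
    if [ "$tokens" -gt "$limit" ]; then
      echo "OVER LIMIT: $f (~$tokens tokens)"
      status=1
    fi
  done
  if [ "$count" -lt "$min_files" ]; then
    echo "TOO FEW FILES: $count (< $min_files)"
    status=1
  fi
  return "$status"
}
```

Usage: `check_chunks references/chapters` prints nothing and returns 0 when every file is within the limit and there are at least 5 files.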

# Check token count first
chars=$(wc -c < chapter-08.md); echo "$((chars / 4)) tokens (approx)"

# Found semantic split at line 4300 (## Executive objects - major topic shift)
sed -n '1,4299p' chapter-08.md > chapter-08-part1.md    # 4,299 lines, ~17,196 tokens ✓
sed -n '4300,8389p' chapter-08.md > chapter-08-part2.md  # 4,090 lines, ~16,360 tokens ✓
rm chapter-08.md  # Remove unsplit version

# Verify both parts are under 25,000 token limit
for f in chapter-08-part*.md; do
  chars=$(wc -c < "$f")
  echo "$f: $((chars / 4)) tokens (approx)"
done
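
The example above starts from a split line that was already known. One way to find candidates is to list every H2 heading with the approximate token count that precedes it, then pick a heading near the middle that also marks a topic shift. The sketch below only produces the list; choosing the heading remains a judgment call. `split_candidates` is a hypothetical name, and `length()` in awk counts characters rather than bytes in some locales, so the figures are rough.

```shell
# Hypothetical helper: list H2 headings as
#   <line number> TAB ~<tokens before this line> TAB <heading text>
split_candidates() {
  local file="$1"
  awk '
    /^## / { printf "%d\t~%d tokens before\t%s\n", NR, chars / 4, $0 }
    { chars += length($0) + 1 }   # +1 accounts for the newline
  ' "$file"
}
```

Usage: `split_candidates chapter-08.md`, then split at the heading whose preceding token count keeps both parts under the limit.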

# Check if splitting needed
chars=$(wc -c < chapter-10.md); echo "$((chars / 4)) tokens (approx)"

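# Split at line 2668 (## Windows Management Instrumentation)
# Note: the commands below end part 1 at line 2668; if the heading itself is on line 2668,
# use '1,2667p' and '2668,5687p' so the heading opens part 2, as in the chapter-08 example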
sed -n '1,2668p' chapter-10.md > chapter-10-part1.md    # 2,668 lines, ~10,672 tokens ✓
sed -n '2669,5687p' chapter-10.md > chapter-10-part2.md  # 3,019 lines, ~12,076 tokens ✓
rm chapter-10.md

# Create chapters directory
mkdir -p .claude/skill-library/{category}/{skill-name}/references/chapters

# Extract each chapter using line boundaries from Step 3
# Format: sed -n '{start},{end}p' {source} > {output}

# Example from today's conversion (Windows Part 2):
sed -n '534,8922p' {path-to-book-file}.md > .claude/skill-library/{path}/references/chapters/chapter-08.md
sed -n '8923,12692p' {path-to-book-file}.md > .claude/skill-library/{path}/references/chapters/chapter-09.md
# ... repeat for each chapter

# Verify extraction
wc -l .claude/skill-library/{path}/references/chapters/*.md
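
Repeating the `sed` line by hand for every chapter is error-prone. A loop over a list of boundaries, sketched below, does the same extraction. `extract_chapters` and the "start end name" input format are hypothetical conventions for this sketch, not part of the skill.

```shell
# Hypothetical helper: read "start end name" triples from stdin and
# write each line range of the book to <outdir>/<name>.md.
extract_chapters() {
  local book="$1" outdir="$2" start end name
  mkdir -p "$outdir"
  while read -r start end name; do
    [ -n "$start" ] || continue        # ignore blank lines
    sed -n "${start},${end}p" "$book" > "$outdir/$name.md"
  done
}
```

Usage, with the boundaries from the example above:

```shell
extract_chapters {path-to-book-file}.md .claude/skill-library/{path}/references/chapters <<'EOF'
534 8922 chapter-08
8923 12692 chapter-09
EOF
```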

---

Phase	Purpose	Output
1. Validation	Verify book file exists and is valid markdown	Book path confirmed
2. TOC Analysis	Extract keywords from table of contents	Keyword list for description
3. Chapter Detection

Converting Books To Skills

When to Use

Quick Reference

Core Workflow

Step 0: Prerequisites

Step 1: Validate Book File

Step 2: Extract Keywords from Table of Contents

Step 3: Detect Chapter Boundaries

Step 4: Split Book into Chapter Files

Step 5: Generate SKILL.md with Chapter Summaries
