Name: Book Scrape
Author: chtapodi

Search skills.../

Book Scrape | Skills Pool

# See which CC ingredients are unresolved:
python3.11 -c "
import json
from flavor_graph.generate_recipe_notes import resolve_ingredient, MANUAL_RESOLUTION, EXACT_RESOLUTION

cc = json.load(open('Reference/Documents/scraped_recipes_cocktail_codex.json'))
all_ings = set()
for r in cc:
    for ing in r['ingredients']:
        all_ings.add(ing['raw_name'])

unresolved = [n for n in sorted(all_ings)
              if n not in MANUAL_RESOLUTION
              and n not in EXACT_RESOLUTION
              and n.lower() not in {k.lower() for k in {**MANUAL_RESOLUTION, **EXACT_RESOLUTION}}]
print(f'Unresolved: {len(unresolved)}')
for u in unresolved:
    print(f'  {u!r}')
"

# Normalize (if not already done)
python3.11 flavor_graph/scraper_normalize.py --source cocktail_codex

# Dry-run: preview what will be created vs. skipped (collision = already in vault)
python3.11 flavor_graph/generate_recipe_notes.py --source cocktail_codex --dry-run

# Generate
python3.11 flavor_graph/generate_recipe_notes.py --source cocktail_codex

You are filling flavor descriptions for cocktail recipe notes in the Flavors vault.

Vault root: /home/mango/workspace/Obsidian
Flavors root: /home/mango/workspace/Obsidian/Flavors/

YOUR BATCH: [paste list of recipe note paths here]

For each recipe note:
1. Read the note with obsidian_read_note to get the ingredients list and existing Notes section.
2. Check the Notes section — if it contains the book's own tasting notes (Cocktail Codex
   notes field), use that as primary source material.
3. Use book_search MCP to get the book's perspective on the recipe and its key ingredients:
   - book_recipe_search(query="<recipe name>", limit=3) — find the structured recipe record
   - book_search(query="<recipe name> flavor taste", limit=4) — find any page prose about it
   - book_get_page(book_id, page, context_pages=1) for any strong hits
4. For each key ingredient in the recipe, optionally call:
   - book_search(query="<ingredient> flavor taste aroma", limit=3) to understand its contribution
5. Write a 2-3 sentence flavor description covering: dominant character, balance between
   components, and finish. Be specific and sensory — not generic. Draw from book language first.
6. Replace [[Flavor placeholder]] with 2-4 real [[Flavor Name]] wikilinks from
   Flavors/Flavors/ that match the recipe's character. Check obsidian_list_directory
   on Flavors/Flavors/ to confirm the flavor notes exist before linking. Prefer canonical bare Obsidian links like `[[Bitter Flavor]]`; only use folder-qualified targets when title collisions require disambiguation.
7. Change status: wip → status: complete if both fields are filled.
8. Use obsidian_patch_note for surgical edits — do not rewrite the whole note.
9. Validate any newly created ingredient or flavor note with `PYTHONPATH=. python3.11 run_checks.py --note-standards --paths "<note path>"` before finishing the batch.

INVARIANTS:
- Never rename notes or delete files
- Flavor wikilinks must resolve to existing *Flavor.md notes in Flavors/Flavors/
- Be specific: "dry sherry and Cognac backbone with orange curaçao sweetness and bitters structure"
  beats "complex and well-balanced"
- Append one log entry to Flavors/_system/llm-activity-log.md covering the whole batch

book_search(query="ounce lime juice rum", book_id="target_book", limit=20)
book_search(query="dash bitters stir strain", book_id="target_book", limit=20)

python3.11 -c "
import os
vault = set(f.lower().replace('.md','') for f in os.listdir('Recipes/'))
# For each candidate recipe name:
name = 'my recipe name'
key = (name + ' cocktail recipe').lower()
print('In vault:', key in vault)
"

from flavor_graph.generate_recipe_notes import create_recipe_note
from flavor_graph.scraper_normalize import parse_ingredient_string
from pathlib import Path

recipe = {
    "name": "Recipe Name",
    "source": "Book Title",
    "page": 45,
    "specs": {},
    "method": "Stir and strain...",
    "notes": "Tasting notes or description from book.",
    "ingredients": [parse_ingredient_string(s) for s in ["1½ oz Rum", "¾ oz Lime juice"]],
}
create_recipe_note(recipe, Path("Recipes/"))

book_search_topic(topic="rum", limit=15)       # → for Smugglers Cove rum chapters
book_search_topic(topic="citrus", limit=15)    # → acid/zest flavor chemistry
book_search_topic(topic="bitters", limit=15)   # → bitters ingredient descriptions

Check	Method
Recipe already in vault	`ls Recipes/` normalized filename match
Recipe variant (collision)	Script appends `(CC)` suffix automatically
Ingredient already exists	`obsidian_list_directory` on expected subfolder
Ingredient under different name	`flavor_search(query=name)` — if score > 0.85, alias rather than new note
Ingredient in different subfolder	Check all `Ingredients/*/` not just the expected one

Update _ingestion/book-extraction-ledger.md — mark rows complete, add chunk_id ranges
Append to _system/llm-activity-log.md

If new ingredient/compound notes were written:

PYTHONPATH=. python3.11 run_checks.py --bidirectional --fix
PYTHONPATH=. python3.11 run_checks.py --name-drift --fix

Run quality gate to verify new notes aren't thin:

PYTHONPATH=. python3.11 run_checks.py --note-quality --tier-report

Book	Type	Action	Priority
Cocktail Codex	recipe (json→vault)	Extend MANUAL_RESOLUTION, run generate script, fill flavor descriptions	High
Smugglers Cove	ingredient (rum styles)	Sweep rum chapters via `book_search_topic("rum", book_id="smugglers_cove")`	Medium
Liquid Intelligence	compound / technique	Sweep chemistry chapters for flavor-relevant compounds	Low
All books	recipe flavor descriptions	139 existing wip notes need flavor description fill	Medium

Book Scrape

Book Scrape Skill

Pre-flight (mandatory)

Book Scrape

Book Scrape Skill

Pre-flight (mandatory)

Workflow A — Recipe Sweep

When to use

Step 1 — Ingredient resolution (before generating notes)

Step 2 — Generate skeleton notes

Step 3 — Agent fill pass for flavor descriptions

Step 4 — Update ledger

Workflow B — Recipe Sweep from Cache (no JSON)

When to use

Step 1 — Find recipe pages

Step 2 — Dedup against vault

Step 3 — Extract and write

Workflow C — Ingredient Sweep

When to use

Step 1 — Identify target chapters

Step 2 — Vault gap analysis

Step 3 — Quality gate

Step 4 — Write or enrich

Step 5 — Update ledger

Dedup Rules (all workflows)

Context Safety

After Every Session

Current Scrape Queue (from ledger)

Notion

Feishu Wiki

Gemini

Obsidian Vault Maintainer

Openclaw Pr Maintainer

Wiki Maintainer