Systematic workflow for finding, downloading, and indexing engineering literature by domain. Covers the full lifecycle: discovery via standards ledger and doc index, web search for open-access PDFs, download script generation, PDF validation, catalogue YAML creation, and handoff to the 7-phase document-index-pipeline for indexing. Use when populating a new engineering domain with reference literature or when a WRK item requires domain-specific standards and textbooks.
Full lifecycle: Discover → Search → Download → Validate → Catalogue → Index
Trigger this skill when:
- populating a new engineering domain with reference literature
- a WRK item requires domain-specific standards and textbooks
| Domain | Literature Path | Key Standards Bodies |
|---|---|---|
| cathodic_protection | /mnt/ace-data/digitalmodel/docs/domains/cathodic_protection/literature/ | DNV, NACE, ISO |
| geotechnical | /mnt/ace-data/digitalmodel/docs/domains/geotechnical/literature/ | API, DNV, ISO |
| hydrodynamics | /mnt/ace-data/digitalmodel/docs/domains/hydrodynamics/literature/ | DNV, ITTC, SNAME |
| naval_architecture | /mnt/ace-data/digitalmodel/docs/domains/naval_architecture/literature/ | ABS, DNV, SNAME, IMO |
| pipeline | /mnt/ace-data/digitalmodel/docs/domains/pipeline/literature/ | DNV, API, ASME, BSEE |
| structural | /mnt/ace-data/digitalmodel/docs/domains/structural/literature/ | AISC, DNV, IIW, API |
| structural-parachute | /mnt/ace-data/digitalmodel/docs/domains/structural-parachute/literature/ | NHRA, SFI, NASA, AISC |
| subsea | /mnt/ace-data/digitalmodel/docs/domains/subsea/literature/ | API, DNV, BSEE |
| metocean | /mnt/ace-data/digitalmodel/docs/domains/metocean/literature/ | DNV, API, ISO, WMO |
Some domain categories map to a different target repo:

| Domain | Typical Target Repo |
|---|---|
| catenary | digitalmodel |
| mooring | digitalmodel |
| risers | digitalmodel |
| drilling | OGManufacturing |
| bsee | worldenergydata |
| economics | worldenergydata |
The og_standards corpus lives at /mnt/ace/docs/_standards/ organized by org:
ABS, API, ASTM, BSI, DNV, ISO, MIL, NEMA, Norsok, OnePetro, Unknown.
Inventory DB: /mnt/ace/O&G-Standards/_inventory.db (SQLite, 6.8 GB).
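The inventory DB's schema is not documented here, so any query has to start by discovering it. This sketch (table and column names are hypothetical; the demo runs against an in-memory stand-in rather than the real 6.8 GB file) shows the probe-then-search pattern:

```python
import sqlite3

def list_tables(con):
    """Discover the schema before assuming table or column names."""
    rows = con.execute(
        "SELECT name FROM sqlite_master WHERE type='table' ORDER BY name"
    ).fetchall()
    return [r[0] for r in rows]

def search_column(con, table, column, term):
    """Substring match on one column (SQLite LIKE is ASCII case-insensitive)."""
    sql = f"SELECT * FROM {table} WHERE {column} LIKE ?"
    return con.execute(sql, (f"%{term}%",)).fetchall()

if __name__ == "__main__":
    # In practice: con = sqlite3.connect('/mnt/ace/O&G-Standards/_inventory.db')
    # The demo below uses an in-memory stand-in with a made-up schema.
    con = sqlite3.connect(":memory:")
    con.execute("CREATE TABLE documents (org TEXT, title TEXT, path TEXT)")
    con.execute(
        "INSERT INTO documents VALUES "
        "('DNV', 'DNV-RP-F105 Free Spanning Pipelines', '/demo/f105.pdf')"
    )
    print(list_tables(con))
    print(len(search_column(con, "documents", "title", "f105")))
```

Run `list_tables` against the real DB first, then point `search_column` at whatever title/path column the actual schema exposes.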
```bash
DOMAIN="geotechnical"   # ← set your domain
LIT_DIR="/mnt/ace-data/digitalmodel/docs/domains/${DOMAIN}/literature"
mkdir -p "${LIT_DIR}"
ls -la "${LIT_DIR}"
```
If the domain is new, create standard subdirectories:
```bash
mkdir -p "${LIT_DIR}"/{textbooks,standards,course-notes,worked-examples}
```
Find standards already tracked for this domain:
```bash
uv run --no-project python scripts/data/document-index/query-ledger.py \
    --domain ${DOMAIN} --verbose
```
Record each standard's status: gap, done, wrk_captured, reference.
Standards with gap or wrk_captured are download candidates.
Search the 1M+ record index for existing documents in this domain:
```bash
uv run --no-project python -c "
import json
from collections import Counter

matches = []
with open('data/document-index/index.jsonl') as f:
    for line in f:
        rec = json.loads(line)
        path_lower = rec.get('path', '').lower()
        summary_lower = (rec.get('summary') or '').lower()
        if '${DOMAIN}' in path_lower or '${DOMAIN}' in summary_lower:
            matches.append(rec)

print(f'Found {len(matches)} documents')
by_source = Counter(r['source'] for r in matches)
for s, c in by_source.most_common():
    print(f'  {s}: {c}')
"
```
Prioritize og_standards and ace_standards sources — these are already local.
Check what calculations exist vs. gaps in the target repo:
```bash
uv run --no-project python -c "
import yaml

with open('specs/capability-map/digitalmodel.yaml') as f:
    data = yaml.safe_load(f)

for m in data['modules']:
    if '${DOMAIN}' in m['module'].lower():
        print(f\"Module: {m['module']} ({m.get('standards_count', '?')} standards)\")
        for s in m.get('standards', [])[:30]:
            print(f\"  {s['status']:15s} {s['org']:8s} {s['id'][:70]}\")
"
```
Search for freely available PDFs across these source tiers:
Tier 1 — High-value free sources:
Tier 2 — Conference/journal open access:
Tier 3 — Textbooks and course notes:
WAF/paywall notes:
| Site | Issue | Action |
|---|---|---|
| eagle.org (ABS) | Cloudflare WAF blocks wget/curl | Add to pending_manual |
| archive.org borrow | HTTP 403 for borrow-only items | Add to pending_manual |
| IEEE Xplore | Paywalled unless institutional login | Skip or pending_manual |
| ASME Digital Collection | Paywall | Check og_standards DB |
Option A: Use the research-domain.py driver (queries all data sources, generates brief + script):
```bash
uv run --no-project python scripts/data/research-literature/research-domain.py \
    --category ${DOMAIN} --repo digitalmodel --generate-download-script
```
Option B: Manual script creation from template:
```bash
#!/usr/bin/env bash
# ABOUTME: Download open-access ${DOMAIN} literature
# Usage: bash download-literature.sh [--dry-run]
set -uo pipefail

DEST="/mnt/ace-data/digitalmodel/docs/domains/${DOMAIN}/literature"
LOG_DIR="$(git rev-parse --show-toplevel)/.claude/work-queue/assets"
LOG_FILE="${LOG_DIR}/download-${DOMAIN}.log"
DRY_RUN=false
[[ "${1:-}" == "--dry-run" ]] && DRY_RUN=true

mkdir -p "${DEST}"/{textbooks,standards,course-notes,worked-examples}
mkdir -p "${LOG_DIR}"

# shellcheck source=scripts/lib/download-helpers.sh
source "$(git rev-parse --show-toplevel)/scripts/lib/download-helpers.sh"

log "=== ${DOMAIN} Literature Download ==="
log "Destination: ${DEST}"
log "Dry run: ${DRY_RUN}"

# ─── TEXTBOOKS ────────────────────────────────
log "--- Textbooks ---"
download \
  "https://example.org/textbook.pdf" \
  "${DEST}/textbooks" \
  "Author-Year-Short-Title.pdf"

# ─── STANDARDS ────────────────────────────────
log "--- Standards ---"
download \
  "https://rules.dnv.com/docs/pdf/dnvpm/codes/docs/..." \
  "${DEST}/standards" \
  "DNV-RP-XXXX-Title-Year.pdf" || true

log "=== Download complete ==="
total=$(find "${DEST}" -name "*.pdf" | wc -l)
log "    Total PDFs: ${total}"
```
Save script to: /mnt/ace-data/digitalmodel/docs/domains/${DOMAIN}/literature/download-literature.sh
Key script patterns:
- `source scripts/lib/download-helpers.sh` for the `download` and `log` functions
- `set -uo pipefail` (NOT `set -e`): download failures should log, not abort
- Guard fallible downloads with `|| true` or `|| log "NOTE: ..."`
- Filename convention: `Author-Year-Short-Title.pdf`
- The `download` function auto-skips existing files (resume-safe)
- Run with `--dry-run` first to preview

```bash
# Dry run first
bash download-literature.sh --dry-run
# Execute
bash download-literature.sh
```
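The helpers sourced by the template live in `scripts/lib/download-helpers.sh`, whose contents are not shown here. A minimal sketch of what a `log` and a resume-safe, PDF-validating `download` might look like (the real implementation may differ; the curl flags and magic-byte check are assumptions):

```shell
# Hypothetical sketch of scripts/lib/download-helpers.sh; the real file may differ.
log() {
  printf '%s %s\n' "$(date -u +%FT%TZ)" "$*" | tee -a "${LOG_FILE:-/dev/null}"
}

download() {
  # download URL DEST_DIR FILENAME: skip existing, honor DRY_RUN, verify magic bytes.
  url="$1"; dest_dir="$2"; name="$3"
  out="${dest_dir}/${name}"
  if [ -e "$out" ]; then
    log "skip (exists): $name"
    return 0
  fi
  if [ "${DRY_RUN:-false}" = true ]; then
    log "dry-run: $url -> $out"
    return 0
  fi
  curl -fsSL --retry 3 -A "Mozilla/5.0" -o "$out" "$url" || { log "FAIL: $url"; return 1; }
  # A real PDF starts with the magic bytes %PDF; anything else is a WAF/HTML page.
  if ! head -c 4 "$out" | grep -q '%PDF'; then
    log "NOT A PDF (likely WAF/HTML): $name"
    mkdir -p "${dest_dir}/_failed"
    mv "$out" "${dest_dir}/_failed/${name}"
    return 1
  fi
  log "ok: $name"
}
```

The skip-if-exists branch is what makes reruns resume-safe, and the dry-run branch is why previewing before executing costs nothing.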
```bash
# Validate all PDFs are real PDFs (not HTML/WAF responses)
find "${LIT_DIR}" -name "*.pdf" -exec file {} \; | grep -v "PDF document"
```
Any file that `file` reports as "HTML document" or "ASCII text" instead of "PDF document"
is a WAF response, not a real PDF. Move it to a `_failed/` directory and add it to `pending_manual`.
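The quarantine step can be automated by checking magic bytes directly. This is a hypothetical helper (flat directory only; combine with `find` for the nested `textbooks/`, `standards/`, etc. tree):

```shell
quarantine_non_pdfs() {
  # Move any top-level "*.pdf" whose content is not a real PDF into _failed/.
  lit_dir="$1"
  mkdir -p "${lit_dir}/_failed"
  for f in "${lit_dir}"/*.pdf; do
    [ -e "$f" ] || continue
    # Real PDFs start with the magic bytes %PDF.
    if ! head -c 4 "$f" | grep -q '%PDF'; then
      echo "quarantine: $f"
      mv "$f" "${lit_dir}/_failed/"
    fi
  done
}
```

Usage: `quarantine_non_pdfs "${LIT_DIR}/standards"`, then log each quarantined file under `pending_manual`.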
Write knowledge/seeds/${DOMAIN}-resources.yaml:
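The exact seed schema is not specified in this section, so the sketch below is a hypothetical illustration of what the catalogue YAML might contain; every field name is an assumption to be replaced with the pipeline's actual schema:

```yaml
# Hypothetical structure: field names are assumptions, not the pipeline's real schema.
domain: geotechnical
literature_path: /mnt/ace-data/digitalmodel/docs/domains/geotechnical/literature/
resources:
  - title: Example Textbook Title
    category: textbooks          # textbooks | standards | course-notes | worked-examples
    file: textbooks/Author-Year-Short-Title.pdf
    status: downloaded           # downloaded | pending_manual | failed
  - title: DNV-RP-XXXX Example Standard
    category: standards
    status: pending_manual
    note: Cloudflare WAF blocks automated download
pending_manual: []
```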