Name: Web Scraper
Author: sahiixx

SkillsPool

搜索技能.../

技能内容

Modern web scraping with intelligent content extraction.

Quick Start

Fetch and Extract

node /path/to/skills/web-scraper/scripts/fetch.js https://example.com

Extract with Selectors

node /path/to/skills/web-scraper/scripts/extract.js https://example.com --selector "h1,h2,p"

Get Structured Data

node /path/to/skills/web-scraper/scripts/metadata.js https://example.com

Scripts

node fetch.js <url> [OPTIONS]

node extract.js <url> --selector <css> [OPTIONS]

node metadata.js <url> [OPTIONS]

node links.js <url> [OPTIONS]

node sitemap.js <url> [OPTIONS]

node fetch.js https://blog.example.com/article --output markdown

node extract.js https://shop.example.com --selector "a.product-link" --attr href --multiple

node metadata.js https://example.com

{
  "title": "Example Page",
  "description": "Page description",
  "openGraph": {
    "title": "Example OG Title",
    "image": "https://example.com/image.jpg",
    "type": "website"
  }
}

node links.js https://example.com --external --format csv

node sitemap.js https://example.com/sitemap.xml --filter "/blog/"

# Page Title

Main content extracted and converted to markdown...

## Section Heading

Paragraph text with [links](https://example.com).

{
  "url": "https://example.com",
  "selector": "h2",
  "matches": [
    { "text": "First Heading", "html": "<h2>First Heading</h2>" },
    { "text": "Second Heading", "html": "<h2>Second Heading</h2>" }
  ],
  "count": 2
}

{
  "url": "https://example.com",
  "title": "Page Title",
  "description": "Meta description",
  "canonical": "https://example.com/page",
  "openGraph": {
    "title": "OG Title",
    "description": "OG Description",
    "image": "https://example.com/og-image.jpg",
    "type": "article"
  },
  "twitterCard": {
    "card": "summary_large_image",
    "site": "@example"
  },
  "jsonLd": [
    { "@type": "Article", "headline": "Article Title" }
  ]
}

Web Scraper | Skills Pool

Web Scraper

Web Scraper

Quick Start

Fetch and Extract

Extract with Selectors

Get Structured Data

Scripts

fetch.js

extract.js

metadata.js

links.js

sitemap.js

Examples

Extract Article Content

Get All Product Links

Extract Open Graph Data

Get External Links

Process Sitemap

Output Formats

fetch.js (markdown)

extract.js (JSON)

metadata.js

Best Practices

Notes

Feishu Perm

Discord

Coding Agent (bash-first)

Apple Notes

Feishu Wiki

Bear Notes