技能檔案

Web Fetcher

Name: Web Fetcher
Author: openclaw

Smart web content fetcher - articles and videos from WeChat, Feishu, Bilibili, Zhihu, Toutiao, YouTube, etc. Triggers: '抓取文章', '下载网页', '保存文章', 'fetch URL', '下载视频', '抓取飞书文档', '抓取微信文章', '把这个链接内容保存下来', '下载B站视频', 'download video', 'scrape article'.

openclaw4,189 星標2026年3月11日

職業
分類: 內容創作

技能內容

Smart web content fetcher for Claude Code. Automatically detects platform and uses the best strategy to fetch articles or download videos.

Quick Start

# Fetch an article
python3 {SKILL_DIR}/fetcher.py "URL" -o ~/docs/

# Download a video
python3 {SKILL_DIR}/fetcher.py "https://b23.tv/xxx" -o ~/videos/

# Batch fetch from file
python3 {SKILL_DIR}/fetcher.py --urls-file urls.txt -o ~/docs/

Install Dependencies

Install only what you need — dependencies are checked at runtime:

Dependency	Purpose	Install
scrapling	Article fetching (HTTP + browser)	`pip install scrapling`
yt-dlp	Video download	`pip install yt-dlp`
camoufox	Anti-detection browser (Xiaohongshu, Weibo)

相關技能

Web Fetcher | Skills Pool

Platform	Method	Notes
mp.weixin.qq.com	scrapling	Extracts `data-src` images, handles SVG placeholders
*.feishu.cn	Virtual scroll	Collects all blocks via scrolling, downloads images with cookies
zhuanlan.zhihu.com	scrapling	`.Post-RichText` selector
www.zhihu.com	scrapling	`.RichContent` selector
www.toutiao.com	scrapling	Handles `toutiaoimg.com` base64 placeholders
www.xiaohongshu.com	camoufox	Anti-bot protection requires stealth browser
www.weibo.com	camoufox	Anti-bot protection requires stealth browser
bilibili.com / b23.tv	yt-dlp	Video download, supports quality selection
youtube.com / youtu.be	yt-dlp	Video download
douyin.com	yt-dlp	Video download
Unknown URLs	scrapling	Generic fetch with fallback tiers

python3 {SKILL_DIR}/fetcher.py [URL] [OPTIONS]

Arguments:
  url                    URL to fetch

Options:
  -o, --output DIR       Output directory (default: current)
  -q, --quality N        Video quality, e.g. 1080, 720 (default: 1080)
  --method METHOD        Force method: scrapling, camoufox, ytdlp, feishu
  --selector CSS         Force CSS selector for content extraction
  --urls-file FILE       File with URLs (one per line, # for comments)
  --audio-only           Extract audio only (video downloads)
  --no-images            Skip image download (articles)
  --cookies-browser NAME Browser for cookies (e.g., chrome, firefox)

Problem	Solution
`scrapling not found`	`pip install scrapling`
`yt-dlp not found`	`pip install yt-dlp`
Article content too short	Try `--method camoufox` for JS-heavy pages
Feishu returns login page	The doc may require authentication
Bilibili 403	Use `--cookies-browser chrome`
Image download fails	Check network; WeChat images need Referer header (auto-handled)

from lib.router import route, check_dependency
from lib.article import fetch_article
from lib.video import fetch_video
from lib.feishu import fetch_feishu

# Route a URL
r = route("https://mp.weixin.qq.com/s/xxx")
# {'type': 'article', 'method': 'scrapling', 'selector': '#js_content', 'post': 'wx_images'}

# Fetch article
fetch_article(url, output_dir="/tmp/out", route_config=r)

# Download video
fetch_video(url, output_dir="/tmp/out", quality="720")

# Fetch Feishu doc
fetch_feishu(url, output_dir="/tmp/out")

Web Fetcher

Quick Start

Install Dependencies

Web Fetcher

Quick Start

Install Dependencies

Smart Routing

CLI Reference

Platform Notes

WeChat (mp.weixin.qq.com)

Feishu (*.feishu.cn)

Bilibili

Troubleshooting

Manual Usage

Article Writing

Article Writing

Content Engine

Brand Voice

Article Writing

Article Writing