Curates foreign tech blogs (e.g. the Daily Dose of Data Science Substack) via RSS, classifies items into the learn-likecc RAG or Agent “技术热点” (tech hot topics) sub-topics, drafts Chinese summaries with attribution, and appends rows to the site's HTML tables. Use when the user asks to ingest Daily Dose of DS, Substack feeds, auto-fetch hot topics, translate articles for topic-rag-hot or topic-agent-hot, or refresh external curated content for the course site.
python3 tools/fetch_substack_rss.py --url https://blog.dailydoseofds.com/feed --limit 25 --json
Use another --url for a different Substack or any RSS 2.0 feed.
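For orientation, the sketch below shows how the items of an RSS 2.0 feed could be pulled out with the Python standard library alone. It is not the implementation of tools/fetch_substack_rss.py; the function name `parse_rss_items`, the returned field names, and the `SAMPLE` feed are assumptions made for illustration.

```python
import xml.etree.ElementTree as ET


def parse_rss_items(xml_text, limit=25):
    """Return up to `limit` items from an RSS 2.0 document as plain dicts.

    Hypothetical helper: the field names mirror the standard RSS 2.0
    child elements of <item>; missing elements become empty strings.
    """
    root = ET.fromstring(xml_text)
    items = []
    for item in root.iterfind("./channel/item"):
        if len(items) >= limit:
            break
        items.append({
            "title": (item.findtext("title") or "").strip(),
            "link": (item.findtext("link") or "").strip(),
            "pubDate": (item.findtext("pubDate") or "").strip(),
            "description": (item.findtext("description") or "").strip(),
        })
    return items


# Tiny made-up feed used only to exercise the parser.
SAMPLE = """<rss version="2.0"><channel><title>Demo</title>
<item><title>Post A</title><link>https://example.com/a</link>
<pubDate>Mon, 06 Jan 2025 10:00:00 GMT</pubDate>
<description>About RAG</description></item>
<item><title>Post B</title><link>https://example.com/b</link></item>
</channel></rss>"""

if __name__ == "__main__":
    for entry in parse_rss_items(SAMPLE, limit=25):
        print(entry["title"], entry["link"])
```

Returning empty strings for missing elements keeps later steps simple: a blank pubDate can be treated as unreliable without extra None checks.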
Classify each item into site/topic-rag-hot.html or site/topic-agent-hot.html using the keyword heuristic in reference.md. If both apply, pick the stronger match or ask the user once.
For each selected item, produce:
Short tags such as Agent / RAG / 工程 (engineering) / 训练 (training).
Always keep the 原文链接 (original-article link, i.e. the link from RSS).
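The classification step (RAG page versus Agent page, stronger match wins, ask once on a tie) can be sketched as a simple keyword count. The keyword sets below are placeholders chosen for illustration, not the actual heuristic in reference.md, and the function name `classify` and the `"ask"` return value are likewise assumptions.

```python
import re

# Hypothetical keyword sets; the real ones live in reference.md.
RAG_KEYWORDS = {"rag", "retrieval", "embedding", "embeddings", "vector", "reranking"}
AGENT_KEYWORDS = {"agent", "agents", "agentic", "mcp", "planner"}

RAG_PAGE = "site/topic-rag-hot.html"
AGENT_PAGE = "site/topic-agent-hot.html"


def classify(title, description=""):
    """Return the target page, "ask" on a tie, or None if nothing matches."""
    # Tokenise on alphanumerics so "storage" does not count as "rag".
    tokens = re.findall(r"[a-z0-9]+", f"{title} {description}".lower())
    rag_score = sum(1 for t in tokens if t in RAG_KEYWORDS)
    agent_score = sum(1 for t in tokens if t in AGENT_KEYWORDS)
    if rag_score == 0 and agent_score == 0:
        return None      # not a candidate for either page
    if rag_score == agent_score:
        return "ask"     # both apply equally: ask the user once
    return RAG_PAGE if rag_score > agent_score else AGENT_PAGE


if __name__ == "__main__":
    print(classify("Building a RAG pipeline with vector retrieval"))
    print(classify("RAG for agents"))
```

Whole-token matching is a deliberate choice here: substring matching on a short keyword like "rag" would also fire on unrelated words.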
Append a <tr> block inside the <tbody> of the target page’s curated table. Match existing columns:
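(pubDate converted to a short date, or “—” if it is unreliable)
(RAG 技术热点 or Agent 技术热点)
Add a footnote line if the batch is large: “本期条目来自 RSS 抓取 + 人工摘要,版权归原作者。” (“Entries in this batch come from RSS fetching plus manual summaries; copyright belongs to the original authors.”)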
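The pubDate handling above can be sketched with the standard library, since RSS 2.0 dates follow the RFC 822 style that `email.utils.parsedate_to_datetime` parses. The `YYYY-MM-DD` output format and the helper name `short_date` are assumptions; match whatever short-date style the existing table rows already use.

```python
from email.utils import parsedate_to_datetime

# The dash placeholder used when the date cannot be trusted.
PLACEHOLDER = "\u2014"


def short_date(pub_date):
    """Convert an RSS pubDate string to YYYY-MM-DD, or return the placeholder."""
    if not pub_date or not pub_date.strip():
        return PLACEHOLDER
    try:
        parsed = parsedate_to_datetime(pub_date.strip())
    except (TypeError, ValueError):
        # Older Python versions raise TypeError on unparseable input,
        # newer ones raise ValueError; treat both as "unreliable".
        return PLACEHOLDER
    return parsed.strftime("%Y-%m-%d")


if __name__ == "__main__":
    print(short_date("Mon, 06 Jan 2025 10:00:00 GMT"))
    print(short_date("not a date"))
```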
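Links: target="_blank" with rel="noopener noreferrer".
site/topic-rag.html (for the All-in-RAG system reference, see #all-in-rag on the same page) → 论文分析 (paper analysis) / 技术热点 (tech hot topics)
site/topic-agent.html → 论文分析 (paper analysis) / 技术热点 (tech hot topics) / 生态对比 (ecosystem comparison)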
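The row-append step described earlier (a `<tr>` inside the curated table's `<tbody>`, with the original link opened via target="_blank" and rel="noopener noreferrer") might look roughly like the sketch below. The column order in `build_row` and the choice to insert before the last `</tbody>` on the page are assumptions; check the target page's existing rows and adjust before using anything like this.

```python
from html import escape


def build_row(date, title, summary, tag, url, source):
    """Build one <tr>; the column order here is hypothetical."""
    link = (
        f'<a href="{escape(url, quote=True)}" target="_blank" '
        f'rel="noopener noreferrer">{escape(title)}</a>'
    )
    cells = [escape(date), link, escape(summary), escape(tag), escape(source)]
    body = "".join(f"  <td>{cell}</td>\n" for cell in cells)
    return f"<tr>\n{body}</tr>"


def append_row(page_html, row):
    """Insert `row` just before the last </tbody> in the page.

    Assumes the curated table owns the last <tbody> on the page.
    """
    idx = page_html.rfind("</tbody>")
    if idx == -1:
        raise ValueError("no </tbody> found in page")
    return page_html[:idx] + row + "\n" + page_html[idx:]


if __name__ == "__main__":
    row = build_row("2025-01-06", "Post A", "摘要", "RAG",
                    "https://example.com/a?x=1&y=2", "Daily Dose of DS")
    print(append_row("<table><tbody>\n</tbody></table>", row))
```

Escaping every cell, including the URL, guards against feed titles or links that contain `&`, `<`, or quotes breaking the table markup.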