Scan a Yuque knowledge base to find stale or potentially outdated documents, analyze their relevance, and generate a maintenance report with update or archive recommendations. For personal/individual use.
Scan a Yuque knowledge base to discover documents that haven't been updated in a long time, analyze whether their content may be outdated, and generate a maintenance report with actionable recommendations.
All tools are from the yuque-mcp server:
yuque_list_books — List user's knowledge basesyuque_list_docs — List all documents in a knowledge base with metadatayuque_get_doc — Read document content for staleness analysisyuque_get_toc — Get knowledge base structureCase A — User specifies a knowledge base:
Extract repo_id (namespace) from the provided link or name.
Case B — User wants to scan all knowledge bases: List all repos first:
Tool: yuque_list_books
Parameters:
user_id: "<user_id>"
Present the list and let the user choose, or scan them one by one if the user confirms.
Case C — User is vague: Ask: "你想检查哪个知识库?我可以先列出你的所有知识库。"
Get all documents in the target knowledge base:
Tool: yuque_list_docs
Parameters:
repo_id: "<namespace>"
Extract key metadata for each document:
title — Document titleslug — Document identifierupdated_at — Last update timestampcreated_at — Creation timestampword_count — Document lengthstatus — Publication statusIf the knowledge base is empty (no documents), inform the user: "这个知识库目前没有文档。"
Calculate the age of each document (days since last update) and classify:
| Category | Criteria | Label |
|---|---|---|
| 🟢 Fresh | Updated within 90 days | 活跃 |
| 🟡 Aging | Updated 90-180 days ago | 老化中 |
| 🟠 Stale | Updated 180-365 days ago | 陈旧 |
| 🔴 Dormant | Not updated for 365+ days | 休眠 |
For documents classified as 🟠 Stale or 🔴 Dormant, read the content of up to 10 documents (prioritize the oldest and most important-looking ones by title):
Tool: yuque_get_doc
Parameters:
repo_id: "<namespace>"
doc_id: "<slug>"
For each document, analyze:
Time-sensitive content — Does it reference specific dates, versions, or events that may have passed?
Evergreen content — Is the content timeless?
Recommendation:
If yuque_get_doc fails (404/403), note the document as inaccessible and skip.
## 🔍 知识库健康报告
**知识库**:[知识库名称](知识库链接)
**扫描时间**:YYYY-MM-DD HH:MM
**文档总数**:X 篇
### 📊 整体健康度
| 状态 | 数量 | 占比 |
|------|------|------|
| 🟢 活跃(90 天内更新) | X | XX% |
| 🟡 老化中(90-180 天) | X | XX% |
| 🟠 陈旧(180-365 天) | X | XX% |
| 🔴 休眠(365 天以上) | X | XX% |
**健康评分**:X/100
(计算方式:🟢×100 + 🟡×70 + 🟠×30 + 🔴×0,加权平均)
### 🔴 需要关注的文档
#### 建议更新
| 文档 | 上次更新 | 原因 |
|------|----------|------|
| [文档标题](链接) | YYYY-MM-DD | [e.g., 引用了 Node 12,当前已是 Node 22] |
| [文档标题](链接) | YYYY-MM-DD | [e.g., 包含 2023 年的计划,需要更新] |
#### 建议归档
| 文档 | 上次更新 | 原因 |
|------|----------|------|
| [文档标题](链接) | YYYY-MM-DD | [e.g., 已完成的项目记录,可归档] |
#### 建议保留(虽旧但有效)
| 文档 | 上次更新 | 原因 |
|------|----------|------|
| [文档标题](链接) | YYYY-MM-DD | [e.g., 通用方法论,内容不过时] |
#### 需要人工判断
| 文档 | 上次更新 | 原因 |
|------|----------|------|
| [文档标题](链接) | YYYY-MM-DD | [e.g., 无法确定内容是否仍然有效] |
### 💡 维护建议
1. [具体建议 1:e.g., 建议每季度审查一次「技术文档」分类下的文档]
2. [具体建议 2:e.g., 考虑创建一个「归档」知识库,将不再维护的文档迁移过去]
3. [具体建议 3:e.g., 有 X 篇文档超过 2 年未更新,建议集中处理]
After presenting the report, offer:
These are suggestions only — do not take action without user confirmation.
| Situation | Action |
|---|---|
| Knowledge base not found | Inform user and suggest listing their repos with yuque_list_books |
| Knowledge base is empty | Inform user: "这个知识库目前没有文档,无需体检" |
yuque_list_docs returns error | Inform user of the issue, suggest checking the knowledge base link |
| Too many documents (100+) | Analyze metadata for all, but only deep-read the top 10 stalest documents |
| All documents are fresh | Congratulate the user: "知识库状态很健康!所有文档都在活跃维护中 🎉" |
| API rate limiting | Slow down requests, inform user if the scan takes longer than expected |