Skill file

Research Retrieval — Per-Resource Detailed Report Generator

Name: Research Retrieval — Per-Resource Detailed Report Generator
Author: YuriNakayama

Retrieves detailed information from URLs, PDFs, or keyword-based searches and generates comprehensive per-resource report files with an index. Handles academic papers (arXiv-first), patents, technical articles, and business cases. Works standalone with user-provided URLs/PDFs/keywords, or as the deep-dive phase after research-gather output. Use this skill when the user wants to "create detailed reports from these URLs", "analyze this paper in depth", "investigate these patents", "write a detailed report for each resource", "このURLの論文を詳しく調べて", "これらの特許を詳細分析して", "各リソースの詳細レポートを作って", "この論文PDFを分析して", or any request to produce per-resource detailed reports from URLs, files, or search results. Also triggers on requests like "research these papers", "survey this paper list", "サーベイして", "論文の詳細をまとめて". Use proactively whenever the user has a list of resources (from gather output or manually provided) and wants detailed analysis of each one.

YuriNakayama · 0 stars · 7 Apr 2026

Occupation
Categories: Documents

Skill content

Takes URLs, PDFs, keyword searches, or research-gather output as input, retrieves detailed information for each resource, and generates individual report files + an index file. Supports academic papers, patents, technical articles, and business cases.

Auto Mode (`--auto`)

When $ARGUMENTS contains --auto, run the entire workflow non-interactively — skip ALL AskUserQuestion calls and use the following defaults:

| Parameter | Default Value |
|-----------|---------------|
| Priority Sections | All sections equally (Core method + Results + Problem + Practical applications) |
| Detail Level | Overview level (100-200 lines per report) |
| Additional Elements | None |
| Next Action (Step 7) | Done (terminates automatically) |

In --auto mode, the remaining text in $ARGUMENTS (after removing --auto) is used as the input (file path, URLs, or keywords). For example: /research-retrieval --auto docs/research/resources-llm.md → input is the gather result file.
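
A minimal sketch of the argument handling described above, assuming `$ARGUMENTS` arrives as a single string. The function name `parse_arguments` and the `AUTO_DEFAULTS` dictionary are illustrative and not part of the skill itself.

```python
import shlex

# Defaults from the Auto Mode table; the dictionary keys are illustrative names.
AUTO_DEFAULTS = {
    "priority_sections": "all",
    "detail_level": "overview",
    "additional_elements": [],
    "next_action": "done",
}


def parse_arguments(raw: str) -> tuple[bool, str]:
    """Split a raw $ARGUMENTS string into (auto_mode, remaining_input)."""
    tokens = shlex.split(raw)
    auto = "--auto" in tokens
    # Everything except the flag is treated as the input (path, URLs, or keywords).
    rest = [t for t in tokens if t != "--auto"]
    return auto, " ".join(rest)


auto, user_input = parse_arguments("--auto docs/research/resources-llm.md")
settings = dict(AUTO_DEFAULTS) if auto else {}  # interactive mode would ask instead
print(auto, user_input)
```

Filtering the flag out of the token list, rather than stripping a prefix, lets `--auto` appear anywhere in the arguments.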

research-clustering → research-gather → research-retrieval
(domain mapping)      (resource lists)   (detailed reports)

| Type | Detection signals |
|------|-------------------|
| Academic Paper | arXiv URL, DOI link, `.pdf` from academic domain, gather output marked as paper |
| Patent | Patent number format, patents.google.com URL, gather output marked as patent |
| Technical Article | Blog URLs, GitHub repos, conference talk links, technical documentation |
| Business Case | Company case study URLs, market report links, press releases |
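
The detection signals above could be approximated with a few URL and pattern checks. This is a rough sketch under stated assumptions: `classify_resource`, the patent-number regex, the academic suffixes, and the business-path hints are all hypothetical, and gather-output labels are not handled.

```python
import re
from urllib.parse import urlparse

# Hypothetical pattern for numbers such as "US1234567B2"; real formats vary.
PATENT_NUMBER = re.compile(r"^[A-Z]{2}\d{5,}[A-Z]?\d?$")

# Illustrative hints only; a real classifier would need broader lists.
ACADEMIC_SUFFIXES = (".edu", ".ac.jp", ".ac.uk")
BUSINESS_HINTS = ("case-study", "case-studies", "press-release")


def _host_is(host: str, domain: str) -> bool:
    """True when host equals the domain or is one of its subdomains."""
    return host == domain or host.endswith("." + domain)


def classify_resource(resource: str) -> str:
    """Return one of: 'paper', 'patent', 'technical', 'business'."""
    text = resource.strip()
    if PATENT_NUMBER.match(text.replace(" ", "").replace(",", "")):
        return "patent"
    parsed = urlparse(text)
    host, path = parsed.netloc.lower(), parsed.path.lower()
    if _host_is(host, "patents.google.com"):
        return "patent"
    if _host_is(host, "arxiv.org") or _host_is(host, "doi.org"):
        return "paper"
    if path.endswith(".pdf") and host.endswith(ACADEMIC_SUFFIXES):
        return "paper"
    if any(hint in path for hint in BUSINESS_HINTS):
        return "business"
    # Blogs, GitHub repos, talks, and documentation fall through to technical.
    return "technical"
```

Checking patents and papers before the generic fallback keeps the most specific signals first. Ambiguous inputs would still need the gather labels or a user prompt.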

AskUserQuestion:
  question: "Which sections should be emphasized in each report? (multiple selection)"
  header: "Priority Sections"
  multiSelect: true
  options:
    - label: "Core method/technology details"
      description: "Algorithm steps, technical architecture, formulas, key innovations in detail"
    - label: "Results & evaluation"
      description: "Experimental results, performance metrics, comparison with alternatives"
    - label: "Problem & motivation"
      description: "What problem is being solved, why it matters, background context"
    - label: "Practical applications"
      description: "Real-world use cases, implementation considerations, applicability conditions"

AskUserQuestion:
  question: "What detail level should each report have?"
  header: "Detail Level"
  multiSelect: false
  options:
    - label: "Overview level (recommended)"
      description: "100-200 lines per report. Concise summary of key points"
    - label: "Detailed level"
      description: "200-400 lines per report. In-depth analysis with formulas/architecture"
    - label: "Brief level"
      description: "50-100 lines per report. Minimal: summary, key points, and links only"
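
The line ranges in the Detail Level options lend themselves to a simple length check after generation. A minimal sketch, assuming reports are plain text; `DETAIL_LEVELS` and `check_length` are illustrative names, not part of the skill.

```python
# Line ranges taken from the Detail Level options; the keys are illustrative names.
DETAIL_LEVELS = {
    "brief": (50, 100),
    "overview": (100, 200),
    "detailed": (200, 400),
}


def check_length(report_text: str, level: str = "overview") -> str:
    """Return 'short', 'ok', or 'long' for a report at the given detail level."""
    low, high = DETAIL_LEVELS[level]
    line_count = len(report_text.splitlines())
    if line_count < low:
        return "short"
    if line_count > high:
        return "long"
    return "ok"
```

The default of `"overview"` mirrors the recommended option and the `--auto` default.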

AskUserQuestion:
  question: "Would you like to include any additional elements? (multiple selection)"
  header: "Additional Elements"
  multiSelect: true
  options:
    - label: "Cross-resource relationship map"
      description: "Show relationships, citations, or evolution between resources in index.md"
    - label: "Comparison table"
      description: "Add a feature/method comparison table to index.md"
    - label: "Further investigation candidates"
      description: "List related resources worth investigating from references/citations"
    - label: "None"
      description: "Basic structure only"

uv run --with pymupdf python <skill-dir>/scripts/extract_figures.py <pdf_path> <output_dir>/figures/ [--min-size 150]

![Figure 1: DPO overview diagram](https://arxiv.org/html/2305.18290v3/figures/diagrams/teaser.png)
![Figure 2: Reward-KL frontier](https://arxiv.org/html/2305.18290v3/x1.png)

# {Paper title}

- **Link**: {URL}
- **Authors**: {Author list}
- **Year**: {Publication year}
- **Venue**: {Journal/Conference name}
- **Type**: Academic Paper

## Abstract

{Original English abstract text}

## Abstract (Japanese Translation)

{Japanese translation of the abstract. Preserve the original meaning accurately while writing natural Japanese}

## Overview

{Overview of the paper. Not a mere repetition of the Abstract — organize and present the key points of the entire paper}

## Problem

{Problems the paper aims to solve, in list format}

- **{Problem name}**: {Description}

## Proposed Method

**{Method name}**

{Description of the method:}

- Core idea
- Main algorithm steps (decompose into numbered steps)
- Differences from existing methods

**Key Formulas**:

{Include the core mathematical formulations. For example:}

$$L(\theta) = \sum_{i=1}^{N} \ell(y_i, f(x_i; \theta)) + \lambda \|\theta\|_2^2$$

{Explain each variable and what the formula represents.}

**Features**:

- {Feature 1}
- {Feature 2}

## Algorithm (Pseudocode)

{Always include pseudocode for algorithmic methods.
Present in pseudocode format with explanations for each step.}


## Architecture / Process Flow

{Create an ASCII or Mermaid diagram showing the overall architecture, data flow, or process stages.}


## Figures & Tables

{THIS SECTION IS MANDATORY AND MUST NOT BE SKIPPED. Include at minimum 4 visual elements.

Reproduce every significant table and figure from the source. If the paper has 5 tables, include all 5.
If the source has architecture diagrams, recreate them. The reader should never need to open the original.

1. **Main results table** — reproduce the primary experimental results with exact numbers:

| Method | Dataset A | Dataset B | Dataset C |
|--------|-----------|-----------|-----------|
| Proposed | 95.2 | 87.1 | 92.3 |
| Baseline 1 | 91.4 | 83.2 | 88.7 |
| Baseline 2 | 89.8 | 81.5 | 86.4 |

2. **Architecture / system diagram** — Mermaid or ASCII art:

```mermaid
graph LR
    A[Input] --> B[Module A]
    B --> C[Module B]
    C --> D[Output]
    B --> E[Side Process]
    E --> C
```

#### 5b: Patent Report Template

# {Patent title}

- **Patent Number**: {number}
- **Assignee**: {assignee/applicant}
- **Inventors**: {inventor list}
- **Filing Date**: {date}
- **Grant Date**: {date, if granted}
- **Classification**: {IPC/CPC codes}
- **Link**: {URL}
- **Type**: Patent

## Abstract

{Patent abstract — original language}

## Abstract (Japanese Translation)

{Japanese translation if the original is not in Japanese}

## Overview

{Concise overview of the invention and its significance}

## Technical Problem

{What technical problem does this patent address?}

## Technical Solution

{Core technical approach of the invention. Decompose into numbered steps:}

### Process Steps

1. **Step 1**: {Description} → {Output}
2. **Step 2**: {Description} → {Output}
3. **Step 3**: {Description} → {Output}

### Key Innovation

- What is novel compared to prior art
- Technical advantage gained

### Mathematical Relationships

{If the patent includes any mathematical formulas or relationships in the claims or description:}

$$\text{formula from patent}$$

## Key Claims

{Summarize the independent claims. Decompose complex claims into sub-elements:}

### Claim 1 (Main)

{Summary of the main independent claim, broken into elements:}

- **Element a)**: {description}
- **Element b)**: {description}
- **Element c)**: {description}

### Claim N

{Other notable independent claims, similarly decomposed}

## Process Flow Diagram

{Mandatory. Recreate the patent's process as an ASCII or Mermaid flow diagram:}


## Figures & Tables

{Mandatory. Include at minimum 2 visual elements:

1. **Claims structure table**:

| Claim # | Type | Depends On | Key Element |
|---------|------|-----------|-------------|
| 1 | Independent | — | {main invention} |
| 2 | Dependent | 1 | {specific feature} |

2. **Prior art comparison table**:

| Feature | This Patent | Prior Art A | Prior Art B |
|---------|------------|-------------|-------------|
| {feature} | {value} | {value} | {value} |

3. **System architecture diagram** — recreate from patent figures}

## Patent Family & Related Art

{Related patents, patent family members, cited prior art — to the extent available}

## Notes

{Commercial significance, licensing status, potential applications}

# {Article title}

- **Source**: {Author/Organization}
- **Date**: {Publication date}
- **Link**: {URL}
- **Type**: Technical Article / OSS Project / Conference Talk

## Overview

{Concise overview of the article's key message and significance}

## Key Technical Content

{Main technical insights, architecture decisions, or implementation details.
Decompose into structured subsections:}

### Architecture / Design

{Describe the system architecture or design. Include a diagram:}


### Implementation Steps

{Break down the implementation or approach into numbered steps:}

1. **Step 1**: {description}
2. **Step 2**: {description}
3. **Step 3**: {description}

### Key Formulas / Metrics

{If the article includes any quantitative analysis or formulas:}

| Metric | Value | Context |
|--------|-------|---------|
| {metric} | {value} | {what it means} |

## Figures & Tables

{Mandatory. Include at minimum 2 visual elements:
- Architecture diagram (ASCII/Mermaid)
- Performance or comparison table
- Timeline or process flow diagram}

## Practical Takeaways

{What can a practitioner learn and apply from this? Structure as actionable items:}

1. {Takeaway 1}
2. {Takeaway 2}
3. {Takeaway 3}

## Notes

{Community reception, follow-up work, related resources}

# {Case title}

- **Company/Organization**: {name}
- **Industry**: {industry}
- **Date**: {date}
- **Link**: {URL}
- **Type**: Business Case

## Overview

{Concise overview of the case and its significance}

## Problem & Context

{Business problem being addressed, market context}

## Solution

{Technology/approach implemented:}

- What was built or adopted
- Key technical decisions
- Integration approach

## Results & Impact

{Quantitative and qualitative results:}

| Metric | Before | After |
|--------|--------|-------|
| {metric} | {value} | {value} |

## Figures & Tables

{Mandatory. Visualize key results, architecture, or timeline.}

## Lessons Learned

{Key takeaways, challenges encountered, recommendations}

## Notes

{Follow-up developments, related cases, broader implications}

# {Research Theme} — Detailed Reports

## Parameters

- **Resources analyzed**: {total count}
- **Resource types**: {Paper / Patent / Technical / Business}
- **Generated on**: {YYYY-MM-DD}
- **Input source**: {gather output / user URLs / PDF files / keyword search}

## Report List

### Academic Papers

| # | Title | Year | Venue | Summary | Report |
|---|-------|------|-------|---------|--------|
| 1 | {title} | {year} | {venue} | {one-line} | [Details](01-xxx.md) |

### Patents

| # | Title | Patent No. | Assignee | Year | Report |
|---|-------|-----------|----------|------|--------|
| 1 | {title} | {number} | {assignee} | {year} | [Details](02-xxx.md) |

### Technical Articles

| # | Title | Source | Date | Report |
|---|-------|--------|------|--------|
| 1 | {title} | {source} | {date} | [Details](03-xxx.md) |

### Business Cases

| # | Title | Company | Date | Report |
|---|-------|---------|------|--------|
| 1 | {title} | {company} | {date} | [Details](04-xxx.md) |

{Include only the sections relevant to the resources analyzed. Omit empty sections.}

## Cross-Resource Insights

{If "Cross-resource relationship map" was selected: show relationships, common themes, or evolution across resources.}

## Comparison Table

{If "Comparison table" was selected: comparative table of methods, approaches, or solutions.}

## Further Investigation Candidates

{If selected: list of related resources worth investigating, discovered during retrieval.}
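
Rows of the Report List tables can be produced mechanically once each report has a filename. The sketch below assumes the `NN-kebab-case-name.md` naming pattern and ASCII titles; `report_filename` and `paper_row` are hypothetical helpers, and the sample values are placeholders.

```python
import re


def report_filename(position: int, title: str) -> str:
    """Build an NN-kebab-case-name.md filename from a 1-based position and a title."""
    # Non-ASCII titles would collapse to an empty slug; handling that is left out.
    slug = re.sub(r"[^a-z0-9]+", "-", title.lower()).strip("-")
    return f"{position:02d}-{slug}.md"


def paper_row(position: int, title: str, year: int, venue: str, summary: str) -> str:
    """Render one row of the Academic Papers table in index.md."""
    link = f"[Details]({report_filename(position, title)})"
    return f"| {position} | {title} | {year} | {venue} | {summary} | {link} |"


print(paper_row(1, "Example Paper Title", 2024, "Example Venue", "One-line summary"))
```

Deriving the link from the same function that names the file keeps index.md and the report files in sync.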

AskUserQuestion:
  question: "Reports are complete. What would you like to do next?"
  header: "Next Action"
  multiSelect: false
  options:
    - label: "Done"
      description: "Finalize the current output"
    - label: "Revise specific reports"
      description: "Revise or enhance specific report files"
    - label: "Change structure/detail level"
      description: "Adjust section structure or detail level and regenerate"
    - label: "Investigate additional resources"
      description: "Find and analyze additional related resources"

Identify the domain (<domain>, snake_case):
- If the input is a gather/clustering file under docs/research/runs/<domain>/..., use that <domain>.
- Otherwise infer from context or ask the user.
Identify the cluster (<cluster>):
- From the gather file's cluster context (e.g., metalearner, nl2sql-nl2code).
- If retrieving for a free-form URL/PDF list with no cluster context, use all.
If docs/research/domains/<domain>/domain.yaml defines output_paths.retrieval, use it.
Otherwise use the default path:
```
docs/research/runs/<domain>/retrieval/<YYYYMMDD>_<cluster>/
```
- Place index.md and per-resource files (NN-kebab-case-name.md) inside this directory.
Never write directly under docs/research/domains/<domain>/reports/ — that layer is symlinks.
Never overwrite previous retrieval runs — append-only.
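
The path resolution rules above can be sketched as follows. This assumes that `output_paths.retrieval` has already been read from `domain.yaml` by the caller and is used verbatim; `resolve_output_dir` and `prepare_run_dir` are illustrative names.

```python
from datetime import date
from pathlib import Path
from typing import Optional


def resolve_output_dir(domain: str, cluster: str = "all",
                       configured: Optional[str] = None,
                       today: Optional[date] = None) -> Path:
    """Return the retrieval output directory for one run.

    `configured` stands in for output_paths.retrieval from domain.yaml;
    how that file is loaded is outside this sketch.
    """
    if configured:
        return Path(configured)
    stamp = (today or date.today()).strftime("%Y%m%d")
    return Path("docs/research/runs") / domain / "retrieval" / f"{stamp}_{cluster}"


def prepare_run_dir(path: Path) -> Path:
    """Create the run directory, refusing to reuse an existing one (append-only)."""
    path.mkdir(parents=True, exist_ok=False)  # raises FileExistsError on a rerun
    return path
```

Raising on an existing directory is one way to honor the append-only rule. How a second run on the same day should be named is not specified in the source.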

ln -snf <YYYYMMDD>_<cluster> docs/research/runs/<domain>/retrieval/latest_<cluster>
ln -snf ../../../runs/<domain>/retrieval/latest_<cluster> docs/research/domains/<domain>/reports/<cluster>
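
For callers that prefer not to shell out, the two `ln -snf` commands can be approximated in Python. This is a rough equivalent under assumptions: it creates missing parent directories, which `ln` does not, and `force_symlink`, `update_links`, and the `root` parameter are hypothetical names.

```python
import os
from pathlib import Path


def force_symlink(target: str, link: Path) -> None:
    """Rough equivalent of `ln -snf target link`."""
    link.parent.mkdir(parents=True, exist_ok=True)
    tmp = link.with_name(link.name + ".tmp")
    if tmp.is_symlink() or tmp.exists():
        tmp.unlink()
    os.symlink(target, tmp)
    os.replace(tmp, link)  # renames over an old link without following it


def update_links(root: Path, domain: str, cluster: str, run_name: str) -> None:
    """Point latest_<cluster> at the new run and expose it under reports/."""
    retrieval = root / "docs/research/runs" / domain / "retrieval"
    force_symlink(run_name, retrieval / f"latest_{cluster}")
    reports = root / "docs/research/domains" / domain / "reports"
    force_symlink(f"../../../runs/{domain}/retrieval/latest_{cluster}",
                  reports / cluster)
```

Both targets are relative, as in the shell commands, so the links keep working if the repository is moved.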

| Configuration | Metric | Delta |
|---------------|--------|-------|
| Full model | 95.2 | — |
| w/o Component A | 92.1 | -3.1 |
| w/o Component B | 93.5 | -1.7 |

Research Retrieval — Per-Resource Detailed Report Generator

Auto Mode (--auto)

Pipeline Position

Report Quality Principles — Visual Richness

Figures and Tables — The Most Important Part

Other Structural Elements

Workflow

Step 1: Parse Input

Step 2: Resource Type Classification

Step 3: User Hearing

Hearing 1: Priority Sections

Hearing 2: Detail Level

Hearing 3: Additional Elements

Step 4: Information Retrieval

Figure & Table Acquisition Strategy

Method 1: PDF Figure Extraction (for PDF input)

Method 2: HTML Image Reference (for URL input with HTML version)

Method 3: Text-based Recreation (fallback)

4a: Academic Papers (arXiv-first)

4b: Patents

4c: Technical Articles

4d: Business Cases

Step 5: Generate Report Files

5a: Academic Paper Report Template

Experiments & Evaluation

Setup

Main Results

Ablation Study

Notes

5c: Technical Article Report Template

5d: Business Case Report Template

Step 6: Generate Index File

Step 7: Output Confirmation

Output Location

Path resolution

After writing

Filename conventions

Parallel Processing

Integration with Other Skills

Language
