Identifies research domains from text or keywords, clusters them into structured sub-areas, and generates an overview, keywords, and research strategy for each cluster. Use this skill for the initial "mapping" phase of any research project — patent searches, literature surveys, technology trend analysis, business case studies. Trigger on requests like "organize research areas", "classify research fields", "identify surrounding domains for this technology", "plan a survey", "decide patent search strategy", "literature mapping", "research landscape". Also triggers on Japanese equivalents like "調査領域を整理して", "研究分野を分類して", "サーベイの計画を立てて", "特許調査の方針を決めて". Use proactively whenever the user wants to structure research targets from keywords or text, understand the big picture before diving into papers/patents/technology, or split a broad field into prioritized sub-areas.
Takes text or keyword groups as input, identifies the academic/technical domains worth investigating, clusters (partitions) them, and outputs an overview, keywords, and research strategy for each cluster. This skill handles the "map-making" phase of a research project.
--auto: When $ARGUMENTS contains --auto, run the entire workflow non-interactively — skip ALL AskUserQuestion calls and use the following defaults:
| Parameter | Default Value |
|---|---|
| Research Type | Academic Paper Survey |
| Time Range | Last 4 years |
| Search Languages | English + Japanese |
| Output Granularity | Standard |
| Next Action (Step 7) | Done (finish automatically) |
In --auto mode, the remaining text in $ARGUMENTS (after removing --auto) is used as the research theme input. For example: /research-clustering --auto LLM agent orchestration → theme is "LLM agent orchestration".
If $ARGUMENTS does NOT contain --auto, proceed with the normal interactive workflow below.
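The flag handling above can be sketched as follows. This is an illustrative sketch, not part of the skill itself; the variable names (ARGUMENTS, AUTO, THEME) are placeholders for however the implementation carries these values.

```shell
# Hypothetical sketch of --auto detection and theme extraction.
ARGUMENTS="--auto LLM agent orchestration"

if [ "${ARGUMENTS#--auto}" != "$ARGUMENTS" ]; then
  # The flag is present: run non-interactively with the defaults table.
  AUTO=1
  # Strip the flag and the whitespace after it to recover the theme.
  THEME=$(printf '%s' "$ARGUMENTS" | sed 's/--auto[[:space:]]*//')
else
  AUTO=0
  THEME="$ARGUMENTS"
fi

echo "auto=$AUTO theme=$THEME"   # → auto=1 theme=LLM agent orchestration
```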
Analyze the user's input and extract research seeds.
Accepted input formats (support all):
Elements to extract:
Before interviewing the user, run a lightweight web search on the input keywords to form a hypothesis about the research landscape. The goal is to improve the quality of the interview — instead of asking vague questions like "what do you want to research?", present concrete options like "here is the field structure I see — which direction should we go deeper?"
Preliminary search (2–3 queries, keep it lightweight):
- "{core keyword} survey" or "{core keyword} overview" to grasp the overall landscape
- "{core keyword} applications" or "{core keyword} use cases" to check application areas

Information to extract from the preliminary scan:
Present the hypothesis: Summarize the preliminary scan results concisely and present them during the interview as: "Based on a quick scan, the field appears to be structured as follows..." This lets the user compare against their own research intent and give more precise direction.
--auto mode: Skip this entire step. Use the default values from the Auto Mode table above.
Confirm research parameters via AskUserQuestion, informed by the preliminary scan results. Reflect scan-derived hypotheses in default values and option descriptions to reduce user burden.
AskUserQuestion:
question: "What type of research would you like to conduct? (multiple selection)"
header: "Research Type"
multiSelect: true
options:
- label: "Academic Paper Survey"
description: "Focus on academic papers (arXiv, IEEE, ACM, etc.)"
- label: "Patent Search"
description: "Focus on patent literature (USPTO, EPO, JPO, etc.)"
- label: "Technology Trend Analysis"
description: "Tech blogs, conference talks, OSS projects, etc."
- label: "Business Case Study"
description: "Enterprise adoption cases, market reports, industry trends"
AskUserQuestion:
question: "What time range should the research cover?"
header: "Time Range"
multiSelect: false
options:
- label: "Last 4 years (default)"
description: "Results from 2022 to present"
- label: "Last 2 years"
description: "Focus on the latest trends only"
- label: "Last 7 years"
description: "Broader coverage"
- label: "Custom range"
description: "Specify a custom time range"
If "Custom range" is selected, ask a follow-up AskUserQuestion for the specific years.
AskUserQuestion:
question: "Which languages should be used for web searches? (multiple selection)"
header: "Search Languages"
multiSelect: true
options:
- label: "English"
description: "Search in English (covers most academic and international sources)"
- label: "Japanese"
description: "Search in Japanese (useful for JPO patents, domestic cases, Japanese papers)"
- label: "Chinese"
description: "Search in Chinese (useful for Chinese patents, CNKI papers)"
- label: "Other"
description: "Specify additional languages"
If "Other" is selected, ask a follow-up AskUserQuestion for the specific languages. Pre-select defaults based on the input: if the input contains Japanese text, default to English + Japanese; otherwise default to English only.
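The pre-selection rule above can be sketched as a coarse byte-level heuristic. This is an assumption about one possible implementation, not part of the skill: any code point in roughly U+3000–U+9FFF (kana plus common CJK) encodes in UTF-8 with a lead byte in 0xE3–0xE9, so checking for such bytes in the C locale detects Japanese (and, by design of the range, other CJK) text.

```shell
# Hypothetical helper: pick default search languages from the input text.
# Matches UTF-8 lead bytes 0xE3-0xE9 (kana and common CJK) as a heuristic.
default_langs() {
  if printf '%s' "$1" | LC_ALL=C grep -q $'[\xe3-\xe9]'; then
    echo "English + Japanese"
  else
    echo "English"
  fi
}

default_langs "LLMエージェントの調査"    # → English + Japanese
default_langs "LLM agent orchestration"  # → English
```

Note this is deliberately coarse: it also fires on Chinese text, which is acceptable here because the user can still override the pre-selection in the question.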
AskUserQuestion:
question: "What level of detail do you want in the output?"
header: "Output Granularity"
multiSelect: false
options:
- label: "Standard (recommended)"
description: "Domain partitioning + overview, keywords, and research strategy per cluster"
- label: "Detailed"
description: "Standard + representative papers/patents/cases per cluster"
- label: "Overview only"
description: "Domain partitioning and keywords only — quick big-picture view"
Based on input keywords and confirmed parameters, use WebSearch to investigate related academic/technical domains.
Search strategy:
Initial search: Broad search using core keyword combinations
- "{keyword1} {keyword2} survey" / "{keyword1} {keyword2} review"
- "{keyword} taxonomy" / "{keyword} classification"

For academic paper surveys, prioritize review/survey papers:
- "{topic} survey paper {year}", "{topic} systematic review", "{topic} literature review"

Expansion search: Discover related fields from initial results
For patent searches: Also investigate IPC/CPC classification codes for technology classification reference
Compile a comprehensive list of candidate academic/technical domains related to the input theme.
Partition the research targets into meaningful clusters based on information collected in Step 3.
Partitioning principles:
Partitioning perspectives (select based on research type):
For each cluster, produce:
For academic paper surveys, list the survey/review papers found in Step 3 as "seed resources" for the relevant cluster. Also show how the survey paper's taxonomy maps to the cluster structure.
Select the output format based on the research scale:
Consolidate everything into one Markdown file.
{output-dir}/
index.md # Overall overview + cluster list
cluster-01-xxx.md # Detailed info for each cluster
cluster-02-xxx.md
...
# {Research Theme}
## Research Parameters
- **Research type**: {Academic Paper Survey / Patent Search / ...}
- **Time range**: {YYYY – YYYY}
- **Generated on**: {YYYY-MM-DD}
- **Input keywords**: {original keyword list}
## Big Picture
{3–5 sentences on the overall positioning of the research theme, current state of the field, and major trends}
## Reference Survey/Review Papers
{For academic paper surveys. List discovered survey papers. Note that their taxonomies informed the domain partitioning}
| Title | Year | Summary | Link |
|-------|------|---------|------|
| {title} | {year} | {one-line summary} | {url} |
## Domain Map
{Conceptual diagram showing inter-cluster relationships. Use ASCII art or Mermaid notation}
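For instance, a minimal Mermaid sketch of such a domain map (the cluster names and relationships are placeholders, not prescribed content):

```mermaid
graph TD
  A[Cluster 1: Core methods] --> C[Cluster 3: Applications]
  B[Cluster 2: Infrastructure] --> C
  A -. shared benchmarks .-> B
```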
## Cluster Summary
| # | Cluster Name | Keyword Count | Summary |
|---|-------------|---------------|---------|
| 1 | {name} | {n} | {one-line summary} |
## Cluster Details
### Cluster 1: {Cluster Name}
**Overview**: {2–5 sentence description}
**Keywords**:
`keyword1`, `keyword2`, `keyword3`, ...
**Research Strategy**:
- {Recommended search queries and information sources}
- {Notable research groups or companies}
- {Recommended reading order from survey papers}
**Representative Resources** (detailed granularity only):
| Title | Type | Year | Summary |
|-------|------|------|---------|
| {title} | Paper/Patent/Case | {year} | {summary} |
---
{Repeat for each cluster}
--auto mode: Skip this step entirely. Treat the result as "Done" and finish.
After output is complete, confirm the next action via AskUserQuestion.
AskUserQuestion:
question: "The domain map is complete. What would you like to do next?"
header: "Next Action"
multiSelect: false
options:
- label: "Done"
description: "Finalize the current output"
- label: "Adjust clusters"
description: "Change partitioning granularity or classification axes and regenerate"
- label: "Deep-dive into a specific cluster"
description: "Investigate papers/patents for a selected cluster in detail"
- label: "Generate research prompts"
description: "Generate research prompt files for each cluster (integrates with research-prompt-builder)"
If "Deep-dive into a specific cluster" is selected, show a follow-up AskUserQuestion with cluster names as options. For deep-dive research, also suggest integrating with the research-retrieval skill to generate detailed paper lists per cluster.
MUST READ FIRST: Before deciding the output path, read docs/research/README.md (the single source of truth for the research directory layout) and .claude/rules/research.md.
Identify the domain (<domain>, snake_case):
Output paths follow docs/research/(runs|domains)/<domain>/... If docs/research/domains/<domain>/domain.yaml exists and defines output_paths.clustering, use it.
Otherwise use the default path:
docs/research/runs/<domain>/clustering/<YYYYMMDD>/
Write index.md and cluster-NN-*.md files inside this directory. <YYYYMMDD> is today's date (UTC or JST is acceptable, use whichever the pipeline uses). Never write directly under docs/research/domains/<domain>/ — that layer is composed of symlinks pointing into runs/.
Never overwrite or modify existing files under runs/<domain>/clustering/<old_date>/ — clustering is append-only. Re-clustering creates a NEW dated directory.
After the new clustering run is written, update (or instruct the pipeline/user to update) the latest pointer:
ln -snf <YYYYMMDD> docs/research/runs/<domain>/clustering/latest
ln -snf ../../runs/<domain>/clustering/latest docs/research/domains/<domain>/clustering
If the domains/<domain>/ directory does not yet exist, create it (it is just a view layer).
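The full output-path sequence above can be sketched end to end. The domain name llm_agents is a hypothetical example; the layout and the two ln -snf commands come from the rules above.

```shell
# Sketch of one clustering run under the append-only layout (example domain).
domain=llm_agents
date=$(date +%Y%m%d)
run_dir="docs/research/runs/$domain/clustering/$date"

mkdir -p "$run_dir"          # always a NEW dated directory, never reuse one
: > "$run_dir/index.md"      # index.md and cluster-NN-*.md go in here

# Update the "latest" pointer inside runs/.
ln -snf "$date" "docs/research/runs/$domain/clustering/latest"

# Create the view layer if missing, then point it into runs/.
mkdir -p "docs/research/domains/$domain"
ln -snf "../../runs/$domain/clustering/latest" \
  "docs/research/domains/$domain/clustering"
```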
Run multiple web search queries in parallel for efficiency. Use the Agent tool to spawn subagents for concurrent searches.