Multi-dimensional screening of research ideas: novelty check + venue reviewer simulation + strategic fit assessment. Use when user says "screen ideas", "evaluate ideas", "review ideas", "novelty check", "查新", "筛选idea", or wants to rank and filter research ideas before committing to execution.
Screen and rank research ideas: $ARGUMENTS
Constants:

- `REVIEWER_MODEL` = `gpt-5.4` — Model used via Codex MCP. Must be an OpenAI model (e.g., gpt-5.4, o3, gpt-4o).
- `DEFAULT_VENUE` = `ICML` — Default target venue when none is specified.
- `COMPOSITE_WEIGHTS` = `{novelty: 0.25, venue: 0.35, strategic: 0.20, feasibility: 0.20}` — Weights for the composite score. Overridable via `--weights:`.
- `PROCEED_THRESHOLD` = `7.0` — Composite score at or above this triggers a PROCEED recommendation.
- `CAUTION_THRESHOLD` = `5.0` — Composite score between this and `PROCEED_THRESHOLD` triggers PROCEED WITH CAUTION. Below this triggers ABANDON.

This skill combines three evaluation modules to screen research ideas before the researcher commits time and resources to execution. Each idea passes through all three modules, producing a composite score and a ranked recommendation.
Module A is adapted from the ARIS novelty-check skill: it systematically verifies whether each idea's core claims are genuinely novel against recent literature. The final output is a ranked list of ideas with per-idea breakdowns, composite scores, and actionable recommendations.
Inputs:

- `$ARGUMENTS` — Either:
  - a path to an ideas file (e.g., `outputs/IDEAS_FILTERED.md` from `/idea-gen`), or
  - inline idea text.
- `--venue:` directive — Target venue for Module B. Valid values: `ICML`, `VLDB`, `NeurIPS`, `all`. Default: `ICML`.
- `--weights:` directive — Override composite weights (e.g., `--weights: novelty=0.3, venue=0.3, strategic=0.2, feasibility=0.2`).
- `outputs/LANDSCAPE.json` — If available from a prior `/lit-survey` run, read it for novelty cross-referencing. If not found, skip silently.

Setup:

1. If `$ARGUMENTS` points to a file path, read and parse that file. Extract each idea as a separate entity (look for headings, numbered lists, or `### Idea N:` patterns).
2. If `$ARGUMENTS` is inline text, treat each paragraph or clearly delimited section as a separate idea.
3. Parse `--venue:` from the arguments. If absent, set `TARGET_VENUE = DEFAULT_VENUE`.
4. Parse `--weights:` from the arguments. If absent, use `COMPOSITE_WEIGHTS`.
5. Check for `outputs/LANDSCAPE.json`. If found, load the paper list for cross-referencing in Module A.

**Module A: Novelty Check.** Adapted from the ARIS novelty-check skill. For EACH idea, execute Phases A through D.
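The directive parsing in the setup steps can be sketched as a minimal helper. This is illustrative only: `parse_directives` is a hypothetical name, and the directive syntax is assumed to follow the examples in this document.

```python
import re

# Defaults mirroring the skill's constants (DEFAULT_VENUE, COMPOSITE_WEIGHTS).
DEFAULT_VENUE = "ICML"
DEFAULT_WEIGHTS = {"novelty": 0.25, "venue": 0.35, "strategic": 0.20, "feasibility": 0.20}

def parse_directives(arguments: str):
    """Extract --venue: and --weights: directives from $ARGUMENTS (hypothetical helper)."""
    venue_match = re.search(r"--\s*venue:\s*(\w+)", arguments)
    venue = venue_match.group(1) if venue_match else DEFAULT_VENUE

    weights = dict(DEFAULT_WEIGHTS)
    weights_match = re.search(r"--\s*weights:\s*([\w=. ,]+)", arguments)
    if weights_match:
        # e.g. "novelty=0.3, venue=0.3, strategic=0.2, feasibility=0.2"
        for pair in weights_match.group(1).split(","):
            key, _, value = pair.strip().partition("=")
            if key in weights and value:
                weights[key] = float(value)
    return venue, weights
```

Unrecognized weight keys are ignored rather than rejected; a production version would validate that the weights sum to 1.0.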
For EACH core claim, search using ALL available sources:
- Web search (via `WebSearch`):
- Cross-reference `LANDSCAPE.json`: If `outputs/LANDSCAPE.json` was loaded, check whether any papers already found in Stage 1 (lit-survey) overlap with the current claim. Skip re-fetching those — use the cached metadata. Flag any overlapping papers for detailed comparison.
- For each flagged paper, use `WebFetch` to retrieve the abstract and (where possible) the related work or introduction section. This is critical for determining whether the overlap is superficial or fundamental.

Search failure handling:
Call REVIEWER_MODEL via Codex MCP (mcp__codex__codex) with xhigh reasoning effort:
```yaml
mcp__codex__codex:
  model: REVIEWER_MODEL
  config: {"model_reasoning_effort": "xhigh"}
  prompt: |
    I need to verify the novelty of a research idea.

    Proposed idea: [IDEA DESCRIPTION]

    Papers found that may overlap:
    [LIST EACH PAPER: title, authors, year, venue, abstract summary]

    Core claims to verify:
    [LIST EACH CLAIM]

    For each core claim, answer THREE questions:
    1. Has this EXACT mechanism been published? (cite specific paper + section if yes)
    2. Has a CLOSELY RELATED mechanism been published that achieves the same goal through a different path? (cite + explain degree of overlap)
    3. Would a reviewer at [TARGET_VENUE] consider this sufficiently novel? (yes/no + reasoning)

    Overall novelty assessment:
    - Score: 0-10 (where 10 = completely unprecedented, 0 = already published verbatim)
    - Recommendation: PROCEED / PROCEED WITH CAUTION / ABANDON
    - Key differentiator (what, if anything, makes this unique)
    - Suggested positioning to maximize novelty perception at the target venue
```
Produce the following structured report for each idea:
### Novelty: [Idea Title]
- **Score**: X/10
- **Recommendation**: PROCEED / PROCEED WITH CAUTION / ABANDON
- **Core Claims**:
1. [Claim 1] — Novelty: HIGH/MEDIUM/LOW — Closest: [paper title, year]
2. [Claim 2] — Novelty: HIGH/MEDIUM/LOW — Closest: [paper title, year]
3. [Claim 3] — Novelty: HIGH/MEDIUM/LOW — Closest: [paper title, year]
- **Closest Prior Work**:
| Paper | Year | Venue | Overlap | Key Difference |
|-------|------|-------|---------|----------------|
| ... | ... | ... | ... | ... |
- **Key differentiator**: [what makes this unique, if anything]
- **Suggested positioning**: [how to frame the contribution to maximize novelty perception]
This is the key innovation of the screening skill. It adapts the ICML/VLDB multi-reviewer prompt system to evaluate ideas (not finished papers), answering the core question: "If this idea is executed correctly, would the resulting paper be accepted at the target venue?"
Read the venue profile from `venue-profiles/{VENUE}.md` (e.g., `venue-profiles/ICML.md`). If the file does not exist, fall back to the generic profile below.

- `--venue: ICML` → use the ICML profile.
- `--venue: VLDB` → use the VLDB profile.
- `--venue: NeurIPS` → use the NeurIPS profile.
- `--venue: all` → run against ALL available venue profiles and produce comparative results.
- No `--venue:` directive → use `DEFAULT_VENUE` (ICML).

If no venue profile file is found, use this generic "top ML venue" profile:
Calibration Tiers:
Reviewer Profiles:
Verdict Options: Strong Reject, Reject, Weak Reject, Weak Accept, Accept, Strong Accept
For EACH idea, call the external LLM:
```yaml
mcp__codex__codex:
  model: REVIEWER_MODEL
  config: {"model_reasoning_effort": "xhigh"}
  prompt: |
    You are simulating the review committee of [VENUE_NAME] ([VENUE_FULL_NAME]).
    You are reviewing a **research idea** (not a finished paper). The core question is:
    "If this idea is executed correctly, could the resulting paper be published at [VENUE_NAME]?"

    ## Review Calibration Standards
    [INJECT CALIBRATION TIERS FROM VENUE PROFILE]

    ## Reviewer Profiles
    [INJECT REVIEWER PROFILES FROM VENUE PROFILE]

    === IDEA ===
    Title: [title]
    Thesis: [one-sentence thesis]
    Problem: [gap addressed]
    Core Mechanism: [key technical insight]
    Contribution Type: [empirical/method/theory/diagnostic]
    Closest Work: [paper + delta, from Module A]
    Novelty Score: [X/10, from Module A]
    === END IDEA ===

    For EACH reviewer, output:
    1. **Calibration Tier**: Tier 1/2/3, with reasoning
    2. **Strengths**: strengths from this reviewer's perspective (at least 2 specific points)
    3. **Critical Weaknesses**: 2-3 specific, actionable weaknesses (no vague generalities)
    4. **Verdict**: [choose from verdict_options: Strong Reject / Reject / Weak Reject / Weak Accept / Accept / Strong Accept]
    5. **"What it would take for me to Accept"**: 1-2 sentences telling the author exactly what is needed

    Then write a **Meta Review**:
    - The core disagreement among reviewers (if any). The reviewers should disagree; do not make all three unanimous.
    - Final verdict: [choose from verdict_options]
    - If rejected: what tier of venue does this idea suit? (e.g., "suitable for AAAI/IJCAI" or "resubmit to a workshop")
    - If accepted: what would it take to contend for Best Paper?
    - Top 3 execution risks (technical risk, experimental risk, positioning risk)
```
Codex MCP failure handling: If mcp__codex__codex is unavailable:
Convert each reviewer's verdict to a numeric score:
| Verdict | Score |
|---|---|
| Strong Reject | 1 |
| Reject | 3 |
| Weak Reject | 4 |
| Weak Accept | 6 |
| Accept | 8 |
| Strong Accept | 10 |
Venue Score = average of 3 reviewer verdict scores (rounded to 1 decimal place).
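The verdict conversion and averaging above can be sketched as follows (a minimal sketch; the function name is an assumption):

```python
# Verdict-to-score mapping from the table above.
VERDICT_SCORES = {
    "Strong Reject": 1, "Reject": 3, "Weak Reject": 4,
    "Weak Accept": 6, "Accept": 8, "Strong Accept": 10,
}

def venue_score(verdicts):
    """Average the reviewers' verdict scores, rounded to 1 decimal place."""
    return round(sum(VERDICT_SCORES[v] for v in verdicts) / len(verdicts), 1)
```

For example, verdicts of Weak Accept, Accept, and Weak Reject yield (6 + 8 + 4) / 3 = 6.0.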
This module extends Prof. Bingsheng He's research ideation framework into a quantified five-dimension evaluation. Claude performs this assessment directly — no external LLM call needed.
For EACH idea, evaluate on these 5 dimensions (1-10 each):
**Dimension 1: Longevity.** Is the core problem persistent, or is it a transient fad?
| Score Range | Meaning |
|---|---|
| 1-3 | Likely obsolete within 1 year (e.g., tied to a specific model version or API) |
| 4-6 | Relevant for 2-3 years (e.g., current architectural paradigm) |
| 7-10 | Addresses a fundamental, long-standing problem (e.g., generalization, efficiency, interpretability) |
Ask: "Will researchers still care about this problem in 5 years?"
**Dimension 2: Roadmap Viability.** Can this idea grow into a multi-paper research arc?
| Score Range | Meaning |
|---|---|
| 1-3 | One-off finding, no natural follow-up work |
| 4-6 | One clear extension paper possible |
| 7-10 | Opens a new sub-area; 3+ papers are naturally achievable (foundational contribution → extensions → system/application) |
Ask: "After Paper 1, what are Papers 2 and 3?"
**Dimension 3: Application Grounding.** Does the idea connect to real-world needs?
| Score Range | Meaning |
|---|---|
| 1-3 | Pure theoretical curiosity with no foreseeable application |
| 4-6 | Benchmark-only demonstration (e.g., improves CIFAR-10 accuracy) |
| 7-10 | Clear industry or societal application (e.g., healthcare, sustainability, production ML systems) |
Ask: "Who outside academia would care about this result?"
**Dimension 4: Execution Uniqueness.** Does the researcher (or team) have a unique advantage in executing this idea?
| Score Range | Meaning |
|---|---|
| 1-3 | Any competent team could do this equally well; high risk of being scooped |
| 4-6 | Moderate advantage (e.g., some relevant prior work, partial infrastructure) |
| 7-10 | Strong unique position (e.g., proprietary data, unique computational resources, rare domain expertise, established collaboration) |
Ask: "Why should THIS team pursue this, rather than a team at Google/DeepMind/FAIR?"
**Dimension 5: Iteration Readiness.** How fast is the experimental feedback loop?
| Score Range | Meaning |
|---|---|
| 1-3 | Each iteration takes weeks (e.g., large-scale pre-training, human studies) |
| 4-6 | Each iteration takes days (e.g., medium-scale training, moderate compute) |
| 7-10 | Iterations within hours; rapid signal on whether the idea works (e.g., small diagnostic experiments, existing benchmarks, fast prototyping) |
Ask: "How quickly can we know if this idea is working or dead?"
Strategic Score = average of all 5 dimension scores (rounded to 1 decimal place).
### Strategic Fit: [Idea Title]
- **Strategic Score**: X.X/10
- **Dimensions**:
| Dimension | Score | Justification |
|-----------|-------|---------------|
| Longevity | X/10 | [1-2 sentences] |
| Roadmap Viability | X/10 | [1-2 sentences] |
| Application Grounding | X/10 | [1-2 sentences] |
| Execution Uniqueness | X/10 | [1-2 sentences] |
| Iteration Readiness | X/10 | [1-2 sentences] |
- **Strategic recommendation**: [1-2 sentences on whether this is a good bet for the researcher]
After all 3 modules complete for each idea, compute the composite score:
```
COMPOSITE = (
    COMPOSITE_WEIGHTS.novelty     * Novelty_Score     +  # 0-10 from Module A
    COMPOSITE_WEIGHTS.venue       * Venue_Score       +  # 0-10 from Module B
    COMPOSITE_WEIGHTS.strategic   * Strategic_Score   +  # 0-10 from Module C
    COMPOSITE_WEIGHTS.feasibility * Feasibility_Score    # 0-10 carried from idea-gen
)
```
Where:
- `Feasibility_Score` is carried over from the `/idea-gen` output. If not available (e.g., ideas were provided directly), Claude estimates feasibility on a 0-10 scale based on: computational requirements, data availability, timeline, and implementation complexity.

Default weights: `novelty=0.25, venue=0.35, strategic=0.20, feasibility=0.20`
Override with: `--weights: novelty=0.3, venue=0.3, strategic=0.2, feasibility=0.2`
| Composite Score | Recommendation | Action |
|---|---|---|
| >= 7.0 (PROCEED_THRESHOLD) | PROCEED | Move to /idea-refine for detailed development |
| 5.0 - 6.9 (CAUTION to PROCEED range) | PROCEED WITH CAUTION | Address specific weaknesses first; consider /lit-survey on flagged sub-topics |
| < 5.0 (below CAUTION_THRESHOLD) | ABANDON | Document for future reference; do not invest further effort |
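The composite computation and the threshold mapping above can be sketched as (illustrative; the function names are assumptions):

```python
PROCEED_THRESHOLD = 7.0
CAUTION_THRESHOLD = 5.0

def composite_score(scores: dict, weights: dict) -> float:
    """Weighted sum of the four 0-10 component scores, rounded to 2 decimals."""
    return round(sum(weights[k] * scores[k] for k in weights), 2)

def recommendation(composite: float) -> str:
    """Map a composite score to the three-way recommendation."""
    if composite >= PROCEED_THRESHOLD:
        return "PROCEED"
    if composite >= CAUTION_THRESHOLD:
        return "PROCEED WITH CAUTION"
    return "ABANDON"
```

With the default weights, component scores of 8.5 / 7.2 / 8.0 / 7.5 (novelty / venue / strategic / feasibility) give a composite of roughly 7.7, which maps to PROCEED.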
All-ABANDON fallback: If ALL ideas score below CAUTION_THRESHOLD (5.0):
`outputs/LANDSCAPE.json` if available.

When screening multiple ideas, Module A for different ideas can run concurrently. The constraint is: for a single idea, Module A must finish before Module B starts (since Module B's prompt includes the novelty score and closest prior work from Module A).
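The scheduling constraint can be sketched with `asyncio`. This is a toy sketch with placeholder module functions and dummy scores; the real modules perform web searches and Codex MCP calls.

```python
import asyncio

async def module_a(idea: str) -> dict:
    # Placeholder for the novelty check; the real module runs searches.
    await asyncio.sleep(0)
    return {"idea": idea, "novelty": 7.5}  # dummy score

async def module_b(novelty_report: dict) -> dict:
    # Placeholder for the venue simulation; its prompt needs Module A's output.
    await asyncio.sleep(0)
    return {**novelty_report, "venue": 6.8}  # dummy score

async def screen(idea: str) -> dict:
    # Within a single idea, Module A must complete before Module B starts.
    return await module_b(await module_a(idea))

async def screen_all(ideas: list) -> list:
    # Across ideas, the per-idea pipelines run concurrently.
    return await asyncio.gather(*(screen(i) for i in ideas))

results = asyncio.run(screen_all(["Idea 1", "Idea 2"]))
```

`asyncio.gather` preserves input order, so results line up with the original idea list.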
Create the outputs/ directory if it does not exist:
```bash
mkdir -p outputs
```
`outputs/SCREENING_REPORT.md`: Full detailed report with a per-idea breakdown across all 3 modules.
# Screening Report
**Direction**: [research direction]
**Venue**: [target venue]
**Date**: [YYYY-MM-DD]
**Ideas screened**: N
**Composite weights**: novelty=X, venue=X, strategic=X, feasibility=X
## Executive Summary
[2-3 paragraphs summarizing the screening results. How many ideas passed? What are the top recommendations? Any surprises?]
## Per-Idea Reports
### Idea 1: [Title] — [PROCEED/CAUTION/ABANDON]
#### Module A: Novelty Assessment
[Full Phase D novelty report]
#### Module B: Venue Reviewer Simulation ([VENUE])
[Full 3-reviewer + meta-review output]
#### Module C: Strategic Fit Assessment
[Full 5-dimension strategic report]
#### Composite Score
| Component | Score | Weight | Weighted |
|-----------|-------|--------|----------|
| Novelty | X/10 | 0.25 | X.XX |
| Venue | X/10 | 0.35 | X.XX |
| Strategic | X/10 | 0.20 | X.XX |
| Feasibility | X/10 | 0.20 | X.XX |
| **Composite** | | | **X.XX** |
**Recommendation**: PROCEED / PROCEED WITH CAUTION / ABANDON
---
### Idea 2: [Title] — [PROCEED/CAUTION/ABANDON]
[repeat structure]
---
[repeat for all ideas]
`outputs/SCREENING_RANKED.md`: Concise ranked summary for quick reference and handoff to downstream skills.
# Screening Results: Ranked Ideas
**Direction**: [direction]
**Venue**: [venue]
**Date**: [YYYY-MM-DD]
**Ideas screened**: N
## Rankings
| Rank | Idea | Novelty | Venue Score | Strategic | Feasibility | Composite | Recommendation |
|------|------|---------|-------------|-----------|-------------|-----------|----------------|
| 1 | ... | 8.5 | 7.2 | 8.0 | 7.5 | 7.8 | PROCEED |
| 2 | ... | 7.0 | 6.8 | 7.5 | 8.0 | 7.2 | PROCEED |
| 3 | ... | 6.0 | 5.5 | 6.0 | 7.0 | 6.0 | CAUTION |
| 4 | ... | 4.0 | 3.5 | 5.0 | 6.0 | 4.4 | ABANDON |
## Detailed Per-Idea Reports
### Rank 1: [Title] — PROCEED
#### Module A: Novelty
- Score: X/10
- Key differentiator: [what makes it unique]
- Closest prior work: [paper, year, delta]
#### Module B: Venue Simulation ([VENUE])
- Reviewer 1 ([persona]): [Verdict] — [1-line summary]
- Reviewer 2 ([persona]): [Verdict] — [1-line summary]
- Reviewer 3 ([persona]): [Verdict] — [1-line summary]
- Meta-review: [Final verdict] — [1-line summary]
- Top risk: [the single biggest execution risk]
#### Module C: Strategic Fit
- Longevity: X/10 — [1 line]
- Roadmap Viability: X/10 — [1 line]
- Application Grounding: X/10 — [1 line]
- Execution Uniqueness: X/10 — [1 line]
- Iteration Readiness: X/10 — [1 line]
---
### Rank 2: [Title] — PROCEED
[repeat structure]
---
[repeat for all ideas, in rank order]
## Next Steps
### For PROCEED ideas:
- Run `/idea-refine` to develop detailed research plans, experimental designs, and paper outlines.
### For PROCEED WITH CAUTION ideas:
- Run `/lit-survey` on the specific sub-topics flagged as weak by the reviewers.
- Address the critical weaknesses identified in Module B before proceeding.
- Re-screen after improvements.
### For ABANDON ideas:
- Documented here for future reference.
- May revisit if the landscape changes (new tools, new datasets, paradigm shifts).
- Consider whether a sub-component of the idea could be extracted and developed independently.
If `Write` fails due to file size, fall back to Bash with a heredoc:

```bash
cat << 'SCREENING_EOF' > outputs/SCREENING_REPORT.md
[content]
SCREENING_EOF
```
- If `Feasibility_Score` is not available from the `/idea-gen` output, estimate it based on: computational requirements, data availability, timeline to first results, and implementation complexity.
- If `Write` fails, use Bash with a heredoc. The screening report can be long for multiple ideas.

Pipeline position: `/lit-survey` → `/idea-gen` → `/idea-screen` (you are here) → `/idea-refine`
- From `/lit-survey`: `outputs/LANDSCAPE.json` — paper database for novelty cross-referencing.
- From `/idea-gen`: `outputs/IDEAS_FILTERED.md` — candidate ideas with feasibility scores.
- To `/idea-refine`: `outputs/SCREENING_RANKED.md` — ranked ideas with detailed assessments, ready for refinement.

The screening skill is the critical quality gate in the pipeline. Its purpose is to prevent the researcher from investing weeks in an idea that is not novel, unlikely to survive peer review, or strategically unsound. Be rigorous. Be honest. Save the researcher's time.