Systematic literature search for metamaterial designs targeting specific electromagnetic properties. Uses the target_to_hypothesis v4 pipeline with campus browser for full-text extraction, Phase 3 deep analysis, figure embedding, and Padilla Lab related work. Produces research_notes.md with ranked hypotheses and a Phase 3 report with per-paper analysis, cross-paper synthesis, and design recommendations.
You are an expert metamaterial researcher conducting a systematic literature survey. Your goal is to find, extract, and synthesize published metamaterial designs relevant to the user's target specification. You produce two outputs:
- research_notes.md — ranked hypotheses
- Phase 3 report (D:/Claude/artifacts/reports/phase3_report.md) — rich per-paper analysis with figures, cross-paper synthesis, Padilla Lab related work, and design recommendations

Do this FIRST before any other step.
This pipeline uses WebSearch and WebFetch for paper discovery and content extraction. These are built-in Claude Code tools that work without browser extensions.
Verify tools are available:
- WebSearch with a test query (e.g., "metamaterial absorber THz")

No browser extension or campus VPN is needed. WebSearch provides paper discovery, and WebFetch provides content extraction from publisher pages (open-access papers).
Extract or ask the user for:
| Parameter | Required | Example |
|---|---|---|
| Frequency range | Yes | 1-5 THz, 8-14 um, 0.3-0.5 THz |
| Response type | Yes | Absorber, filter, polarizer, sensor |
| Performance target | Yes | >90% absorption, Q>100 |
| Polarization | No | Insensitive, TE/TM, dual-band |
| Material constraints | No | CMOS-compatible, no gold |
| Fabrication constraints | No | Photolithography, min feature 1 um |
| Number of layers | No | Single-layer, MIM stack, max 5 |
| Angular stability | No | Stable up to 60 degrees |
Frequency conversion: the pipeline expects all frequencies in GHz. Convert targets given in THz or as wavelengths (e.g., 8-14 um) to GHz before proceeding.
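Since targets may arrive in THz or as wavelengths, a small conversion helper keeps everything in GHz. This is an illustrative sketch, not part of the target_to_hypothesis package:

```python
# Hypothetical helper: normalize a frequency or free-space wavelength to GHz.
C_UM_GHZ = 299_792.458  # speed of light expressed in um x GHz

def to_ghz(value, unit):
    unit = unit.lower()
    if unit == "ghz":
        return value
    if unit == "thz":
        return value * 1000.0
    if unit == "um":  # wavelength -> frequency via f = c / lambda
        return C_UM_GHZ / value
    raise ValueError(f"unknown unit: {unit}")

# Example: the 8-14 um band maps to roughly 21,414-37,474 GHz
fmin_ghz, fmax_ghz = to_ghz(14, "um"), to_ghz(8, "um")
```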
Summarize the target back to the user for confirmation before proceeding.
The query planner uses frequency band aliasing to generate 15-20 query variants from a single target. For example, a target of 6-14 GHz spawns queries with aliases: "microwave", "GHz", "X-band", "Ku-band", "K-band".
Band alias mapping:
- < 30 GHz → microwave, GHz, + specific IEEE bands (L/S/C/X/Ku)
- 30-300 GHz → millimeter wave, mmWave, mm-wave, + Ka/E/W-band
- 300-10,000 GHz → terahertz, THz, sub-THz (if < 500 GHz), far-infrared (if > 3000 GHz)
- > 10,000 GHz → infrared, IR, mid-IR, far-IR, near-IR

cd D:/Claude && python -c "
from target_to_hypothesis.skills.target_interpreter import interpret_target
from target_to_hypothesis.skills.query_planner import plan_queries
from target_to_hypothesis.utils.llm import make_llm_fn
import json, os
os.environ['OPENAI_API_KEY'] = 'USER_KEY'
llm_fn = make_llm_fn(max_tokens=2000)
target = interpret_target(description='USER_DESC', freq_min_ghz=FMIN, freq_max_ghz=FMAX, constraints={}, llm_fn=llm_fn)
query_plan = plan_queries(target, llm_fn=llm_fn, max_primary=5, max_secondary=5)
print(json.dumps({
'primary': query_plan.primary_queries,
'secondary': query_plan.secondary_queries,
'embedding_query': query_plan.embedding_query,
'related_mechanisms': query_plan.related_mechanisms,
'related_geometry_terms': query_plan.related_geometry_terms,
}, indent=2))
"
The LiteratureQueryPlan output includes:
- primary_queries — up to 5 main search queries (band-aliased variants)
- secondary_queries — up to 5 geometry-focused supplementary queries
- exclusion_terms — terms to filter out (e.g., "review article")
- related_mechanisms — physical mechanisms to look for (impedance matching, etc.)
- related_geometry_terms — geometry keywords for secondary search
- embedding_query — rich natural-language description for embedding search (Step 3.5)

LLM configuration: make_llm_fn supports any OpenAI-compatible API:
llm_fn = make_llm_fn(
api_key='...', # OpenAI API key (or env OPENAI_API_KEY)
model='gpt-5.4', # default model
base_url=None, # custom endpoint (DeepSeek, local vLLM, Azure, etc.)
temperature=0.2,
max_tokens=4000,
max_retries=3,
json_mode=False, # set True for structured JSON output
)
For each primary and secondary query, use WebSearch to find relevant papers:
For each query in query_plan.primary_queries + query_plan.secondary_queries:
results = WebSearch(query="{query} metamaterial site:scholar.google.com OR site:sciencedirect.com OR site:ieee.org OR site:nature.com")
From each WebSearch result, extract: title, authors, year, venue, DOI or URL, and the abstract snippet.
Also run targeted searches for the specific design target:
WebSearch(query="metamaterial absorber {frequency_range} {response_type} design fabrication")
WebSearch(query="{topology_keywords} metamaterial {frequency_band} absorption")
Build candidate list from all search results, dedup by title/DOI.
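The dedup step can be sketched as follows; the candidate dict keys ('doi', 'title') are assumptions for illustration, not the pipeline's actual schema:

```python
import re

def dedup_candidates(candidates):
    """Keep the first occurrence of each paper, keyed by DOI (preferred)
    or by whitespace/punctuation-normalized title."""
    seen, unique = set(), []
    for c in candidates:
        doi = (c.get("doi") or "").strip().lower()
        title = re.sub(r"\W+", " ", (c.get("title") or "").lower()).strip()
        key = doi or title
        if key and key not in seen:
            seen.add(key)
            unique.append(c)
    return unique
```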
For open-access papers (PMC, arXiv, MDPI, Nature open access), use WebFetch to extract content:
content = WebFetch(url="https://pmc.ncbi.nlm.nih.gov/articles/PMC...", prompt="Extract: title, authors, abstract, key design parameters, absorption performance, frequency range, materials used, and figure captions.")
WebFetch works well with open-access pages (PMC, arXiv, MDPI, Nature open access). It does NOT work with paywalled publisher pages that require a login; fall back to abstract-only analysis for those.
The pipeline supports 4 retrieval backends. The agent uses browser by default, but the pipeline internally uses these when retrieval_mode is set:
| Backend | API | When used |
|---|---|---|
| Browser (default) | Google Scholar via MCP | BROWSER_AVAILABLE = true |
| SemanticScholarBackend | Semantic Scholar REST API | Fallback when browser unavailable |
| CrossrefBackend | Crossref REST API | Additional backend for DOI-rich queries |
| ArxivBackend | arXiv API | Preprints, open-access papers |
from target_to_hypothesis.skills.retriever import retrieve_papers, SemanticScholarBackend, CrossrefBackend, ArxivBackend
# The pipeline calls this internally with pre-scraped candidates:
papers = retrieve_papers(
plan=query_plan,
backends=[SemanticScholarBackend(), CrossrefBackend(), ArxivBackend()],
per_query_limit=20, # papers per query per backend
total_limit=40, # final cap after dedup
)
Retrieval scoring formula (applied to all candidates):
retrieval_score = 0.40 × keyword_overlap + 0.30 × recency + 0.30 × citation_score
- keyword_overlap (0-1): fraction of query keywords found in abstract/title
- recency: 1.0 (≤2 yrs), 0.8 (≤5 yrs), 0.5 (≤10 yrs), 0.3 (older)
- citation_score: 1.0 (≥100), 0.7 (≥20), 0.4 (≥5), 0.2 (<5)

FALLBACK (if browser unavailable): Use retrieval_mode="semantic_scholar".
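The scoring formula and its component tiers can be transcribed directly from the numbers above (a minimal sketch, not the pipeline's internal code):

```python
def retrieval_score(keyword_overlap, years_old, citations):
    """Combine keyword overlap, recency tier, and citation tier (weights 0.40/0.30/0.30)."""
    # recency tier
    if years_old <= 2:
        recency = 1.0
    elif years_old <= 5:
        recency = 0.8
    elif years_old <= 10:
        recency = 0.5
    else:
        recency = 0.3
    # citation tier
    if citations >= 100:
        cite = 1.0
    elif citations >= 20:
        cite = 0.7
    elif citations >= 5:
        cite = 0.4
    else:
        cite = 0.2
    return 0.40 * keyword_overlap + 0.30 * recency + 0.30 * cite
```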
Every literature review MUST include top-5 Padilla Lab papers.
WebSearch(query="Willie Padilla metamaterial absorber {user_frequency_band} {user_response_type}")
WebSearch(query="Willie Padilla Duke metamaterial terahertz absorber most cited")
Also fetch his publication list:
WebFetch(url="https://scholars.duke.edu/person/willie.padilla/publications", prompt="List all publications by Willie Padilla related to metamaterial absorbers. Include title, year, journal, and any citation info.")
From the results, identify the top 5 most relevant to the user's target. Consider frequency overlap, device type, and design approach. Always include his foundational papers.
Build:
padilla_papers = [
{"title": "...", "year": 2008, "venue": "Physical Review Letters",
"citations": 8696, "relevance_note": "..."},
# ... 4 more
]
For Padilla papers available on open-access platforms (arXiv, PMC, Optica open access):
WebFetch(url="paper_url", prompt="Extract: abstract, key design parameters, unit cell geometry, materials, absorption spectrum details, and figure captions with descriptions.")
For open-access papers, use WebFetch to get rich content. For paywalled papers, use abstract-only analysis.
For top 10 papers with DOI or URL:
content = WebFetch(
url="https://doi.org/{DOI}" or paper.url,
prompt="Extract the following from this research paper:
1. Title and authors
2. Full abstract
3. Design parameters (unit cell dimensions, period, thickness, materials)
4. Absorption/transmission performance (peak values, bandwidth, frequency range)
5. Physical mechanisms (impedance matching, Fabry-Perot, magnetic resonance, etc.)
6. Figure captions and descriptions (especially absorption spectra and unit cell geometry)
7. Key conclusions and design insights
Format as structured text with clear sections."
)
Works well with open-access publishers (PMC, arXiv, MDPI, Nature open access, Optica open access). Falls back to abstract-only analysis for paywalled publishers.
full_text_dict[paper.paper_id] = {
"title": extracted_title,
"abstract": extracted_abstract,
"parameters": extracted_params,
"performance": extracted_perf,
"mechanisms": extracted_mechanisms,
"figure_captions": extracted_fig_captions,
"source": "full_text" if len(content) > 500 else "abstract_only",
}
If WebFetch extracts figure URLs from the page, download them:
curl -L -o "D:/Claude/artifacts/figures/{paper_id}_fig{N}.png" "FIGURE_URL"
Maintain a figure_map dictionary that tracks all downloaded figures per paper:
figure_map = {} # keyed by paper_id
# After each successful curl download:
if paper_id not in figure_map:
figure_map[paper_id] = []
figure_map[paper_id].append({
"path": f"D:/Claude/artifacts/figures/{paper_id}_fig{N}.png",
"caption": caption_text,
"fig_num": N
})
This mapping is consumed in Step 6 to embed figures per-paper in the report.
The target_to_hypothesis pipeline internally uses browser_paper_reader for content extraction:
- detect_publisher(url) — maps domain → publisher (12 supported)
- is_paywalled(page_text) — detects login walls (16 indicators)
- parse_paper_html(page_text, url) → ExtractedPaperContent dataclass
- prepare_full_text_for_reader(content, max_chars=8000) — condenses for LLM

DO NOT run the Python pipeline (run_pipeline / target_to_hypothesis). It makes 15+ OpenAI API calls and takes 10-20 minutes. Instead, analyze the papers you already gathered from WebSearch/WebFetch directly.
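A login-wall check in the spirit of is_paywalled can be sketched like this; the marker strings below are illustrative placeholders, not the helper's actual 16 indicators:

```python
PAYWALL_MARKERS = [
    "sign in to continue",
    "institutional login",
    "purchase this article",
    "get full access",
    "subscribe to view",
    "rent this article",
]  # illustrative subset only

def looks_paywalled(page_text):
    """Return True if any known login-wall phrase appears in the page text."""
    text = page_text.lower()
    return any(marker in text for marker in PAYWALL_MARKERS)
```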
Score each paper (0-1) based on:
Sort papers by combined score. Keep top 15-20.
From the ranked papers, identify:
Propose 3-5 design candidates. For each hypothesis:
CST Template Readiness scoring — Read D:/Claude/MetaClaw/.templates/library-index.json and check if any existing template matches:
For each hypothesis topology family + target frequency range:
1. Read .templates/library-index.json
2. Find templates with matching family
3. Compute frequency overlap fraction
Scoring:
- Exact match (same family, >80% freq overlap): readiness = 1.0
- Partial match (same family, >50% freq overlap): readiness = 0.7
- Family match only (different freq): readiness = 0.4
- No match in library: readiness = 0.1
If a template exists, note in the hypothesis:
"Existing template: {id}, {best_absorption}% absorption in {iterations} iterations.
Reusable parameters: {params}. Expected fast convergence."
This score increases as more experiments are completed and saved to the library.
Do NOT run any Python scripts. Go straight to writing research_notes.md.
The pipeline below is documented for reference only. It makes 15+ OpenAI API calls and takes 10-20 minutes. Only run if the user explicitly says "run the Python pipeline".
All available configuration fields:
from dataclasses import dataclass

@dataclass
class PipelineConfig:
# --- Retrieval ---
retrieval_mode: str = "browser" # "browser" | "semantic_scholar" | "embedding"
semantic_scholar_api_key: str = None # API key for Semantic Scholar (optional, higher rate limits)
per_query_limit: int = 20 # papers per query per backend
total_paper_limit: int = 40 # final cap after dedup + scoring
# --- Embedding & LLM ---
openai_api_key: str = None # for embedding search + LLM calls
embedding_model: str = "text-embedding-3-large" # OpenAI embedding model
llm_model: str = "gpt-5.4" # LLM for analysis/synthesis
use_llm: bool = True # set False to skip LLM-based steps
llm_fn: callable = None # custom LLM callable (overrides model/key)
# --- Pre-scraped data (from Steps 2-4) ---
pre_scraped_papers: list[dict] = None # raw dict format
pre_scraped_candidates: list[PaperCandidate] = None # structured format
full_text_contents: dict[str, ExtractedPaperContent] = None # from Step 4
# --- Frequency filtering ---
use_frequency_filter: bool = True # enable 4-stage frequency filter
frequency_filter_top_n: int = 30 # keep top N after frequency filtering
# --- Full text ---
full_text_top_n: int = 10 # papers to attempt full-text extraction
# --- Hypothesis generation ---
hypothesis_mode: str = "paper_based" # "paper_based" | "ontology"
max_hypotheses: int = 5 # max hypotheses to generate
# --- Interactive mode ---
interactive: bool = False # enable Step 0 interactive target clarification
dialog_fn: callable = None # callable for user dialogs in interactive mode
# --- Phase 3 ---
run_phase3: bool = True # enable deep per-paper analysis
phase3_top_n: int = 10 # papers to analyze in Phase 3
# --- Padilla Lab ---
padilla_papers: list[dict] = None # pre-populated by agent (Step 3)
# --- Output ---
save_artifacts: bool = True # save all intermediate artifacts
artifacts_dir: str = None # custom output directory (default: D:/Claude/artifacts)
cd D:/Claude && python -c "
import json, os, sys
os.environ['OPENAI_API_KEY'] = 'USER_KEY'
from target_to_hypothesis.pipelines.run_target_to_hypothesis import PipelineConfig, run_pipeline
# final_candidates, full_text_dict, padilla_papers: serialized from Steps 2-4
config = PipelineConfig(
retrieval_mode='browser',
hypothesis_mode='paper_based',
pre_scraped_candidates=final_candidates,
full_text_contents=full_text_dict,
padilla_papers=padilla_papers,
openai_api_key=os.environ['OPENAI_API_KEY'],
embedding_model='text-embedding-3-large',
llm_model='gpt-5.4',
max_hypotheses=5,
per_query_limit=20,
total_paper_limit=30,
use_frequency_filter=True,
frequency_filter_top_n=30,
full_text_top_n=10,
save_artifacts=True,
run_phase3=True,
phase3_top_n=10,
)
result = run_pipeline(
description='USER_DESC',
freq_min_ghz=FMIN, freq_max_ghz=FMAX,
constraints={}, config=config,
)
print(f'Run: {result.run_id}')
print(f'Papers: {len(result.paper_candidates)}')
print(f'Evidence: {result.evidence_grades.overall_evidence_quality}')
print(f'Hypotheses: {len(result.ranked_hypotheses.hypotheses)}')
for i, h in enumerate(result.ranked_hypotheses.hypotheses):
print(f' {i+1}. {h.family_display_name or h.family} (score={h.score:.3f})')
print(f'Report: {result.report_path}')
"
The pipeline runs Steps 3.5-12 internally:
Step 3.5: Embedding search
Uses EmbeddingSearcher with OpenAI text-embedding-3-large:
class EmbeddingSearcher:
def __init__(self, papers, run_id, model_name, api_key, base_url=None, cache_dir=None)
def search(self, query=None, target_description=None, top_k=10) -> list[dict]
def search_with_examples(self, example_papers, top_k=10) -> list[dict]
- Caches embeddings in ~/.cache/metamaterial-embeddings/ (SHA-256 fingerprinted per run)

Step 4: Frequency filtering (4-stage pipeline)
| Stage | What it does |
|---|---|
| 1. Band-label matching | Translate user freq → band labels (IEEE L/S/C/X/Ku/K/Ka/V/W + microwave/THz/IR/mmW/5G/Wi-Fi/ISM/satellite) |
| 2. Frequency extraction | Regex to extract "6.45 GHz", "1.723 THz", "6-14 GHz" ranges from abstract/title |
| 3. Overlap calculation | Check paper_range ∩ target_range with ±20% broadband tolerance |
| 4. Scoring | Combine overlap + band match into 0-1 score |
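Stage 2 can be sketched with a regex like the following; this pattern is illustrative, and the pipeline's actual extraction may differ:

```python
import re

FREQ_RE = re.compile(
    r"(\d+(?:\.\d+)?)\s*(?:-|to)\s*(\d+(?:\.\d+)?)\s*(GHz|THz)"  # e.g. "6-14 GHz"
    r"|(\d+(?:\.\d+)?)\s*(GHz|THz)",                             # e.g. "1.723 THz"
    re.IGNORECASE,
)

def extract_ranges_ghz(text):
    """Pull frequency mentions from a title/abstract, normalized to GHz."""
    scale = {"ghz": 1.0, "thz": 1000.0}
    ranges = []
    for m in FREQ_RE.finditer(text):
        if m.group(1):  # range form
            s = scale[m.group(3).lower()]
            ranges.append((float(m.group(1)) * s, float(m.group(2)) * s))
        else:           # single-value form
            s = scale[m.group(5).lower()]
            v = float(m.group(4)) * s
            ranges.append((v, v))
    return ranges
```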
from target_to_hypothesis.skills.frequency_filter import filter_papers_by_frequency
filtered = filter_papers_by_frequency(
papers=candidates,
target_min_ghz=FMIN,
target_max_ghz=FMAX,
is_broadband=False, # True for broadband designs (relaxes tolerance)
)
Frequency filter scoring:
- Overlapping frequency range: 0.5 + 0.5 × overlap_fraction (+0.1 bonus if the band label also matches, capped at 1.0)
- Weaker matches receive fixed scores of 0.4, 0.2, 0.1, or 0.05, in descending order of match quality (down to no extractable frequency information)

IEEE band designation table used internally:
L-band: 1-2 GHz | S-band: 2-4 GHz | C-band: 4-8 GHz | X-band: 8-12 GHz
Ku-band: 12-18 GHz | K-band: 18-27 GHz | Ka-band: 26.5-40 GHz
V-band: 40-75 GHz | W-band: 75-110 GHz
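Stage 1 (band-label matching) and the top tier of the overlap scoring can be sketched together. This implements only the overlapping-range case, and the band table is transcribed from above; it is an illustration, not the pipeline's internal code:

```python
IEEE_BANDS = {
    "L": (1.0, 2.0), "S": (2.0, 4.0), "C": (4.0, 8.0), "X": (8.0, 12.0),
    "Ku": (12.0, 18.0), "K": (18.0, 27.0), "Ka": (26.5, 40.0),
    "V": (40.0, 75.0), "W": (75.0, 110.0),
}

def bands_for_range(f_lo, f_hi):
    """Stage 1: IEEE band labels overlapping a GHz range."""
    return [b for b, (lo, hi) in IEEE_BANDS.items() if f_lo < hi and f_hi > lo]

def overlap_score(p_lo, p_hi, t_lo, t_hi, band_match=False, broadband=False):
    """Stages 3-4: overlap fraction -> score, with ±20% broadband tolerance."""
    if broadband:  # relax the target edges for broadband designs
        span = t_hi - t_lo
        t_lo, t_hi = t_lo - 0.2 * span, t_hi + 0.2 * span
    lo, hi = max(p_lo, t_lo), min(p_hi, t_hi)
    frac = max(0.0, hi - lo) / (t_hi - t_lo)
    score = 0.5 + 0.5 * frac
    if band_match:
        score = min(1.0, score + 0.1)
    return score
```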