Skip to content

搵技能.../

Agent Skill Search Engine

搜尋

搜尋
分類
職業

About

About
Privacy
Terms

© 2026 Skills Pool. All rights reserved.

Mcts Backpropagate | Skills Pool

技能檔案

Mcts Backpropagate

Execute the BACKPROPAGATION phase of MCTS to update node statistics from leaf to root

NewJerseyStyle0 星標2026年1月24日

職業: 軟件開發人員
分類: 機器學習

技能內容

MCTS Backpropagation Phase

You are executing the BACKPROPAGATION phase of Monte Carlo Tree Search.

Backpropagation Algorithm

Start from the simulated node
Traverse up to root:
- For each node on the path:
  - Increment visit count: N = N + 1
  - Add reward to value: Q = Q + reward
Record the update for analysis

Using MCP Tools

Call mcts_backpropagate with:

node_id: The leaf node where simulation ended
reward: The reward from simulation
path: (optional) Explicit path to update

The tool returns:

nodes_updated: List of updated node IDs
new_statistics: Updated Q and N for each node

相關技能

快速安裝

Mcts Backpropagate

npx skillvault add NewJerseyStyle/newjerseystyle-plugin-mcts-skills-mcts-backpropagate-skill-md

下載 Skill 打開源碼倉庫

作者: NewJerseyStyle
星標: 0
更新時間: 2026年1月24日
職業

本頁內容

01MCTS Backpropagation Phase

tree_depth: Current maximum depth

Statistics Update

For each node in the path from leaf to root:

node.N += 1
node.Q += reward
node.avg_reward = node.Q / node.N

Backpropagation Strategy

For the current context: $ARGUMENTS

Standard Update

Each node gets the same reward
Simple and effective for most problems

Discounted Update (optional)

Apply discount factor γ as you go up
Nodes closer to outcome get more credit
node.Q += reward * (γ ^ depth_from_leaf)

Observation Recording

After backpropagation:

Record any new insights as observations
Update beliefs if the result was surprising
Note if any branch is now clearly best/worst

Convergence Check

After updating, check:

Best path stability: Has the best path changed?
Value convergence: Are top nodes' values stabilizing?
Sufficient exploration: Have all branches been tried?

Output

After backpropagation, report:

Nodes updated with new statistics
Current best path and its average reward
Exploration coverage (% of nodes visited)
Whether to continue or extract solution

If continuing, return to SELECTION phase. If converged or budget exhausted, extract the solution.

02

Backpropagation Algorithm

03Using MCP Tools

04Statistics Update

05Backpropagation Strategy

06Standard Update

07Discounted Update (optional)

08Observation Recording

09Convergence Check

Continuous Learning V2

Instinct-based learning system that observes sessions via hooks, creates atomic instincts with confidence scoring, and evolves them into skills/commands/agents. v2.1 adds project-scoped instincts to prevent cross-project contamination.

Continuous Learning V2

Hook'lar aracılığıyla oturumları gözlemleyen, güven skorlaması ile atomik instinct'ler oluşturan ve bunları skill/command/agent'lara evriltiren instinct tabanlı öğrenme sistemi. v2.1 çapraz proje kontaminasyonunu önlemek için proje kapsamlı instinct'ler ekler.

Continuous Learning V2

훅을 통해 세션을 관찰하고, 신뢰도 점수가 있는 원자적 본능을 생성하며, 이를 스킬/명령어/에이전트로 진화시키는 본능 기반 학습 시스템. v2.1에서는 프로젝트 간 오염을 방지하기 위한 프로젝트 범위 본능이 추가되었습니다.

Continuous Learning

Claude Codeセッションから再利用可能なパターンを自動的に抽出し、将来の使用のために学習済みスキルとして保存します。

Continuous Learning

Automatically extract reusable patterns from Claude Code sessions and save them as learned skills for future use.

Pytorch Patterns

PyTorch deep learning patterns and best practices for building robust, efficient, and reproducible training pipelines, model architectures, and data loading.

軟件開發人員