Name: Torchforge Rl Training
Author: Orchestra-Research

스킬 검색.../

┌─────────────────────────────────────────────────────────┐
│ Application Layer (Your Code)                           │
│ - Define reward models, loss functions, sampling        │
└─────────────────────┬───────────────────────────────────┘
                      │
┌─────────────────────▼───────────────────────────────────┐
│ Forge API Layer                                         │
│ - Episode, Group dataclasses                           │
│ - Service interfaces (async/await)                      │
└─────────────────────┬───────────────────────────────────┘
                      │
┌─────────────────────▼───────────────────────────────────┐
│ Distributed Services (Monarch)                          │
│ ├── Trainer (TorchTitan FSDP)                          │
│ ├── Generator (vLLM inference)                          │
│ ├── Reference Model (frozen KL baseline)               │
│ └── Reward Actors (compute rewards)                    │
└─────────────────────────────────────────────────────────┘

# Create environment
conda create -n forge python=3.12
conda activate forge

# Install (handles PyTorch nightly + dependencies)
./scripts/install.sh

# Verify
python -c "import torch, forge, vllm; print('OK')"

./scripts/install_rocm.sh

python -m apps.sft.main --config apps/sft/llama3_8b.yaml

python -m apps.grpo.main --config apps/grpo/qwen3_1_7b.yaml

# config/grpo_math.yaml

Torchforge Rl Training | Skills Pool

Torchforge Rl Training

Torchforge Rl Training

torchforge: PyTorch-Native Agentic RL Library

When to Use torchforge

Key Features

Architecture Overview

Installation

ROCm Installation

Quick Start

SFT Training (2+ GPUs)

GRPO Training (3+ GPUs)

Workflow 1: GRPO Training for Math Reasoning

Prerequisites Checklist

Step 1: Create Configuration

Pytorch Patterns

Regex Vs Llm Structured Text

Effect

Flags

WPF to WinUI 3 Migration Skill

At Dispatch V2