SFT Training Wizard

You are guiding the user through setting up Supervised Fine-Tuning (SFT) for a language model using the ART framework. Act as an interactive wizard: ask questions, validate inputs, and generate a complete runnable script.

Important: Ask ONE question at a time. Wait for the user's response before asking the next question. Never bundle multiple questions into a single message.

Adaptability note: Some steps reference tools like AskUserQuestion, Glob, or Bash. If you don't have access to these tools, simply ask the user the same questions as plain text and skip any steps that require running code (e.g., file search, dataset validation, hyperparameter computation). Do NOT fabricate results — never pretend you ran a tool or searched for files when you didn't.

Step 1: Determine Training Scenario

Ask the user ONE question at a time. Wait for their response before moving to the next question.

Training scenario:

Train from a JSONL file — They have a dataset file with chat-formatted examples
— They want to train a smaller model using outputs from a larger teacher model

SFT Training Wizard

Important: Ask ONE question at a time. Wait for the user's response before asking the next question. Never bundle multiple questions into a single message.

Step 1: Determine Training Scenario

Ask the user ONE question at a time. Wait for their response before moving to the next question.

Training scenario:

Train from a JSONL file — They have a dataset file with chat-formatted examples
— They want to train a smaller model using outputs from a larger teacher model

Train Sft

SFT Training Wizard

Step 1: Determine Training Scenario

Train Sft

SFT Training Wizard

Step 1: Determine Training Scenario

Step 2: Determine Backend

Step 3: Select and Validate Dataset (JSONL scenario)

Step 4: Gather Base Parameters

Step 5: Gather Hyperparameters

For distillation:

Step 6: Generate the Training Script

Post-training block (append to ALL scripts before `backend.close()`):

Backend setup

JSONL file training pattern:

Distillation pattern:

Step 7: Write and Offer to Run

Important Notes

Continuous Learning V2

Continuous Learning V2

Continuous Learning V2

Continuous Learning

Continuous Learning

Pytorch Patterns

Train Sft

SFT Training Wizard

Step 1: Determine Training Scenario

Train Sft

SFT Training Wizard

Step 1: Determine Training Scenario

Step 2: Determine Backend

Step 3: Select and Validate Dataset (JSONL scenario)

Step 4: Gather Base Parameters

Step 5: Gather Hyperparameters

For distillation:

Step 6: Generate the Training Script

Post-training block (append to ALL scripts before backend.close()):

Backend setup

JSONL file training pattern:

Distillation pattern:

Step 7: Write and Offer to Run

Important Notes

Continuous Learning V2

Continuous Learning V2

Continuous Learning V2

Continuous Learning

Continuous Learning

Pytorch Patterns

Post-training block (append to ALL scripts before `backend.close()`):