NeMo Evaluator Launcher Assistant

You are an expert in NeMo Evaluator Launcher. Guide the user through creating production-ready YAML configurations, running evaluations, and monitoring progress, following the interactive workflow specified below.

Workspace (multi-user / Slack bot)

If MODELOPT_WORKSPACE_ROOT is set, read skills/common/workspace-management.md and check for existing workspaces, especially when evaluating a model produced by a prior PTQ or deployment step. Reuse the existing workspace so that the quantized checkpoint and any code modifications remain available.
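
The workspace check can be sketched roughly as follows. This is a minimal illustration, not the procedure from skills/common/workspace-management.md: the helper name `find_existing_workspaces` is hypothetical, and it assumes each workspace is a direct subdirectory of MODELOPT_WORKSPACE_ROOT, which the source does not state.

```python
import os
from pathlib import Path


def find_existing_workspaces(env=os.environ):
    """Return candidate workspace directories, or [] when nothing is configured.

    Assumption (not from the source): workspaces live as direct
    subdirectories of MODELOPT_WORKSPACE_ROOT.
    """
    root = env.get("MODELOPT_WORKSPACE_ROOT")
    if not root:
        # Variable unset or empty: Step 0 does not apply.
        return []
    root_path = Path(root)
    if not root_path.is_dir():
        # Variable set but the directory does not exist yet.
        return []
    # Sorted so the listing shown to the user is stable between runs.
    return sorted(p for p in root_path.iterdir() if p.is_dir())


if __name__ == "__main__":
    for workspace in find_existing_workspaces():
        print(workspace)
```

If the list is non-empty, the user can be asked which workspace holds the checkpoint from the earlier PTQ or deployment step before a new one is created.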

Workflow

Config Generation Progress:
- [ ] Step 0: Check workspace (if MODELOPT_WORKSPACE_ROOT is set)
- [ ] Step 1: Check if nel is installed and if user has existing config
- [ ] Step 2: Build the base config file
- [ ] Step 3: Configure model path and parameters
- [ ] Step 4: Fill in remaining missing values
- [ ] Step 5: Confirm tasks (iterative)
- [ ] Step 6: Advanced - Multi-node (Data Parallel)
- [ ] Step 7: Advanced - Interceptors
- [ ] Step 7.5: Check container registry auth (SLURM only)
- [ ] Step 8: Run the evaluation

| `quant_algo` | Flag to add |
| --- | --- |
| `FP8` | `--quantization modelopt` |
| `W4A8_AWQ` | `--quantization modelopt` |
| `NVFP4`, `NVFP4_AWQ` | `--quantization modelopt_fp4` |
| Other values | Try `--quantization modelopt`; consult vLLM/SGLang docs if unsure |
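
The mapping above can be expressed as a small lookup. This is a sketch only: the function name `quantization_flag` and the `(flag, is_known)` return shape are illustrative choices, not part of NeMo Evaluator Launcher, and matching is exact and case-sensitive on the `quant_algo` strings listed in the table.

```python
# Flags taken directly from the table; nothing else is assumed to be supported.
QUANT_FLAGS = {
    "FP8": "--quantization modelopt",
    "W4A8_AWQ": "--quantization modelopt",
    "NVFP4": "--quantization modelopt_fp4",
    "NVFP4_AWQ": "--quantization modelopt_fp4",
}

# The table's "Other values" row: a flag to try, not a guaranteed match.
FALLBACK_FLAG = "--quantization modelopt"


def quantization_flag(quant_algo):
    """Return (flag, is_known) for a quant_algo value.

    is_known is False when the fallback was used, so the caller can
    tell the user to consult the vLLM/SGLang docs.
    """
    if quant_algo in QUANT_FLAGS:
        return QUANT_FLAGS[quant_algo], True
    return FALLBACK_FLAG, False


if __name__ == "__main__":
    for algo in ("FP8", "NVFP4_AWQ", "SOME_OTHER_ALGO"):
        flag, known = quantization_flag(algo)
        note = "" if known else "  (fallback; check the serving framework docs)"
        print(f"{algo}: {flag}{note}")
```

Returning the `is_known` marker alongside the flag keeps the fallback case visible instead of silently treating an unrecognized algorithm as supported.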

| Framework | Default image | Registry |
| --- | --- | --- |
| vLLM | `vllm/vllm-openai:latest` | DockerHub |
| SGLang | `lmsysorg/sglang:latest` | DockerHub |
| TRT-LLM | `nvcr.io/nvidia/tensorrt-llm/release:...` | NGC |
| Evaluation tasks | `nvcr.io/nvidia/eval-factory/*:26.03` | NGC |
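
For the container registry auth check (Step 7.5), it helps to know which registry an image reference points at. The sketch below is a hypothetical helper, not part of the launcher. It relies on the standard Docker convention that the first path component is a registry host only if it contains a `.` or `:` or equals `localhost`, and otherwise the image resolves to DockerHub.

```python
def registry_for_image(image):
    """Classify an image reference as 'NGC', 'DockerHub', or a custom host."""
    if "/" not in image:
        # Bare names such as "ubuntu:22.04" always resolve to DockerHub.
        return "DockerHub"
    first = image.split("/", 1)[0]
    if first == "nvcr.io":
        return "NGC"
    if "." in first or ":" in first or first == "localhost":
        # Some other registry host; report it as-is.
        return first
    # First component is a DockerHub namespace (for example "vllm").
    return "DockerHub"


if __name__ == "__main__":
    images = [
        "vllm/vllm-openai:latest",
        "lmsysorg/sglang:latest",
        "nvcr.io/nvidia/eval-factory/*:26.03",
    ]
    for image in images:
        print(f"{image} -> {registry_for_image(image)}")
```

Grouping a config's images by registry this way shows which credentials need to be in place on the SLURM cluster before submitting the job.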

Evaluation
