Run the LLM extractor benchmark and compare results. Use when the user wants to benchmark, test, or compare LLM extraction performance on manpages.
Run the LLM extractor benchmark tool and compare results against previous runs.
/llm-bench [--model <model>] [--batch <size>] [-d <description>] [--baseline <path>] [files...]
Report paths look like openai/gpt-5-mini.50.list; use such a path with --baseline. When --baseline is omitted, the comparison uses the most recent previous report.

1. Run the benchmark in the background (run_in_background: true). The batch API can take 10–30 minutes to complete; do NOT poll the output. Wait for the background task completion notification before proceeding.

   source /home/idank/dev/vibe/explainshell/.venv/bin/activate && python /home/idank/dev/vibe/explainshell/tools/llm_bench.py run --model <model> --batch <size> -d '<description>' [files...]

2. Compare the results. If the user passed --baseline, use --baseline <path>; otherwise omit it to compare against the previous report.

   source /home/idank/dev/vibe/explainshell/.venv/bin/activate && python /home/idank/dev/vibe/explainshell/tools/llm_bench.py compare [--baseline <path>]
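The comparison falls back to the most recent previous report when --baseline is omitted. A minimal sketch of resolving "most recent" by hand, using a throwaway directory (the directory names and layout here are invented for illustration; they are not the tool's actual reports layout):

```shell
# Sketch: pick the most recently modified run directory as the baseline.
# Directory names are stand-ins, not real benchmark output.
reports=$(mktemp -d)
mkdir -p "$reports/old-run" "$reports/new-run"
touch -t 202001010000 "$reports/old-run"    # backdate the older run
latest=$(ls -td "$reports"/*/ | head -n 1)  # newest mtime sorts first
echo "latest: $latest"
```

Sorting by modification time (`ls -td`) is one plausible definition of "most recent"; the tool may instead track run order in its own metadata.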
After comparing, consider how any changes made in the current session could affect the LLM extraction pipeline, and whether the results make sense in light of those changes.
Raw LLM responses are stored in the run directory alongside the report. For per-file debugging, inspect the response files directly (e.g. cat <run-dir>/find.chunk-0.response.txt).
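As a sketch of the per-file debugging step, assuming a run directory containing *.response.txt files (created here as a stand-in, since a real run directory is produced by the benchmark itself):

```shell
# Stand-in run directory; a real one comes from llm_bench.py run.
run_dir=$(mktemp -d)
printf 'example raw LLM response' > "$run_dir/find.chunk-0.response.txt"

# List all raw response files for the run
ls "$run_dir"/*.response.txt

# Inspect one file's raw response
cat "$run_dir/find.chunk-0.response.txt"
```

The find.chunk-0 naming follows the example in the text above; other manpages and chunks would produce similarly named files.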