Calculate which LLM and embedding models fit in available memory at each quantization level. Requires a hardware profile from hardware-profiler. Use after hardware profiling to determine feasible model sizes.
Compute feasible (model_size, quantization) pairs based on available memory.
Read results/phase1/hardware_profile.json. If it doesn't exist, run the
hardware-profiler skill first.
required_memory_gb = (parameters_billions × bits_per_weight) / 8 + overhead
Where:
- parameters_billions = model parameter count (e.g., 7, 13, 34, 70, 235)
- bits_per_weight = quantization bits (4 for Q4, 5 for Q5, 6 for Q6, 8 for Q8, 16 for FP16)
- overhead = 15% of the raw weight size (covers KV-cache, activations, and framework overhead)

Write the results to results/phase1/memory_budget.json and print them as a table.
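The calculation above can be sketched as follows. This is a minimal illustration, not the skill's actual implementation: the field name available_memory_gb in the hardware profile, the specific model sizes, and the table layout are assumptions for the example.

```python
import json
from pathlib import Path

# Assumed quantization levels and model sizes from the description above.
QUANT_BITS = {"Q4": 4, "Q5": 5, "Q6": 6, "Q8": 8, "FP16": 16}
MODEL_SIZES_B = [7, 13, 34, 70, 235]  # parameter counts in billions
OVERHEAD = 0.15  # 15% of raw weight size: KV-cache, activations, framework

def required_memory_gb(params_b: float, bits: int) -> float:
    """required_memory_gb = (parameters_billions * bits_per_weight) / 8 + overhead."""
    raw = params_b * bits / 8  # raw weight size in GB
    return raw * (1 + OVERHEAD)

def feasible_pairs(available_gb: float) -> list[dict]:
    """Return every (model_size, quantization) pair with its memory verdict."""
    rows = []
    for params_b in MODEL_SIZES_B:
        for quant, bits in QUANT_BITS.items():
            need = required_memory_gb(params_b, bits)
            rows.append({
                "model_b": params_b,
                "quant": quant,
                "required_gb": round(need, 2),
                "fits": need <= available_gb,
            })
    return rows

if __name__ == "__main__":
    profile_path = Path("results/phase1/hardware_profile.json")
    if profile_path.exists():
        profile = json.loads(profile_path.read_text())
        available = profile["available_memory_gb"]  # assumed field name
        rows = feasible_pairs(available)
        out_path = Path("results/phase1/memory_budget.json")
        out_path.parent.mkdir(parents=True, exist_ok=True)
        out_path.write_text(json.dumps(rows, indent=2))
        print(f"{'Model (B)':>9}  {'Quant':>5}  {'Req. GB':>8}  Fits")
        for r in rows:
            print(f"{r['model_b']:>9}  {r['quant']:>5}  "
                  f"{r['required_gb']:>8.2f}  {'yes' if r['fits'] else 'no'}")
```

For instance, a 7B model at Q4 needs (7 × 4) / 8 = 3.5 GB of raw weights, or about 4.03 GB after the 15% overhead.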