Select and apply model quantization formats (GGUF, GPTQ, AWQ, bitsandbytes) with appropriate bit widths, calibration data, and quality-latency tradeoffs. Use when choosing quantization format for deployment, converting models between formats, tuning bitsandbytes config, or evaluating quantized model quality. Do not use for training, fine-tuning, or inference serving configuration unrelated to quantization.
Use this skill to select the right quantization format and bit width for a given model, hardware target, and quality requirement — then execute the conversion, validate output quality, and document the tradeoff decision.
Use this skill when:

- Choosing a quantization format and bit width for a deployment target
- Converting models between formats with llama.cpp/quantize, auto-gptq, autoawq, or transformers with bitsandbytes
- Tuning a bitsandbytes config
- Evaluating quantized model quality against an FP16 baseline

**Profile the deployment constraints.** Record the target hardware (GPU model and VRAM, CPU cores, RAM), the maximum acceptable latency (tokens/sec), the maximum memory budget, and whether the model must run fully on GPU, fully on CPU, or split between the two.
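The constraints from this step can be captured in a single record before any conversion work begins; a minimal sketch, where the class and field names are illustrative rather than part of any library:

```python
from dataclasses import dataclass
from typing import Optional

@dataclass
class DeploymentProfile:
    """Constraints recorded before choosing a quantization format."""
    gpu_model: Optional[str]    # None for CPU-only targets
    vram_gb: float
    cpu_cores: int
    ram_gb: float
    min_tokens_per_sec: float   # latency requirement expressed as throughput
    max_memory_gb: float
    placement: str              # "gpu", "cpu", or "split"

# Hypothetical target: a single 24 GB consumer GPU
profile = DeploymentProfile("RTX 4090", 24.0, 16, 64.0, 30.0, 20.0, "gpu")
```

Every later decision (format, bit width, accept/reject) can then reference this one object.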
**Estimate model memory at each bit width.** Use the formula: memory_gb ≈ (params_billions × bits_per_weight) / 8 + kv_cache_overhead. For a 7B model at Q4, that gives 7 × 4 / 8 = 3.5 GB of weights (≈4 GB in practice, since Q4_K_M keeps some tensors at higher precision), plus KV cache. Compare against available VRAM.
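The formula above can be sketched as a small helper; the function name and the KV-cache argument are illustrative:

```python
def estimate_memory_gb(params_billions: float, bits_per_weight: float,
                       kv_cache_gb: float = 0.0) -> float:
    """memory_gb ≈ (params_billions × bits_per_weight) / 8, plus KV-cache overhead."""
    return params_billions * bits_per_weight / 8 + kv_cache_gb

# 7B model at 4 bits: 3.5 GB of weights before KV cache
print(estimate_memory_gb(7, 4))  # 3.5
```

Running it across candidate bit widths gives a quick table to compare against the VRAM budget.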
**Select the quantization format based on serving runtime.**

- GGUF via llama.cpp/quantize (CPU or CPU/GPU-split inference)
- auto-gptq (GPU-only, fast inference with Marlin/ExLlama kernels)
- autoawq (GPU-only, slightly better quality than GPTQ at the same bit width)
- bitsandbytes (GPU-only, on-the-fly quantization when loading with transformers; no conversion artifact)

**Prepare the calibration dataset (GPTQ/AWQ only).**
Select 128–512 representative samples from the target domain. Use c4 or wikitext as a fallback if domain data is unavailable. Ensure the samples cover the range of expected input lengths.
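One way to satisfy the "cover the range of input lengths" requirement is a length-stratified draw. A sketch, where the helper name and sampling strategy are assumptions rather than anything auto-gptq or autoawq require:

```python
import random

def select_calibration_samples(corpus, n_samples=256, seed=0):
    """Pick a reproducible subset that spans short and long inputs."""
    rng = random.Random(seed)
    ranked = sorted(corpus, key=len)          # order texts by length
    step = max(1, len(ranked) // n_samples)   # evenly spaced length strata
    samples = ranked[::step][:n_samples]
    rng.shuffle(samples)                      # avoid length-ordered batches
    return samples

# Toy corpus with lengths 1..100; draw 10 length-spread samples
corpus = [str(i % 10) * i for i in range(1, 101)]
calib = select_calibration_samples(corpus, n_samples=10)
```

The resulting list can be tokenized and passed as the `calibration_data` used in the quantization calls below.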
**Execute the quantization.**

- GGUF: `./quantize input.gguf output.gguf Q4_K_M`
- GPTQ: `auto_gptq.AutoGPTQForCausalLM.from_pretrained(...).quantize(calibration_data, bits=4, group_size=128)`
- AWQ: `awq.AutoAWQForCausalLM.from_pretrained(...).quantize(calibration_data, quant_config={"w_bit": 4, "q_group_size": 128})`
- bitsandbytes: `BitsAndBytesConfig(load_in_4bit=True, bnb_4bit_quant_type="nf4", bnb_4bit_compute_dtype=torch.bfloat16)`

**Evaluate quality against the FP16 baseline.** Run perplexity on a held-out set. Run 3–5 representative task prompts and compare output quality. Measure accuracy on a relevant benchmark (e.g., an MMLU subset or domain-specific QA). Accept if degradation is <2% on the primary metric.
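Of the options above, bitsandbytes quantizes at load time instead of producing a converted artifact. A minimal config sketch expanding the one-liner above; the double-quant flag is an addition beyond the text, and the model id would be your own:

```python
import torch
from transformers import BitsAndBytesConfig

# Starting-point 4-bit config for serving through transformers
bnb_config = BitsAndBytesConfig(
    load_in_4bit=True,
    bnb_4bit_quant_type="nf4",               # NF4 generally beats plain fp4 on quality
    bnb_4bit_compute_dtype=torch.bfloat16,   # run matmuls in bf16
    bnb_4bit_use_double_quant=True,          # optional: also quantize the quant constants
)
# Pass as: AutoModelForCausalLM.from_pretrained(model_id, quantization_config=bnb_config)
```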
**Document the decision.** Record: source model, quantization format, bit width, calibration data, quality delta vs FP16, memory footprint, and throughput measured on target hardware.
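The <2% acceptance rule and the fields listed above can be combined into one reproducible record; all values here are hypothetical placeholders:

```python
def accept_quantized(baseline_ppl: float, quant_ppl: float,
                     max_rel_degradation: float = 0.02) -> bool:
    """Accept if the primary metric degrades by less than 2% vs the FP16 baseline."""
    return (quant_ppl - baseline_ppl) / baseline_ppl < max_rel_degradation

# Hypothetical decision record with the fields from the step above
record = {
    "source_model": "example/model-7b",   # placeholder name
    "format": "GGUF",
    "quant_type": "Q4_K_M",
    "calibration_data": "none (GGUF k-quants need no calibration set)",
    "baseline_ppl": 5.60,
    "quant_ppl": 5.68,
    "memory_footprint_gb": 4.1,
    "throughput_tok_s": 42.0,
    "hardware": "24 GB GPU",
}
record["accepted"] = accept_quantized(record["baseline_ppl"], record["quant_ppl"])
print(record["accepted"])  # True: ~1.4% degradation is under the 2% threshold
```

Checking the record into the repo next to the quantized artifact makes the tradeoff auditable later.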
Defaults:

- Q4_K_M (GGUF) or 4-bit with group_size=128 (GPTQ/AWQ) as the starting point
- Q5_K_M or Q5_K_S when Q4 shows >2% quality degradation on the primary evaluation metric
- bitsandbytes NF4 when serving through transformers directly

Report the following:

- Hardware Profile — GPU/CPU specs, VRAM, RAM, and target throughput
- Format Selection Rationale — chosen format, bit width, and why alternatives were rejected
- Quantization Command or Config — exact command or code to reproduce the conversion
- Quality Evaluation Results — perplexity delta, task accuracy comparison, and sample output diffs
- Deployment Artifact — path to the quantized model file and its measured memory footprint

Read these only when relevant:
- references/gguf-quant-types.md
- references/gptq-awq-comparison.md
- references/bitsandbytes-config.md

Related skills: local-llm, ollama, vllm-serving, llama-cpp

Avoid using a generic calibration set (e.g., c4) for a domain-specific model when domain data is available.