Merge multiple fine-tuned LLM checkpoints using mergekit with methods like linear interpolation, SLERP, TIES, DARE, task arithmetic, and frankenmerging. Use when combining specialized model capabilities without retraining — e.g., merging a code model with a chat model. Do not use for training, LoRA adapter composition, or inference serving.
Combine multiple fine-tuned model checkpoints into a single model using weight-space merging techniques via mergekit, selecting the right merge strategy and validating results with benchmark evaluation.
Use this skill when:
- Combining the capabilities of multiple fine-tunes of a shared base model without retraining
- Computing task vectors (`fine_tuned - base`) and adding or subtracting them

Do not use when the goal is training itself — route to fine-tuning or pretraining-pipeline — or when the question is about model internals, which belongs to model-architecture.

Merge methods:

- **Linear** (`merge_method: linear`): Weighted average of parameters. Simple, and works well for similar models. Config: `weight: 0.6` per model.
- **SLERP** (`merge_method: slerp`): Spherical interpolation between exactly two models. Smoother blending, better for dissimilar fine-tunes. Set `t: 0.5` for an equal blend.
- **TIES** (`merge_method: ties`): Trim small deltas, elect a sign by majority, then merge. Best when models have conflicting parameter updates. Set `density: 0.5` to keep the top 50% of delta magnitudes.
- **DARE** (`merge_method: dare_ties`): Randomly drop delta elements and rescale the survivors. Effective for merging many models. Set `density: 0.3` for aggressive sparsification.
- **Task arithmetic**: `task_vector = fine_tuned_weights - base_weights`, then `merged = base + α * task_vector_A + β * task_vector_B`.
- **Frankenmerging** (`merge_method: passthrough`): Interleave layers from different models to create a deeper model. E.g., take layers 0-15 from model A and layers 8-23 from model B to make a 32-layer hybrid.

Workflow: verify that the source models are compatible by comparing `model.config.to_dict()` across them, then write a YAML config with `merge_method`, `slices` (layer ranges), `models` with paths and weights, and `parameters` (`density`, `t`). Run `mergekit-yaml config.yaml ./output_dir --cuda --trust-remote-code`.

Outputs:

- **Merge Config** — complete mergekit YAML with method, models, weights, slice ranges, and parameters
- **Source Model Inventory** — list of source models with their base, fine-tune domain, and architecture hash
- **Evaluation Comparison** — table of benchmark scores: each source model vs. the merged model
- **Method Rationale** — why the chosen merge method suits these specific models

Related skills:

- model-architecture — to verify source models share a compatible architecture
- fine-tuning — the upstream process that produces models to be merged
- safety-alignment — merged models may lose alignment; re-evaluate safety post-merge
- serving-architecture — for deploying the merged checkpoint

Edge cases:

- **Architecture mismatch**: if source configs differ in critical fields (e.g., `hidden_size`, `num_layers`, or `vocab_size`), abort and report which fields differ.
- **Out of memory**: use the `--lazy-unpickle` and `--low-cpu-memory` flags, or merge on CPU by omitting `--cuda`.
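A minimal config for the workflow above might look like the following SLERP sketch; `org/code-model` and `org/chat-model` are placeholder Hugging Face model IDs, and the 32-layer range assumes both models share that depth:

```yaml
# Equal-weight SLERP blend of two fine-tunes of the same base.
merge_method: slerp
base_model: org/code-model        # placeholder model ID
slices:
  - sources:
      - model: org/code-model
        layer_range: [0, 32]
      - model: org/chat-model     # placeholder model ID
        layer_range: [0, 32]
parameters:
  t: 0.5                          # 0.0 = all code-model, 1.0 = all chat-model
dtype: bfloat16
```

Saved as `config.yaml`, this is what `mergekit-yaml config.yaml ./output_dir` consumes.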
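The task-arithmetic formula above can be sketched in a few lines. This toy version uses plain dicts of floats in place of tensor state dicts — the arithmetic is identical, and `task_arithmetic` is a hypothetical helper name, not a mergekit API:

```python
def task_arithmetic(base, fine_tuned_models, alphas):
    """Merge via task vectors: merged = base + sum(alpha_i * (ft_i - base)).

    base and each entry of fine_tuned_models map parameter names to
    values; alphas are the per-model scaling coefficients.
    """
    merged = {}
    for name, base_w in base.items():
        # Each fine-tune contributes its scaled delta from the base.
        delta = sum(alpha * (ft[name] - base_w)
                    for ft, alpha in zip(fine_tuned_models, alphas))
        merged[name] = base_w + delta
    return merged
```

With real checkpoints the same loop runs per-tensor over `state_dict()` entries instead of floats.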
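The architecture-mismatch check can be automated with a small helper over the `model.config.to_dict()` output. A sketch, assuming the Llama-style key names (exact field names vary by architecture); `incompatible_fields` is a hypothetical helper, not part of mergekit:

```python
def incompatible_fields(cfg_a, cfg_b,
                        keys=("hidden_size", "num_hidden_layers", "vocab_size")):
    """Return {field: (value_a, value_b)} for config fields that differ.

    cfg_a / cfg_b are plain dicts, e.g. from model.config.to_dict().
    An empty result means the checked fields match and the merge can proceed;
    otherwise abort and report the returned fields.
    """
    return {k: (cfg_a.get(k), cfg_b.get(k))
            for k in keys if cfg_a.get(k) != cfg_b.get(k)}
```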