Use when writing, modifying, porting, or optimizing CuTe DSL GPU kernels in Python; reading CuTe DSL API reference material; integrating a CuTe DSL kernel into a project; or rewriting an existing CUDA or C++ operator into CuTe DSL while preserving correctness and performance expectations.
Use the bundled CuTe DSL API snapshots in this skill and the workspace CUTLASS checkout to design, implement, debug, and integrate CuTe DSL GPU kernels in a way that is reusable across projects, including cache-dit.
Use this skill when you need to:

- Write, modify, port, or optimize CuTe DSL GPU kernels in Python
- Read or interpret CuTe DSL API reference material
- Integrate a CuTe DSL kernel into a project
- Rewrite an existing CUDA or C++ operator into CuTe DSL while preserving correctness and performance expectations
Do not use this skill for:
Related skills: `cutlass-cpp-kernel`, `cuda-cpp-kernel`, `operator-migration`.

Read the relevant API reference files before writing kernel code.
Do not guess CuTe DSL APIs or architecture helpers from memory when the bundled docs or workspace CUTLASS examples can answer the question precisely.
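As a concrete starting point, the overall shape of a CuTe DSL entry point looks roughly like the sketch below. The decorator and helper names (`cute.kernel`, `cute.jit`, `cute.arch.thread_idx`, `.launch`) follow the public CuTe DSL examples, but treat every signature here as an assumption to verify against the bundled docs; the import is guarded so the plain-Python grid helper stands on its own.

```python
# Sketch only: verify the cutlass.cute API names against the bundled
# reference files before relying on them.
try:
    import cutlass.cute as cute  # CuTe DSL, if installed
    HAVE_CUTE_DSL = True
except ImportError:
    HAVE_CUTE_DSL = False

def launch_grid(m, n, tile_m, tile_n):
    """Plain-Python ceil-division grid sizing for an (m, n) problem."""
    return ((m + tile_m - 1) // tile_m, (n + tile_n - 1) // tile_n, 1)

if HAVE_CUTE_DSL:
    @cute.kernel
    def add_kernel(gA: cute.Tensor, gB: cute.Tensor, gC: cute.Tensor):
        # Per-thread work would be partitioned from the tensors here;
        # see the bundled cute.md for the real partitioning APIs.
        tidx, _, _ = cute.arch.thread_idx()

    @cute.jit
    def add_launch(mA: cute.Tensor, mB: cute.Tensor, mC: cute.Tensor):
        add_kernel(mA, mB, mC).launch(grid=[1, 1, 1], block=[128, 1, 1])
```

The guarded import keeps the skeleton importable on machines without the DSL installed, which is useful when sketching structure before touching a GPU box.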
Use Copilot-friendly sibling-file references for bundled docs in this skill, for example:
- `cute.md`
- `cute_runtime.md`
- `utils.md`
- `cute_nvgpu_tcgen05.md`
- `pipeline.md`

Use workspace-relative paths for CUTLASS sources, for example:
- `vipshop/cutlass/python/CuTeDSL/`
- `vipshop/cutlass/examples/python/CuTeDSL/`
- `vipshop/cutlass/python/pycute/`
- `vipshop/cutlass/include/cute/`
- `vipshop/cutlass/media/docs/pythonDSL/`

Do not use agent-specific skill paths or placeholder-driven argument text in the final skill content.
Core API references:
- `cute.md` — core CuTe DSL types and tensor or layout operations
- `cute_runtime.md` — runtime helpers and data interop
- `utils.md` — helper utilities and hardware info

Architecture-specific references:
- `cute_nvgpu.md` — architecture API index
- `cute_nvgpu_warp.md` — warp-level APIs for SM80 to SM89
- `cute_nvgpu_warpgroup.md` — warpgroup APIs for SM90
- `cute_nvgpu_tcgen05.md` — tcgen05 and SM100+ APIs
- `cute_nvgpu_cpasync.md` — async-copy APIs
- `cute_arch.md` — low-level architecture primitives
- `utils_sm90.md` and `utils_sm100.md` — architecture helpers

Pipeline and overview:
- `pipeline.md`
- `intro.md`

Additional workflow and concept references from the workspace CUTLASS docs:
- `vipshop/cutlass/media/docs/pythonDSL/overview.rst` — high-level positioning of CUTLASS DSLs and how CuTe DSL relates to CUTLASS C++
- `vipshop/cutlass/media/docs/pythonDSL/quick_start.rst` — environment, install, and setup assumptions
- `vipshop/cutlass/media/docs/pythonDSL/functionality.rst` — supported dtypes, architectures, and current feature scope
- `vipshop/cutlass/media/docs/pythonDSL/limitations.rst` — current CuTe DSL limitations and unsupported cases
- `vipshop/cutlass/media/docs/pythonDSL/faqs.rst` — common issues and expected behavior
- `vipshop/cutlass/media/docs/pythonDSL/cute_dsl.rst` — CuTe DSL workflow overview
- `vipshop/cutlass/media/docs/pythonDSL/cute_dsl_api.rst` — API documentation entrypoint
- `vipshop/cutlass/media/docs/pythonDSL/cute_dsl_general/dsl_introduction.rst` — DSL programming model and mental model
- `vipshop/cutlass/media/docs/pythonDSL/cute_dsl_general/dsl_control_flow.rst` — control-flow semantics and restrictions
- `vipshop/cutlass/media/docs/pythonDSL/cute_dsl_general/dsl_dynamic_layout.rst` — static vs dynamic layout handling
- `vipshop/cutlass/media/docs/pythonDSL/cute_dsl_general/dsl_jit_arg_generation.rst` — JIT argument typing and signature generation
- `vipshop/cutlass/media/docs/pythonDSL/cute_dsl_general/dsl_jit_caching.rst` — JIT cache behavior
- `vipshop/cutlass/media/docs/pythonDSL/cute_dsl_general/dsl_jit_compilation_options.rst` — compilation flags and debugging options
- `vipshop/cutlass/media/docs/pythonDSL/cute_dsl_general/framework_integration.rst` — framework interop patterns
- `vipshop/cutlass/media/docs/pythonDSL/cute_dsl_general/dsl_ahead_of_time_compilation.rst` — AOT compilation and export flow
- `vipshop/cutlass/media/docs/pythonDSL/cute_dsl_general/debugging.rst` — debugging workflow and generated-artifact inspection
- `vipshop/cutlass/media/docs/pythonDSL/cute_dsl_general/autotuning_gemm.rst` — autotuning guidance for GEMM kernels

These workspace docs are especially valuable when the bundled API snapshots are too terse for workflow, compilation, debugging, or integration questions.
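The static-vs-dynamic distinction that `dsl_dynamic_layout.rst` and `dsl_jit_caching.rst` describe can be pictured with a toy cache key: static arguments specialize the compiled kernel (one cache entry per value), while dynamic arguments are abstracted to their type. This is an illustration of the concept only, not the DSL's actual cache-key implementation.

```python
# Toy model of JIT-cache behavior: static args participate by value,
# dynamic args only by type, so shape-agnostic kernels are reused.
def cache_key(args, static_names):
    key = []
    for name, value in sorted(args.items()):
        if name in static_names:
            key.append((name, value))                 # specializes the kernel
        else:
            key.append((name, type(value).__name__))  # shape-agnostic
    return tuple(key)
```

Two calls that differ only in a dynamic argument share a cache entry in this model; changing a static argument forces a recompile, which is the trade-off the workspace docs ask you to weigh.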
CUDA architecture and profiling references bundled in this skill:
- `sm89-optimization-guide.md`
- `sm90-optimization-guide.md`
- `sm100-optimization-guide.md`
- `sm103-optimization-guide.md`
- `sm120-optimization-guide.md`
- `troubleshooting.md`

Use these files when interpreting `nsys` and `ncu` results for generated CuTe DSL kernels on different GPU families.
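Typical Nsight invocations for profiling a CuTe DSL benchmark script look like the helpers below. The `nsys profile --trace` and `ncu --set full` flags are standard Nsight Systems/Compute options, but confirm them against the installed tool versions; the script name is a placeholder.

```python
# Build typical nsys/ncu command lines for a Python benchmark script.
# Flags shown are common Nsight options; verify against your installed
# nsys and ncu versions before running.
def nsys_cmd(script, out="cute_timeline"):
    # Timeline capture: launch gaps, overlap, copy/compute imbalance.
    return ["nsys", "profile", "--trace=cuda,nvtx", "-o", out,
            "python", script]

def ncu_cmd(script, out="cute_counters"):
    # Counter capture: occupancy, throughput, stalls, register pressure.
    return ["ncu", "--set", "full", "-o", out, "python", script]
```

Run the `nsys` pass first to find where time goes end to end, then narrow to individual kernels with `ncu`, matching the recommended order below.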
Use the workspace CUTLASS checkout for source examples and implementation patterns.
Key locations:
- `vipshop/cutlass/python/CuTeDSL/` — CuTe DSL implementation sources
- `vipshop/cutlass/examples/python/CuTeDSL/` — CuTe DSL examples by architecture and topic
- `vipshop/cutlass/python/pycute/` — pycute helpers and layout utilities
- `vipshop/cutlass/include/cute/` — CuTe C++ headers for semantic grounding

Use the shell path `/workspace/dev/vipshop/cutlass` only when you need a literal command path.
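The pycute directory implements CuTe's layout algebra in pure Python, which makes it a good place to sanity-check layout reasoning without a GPU. A simplified analogue of the coordinate-to-offset map is sketched below; real CuTe layouts are hierarchical (nested shapes and strides), and `crd2idx` here is a hypothetical flat helper, not the pycute API itself.

```python
# Simplified analogue of CuTe's coordinate-to-offset map:
#   offset = sum(coord[i] * stride[i])
# Real CuTe/pycute layouts allow nested (hierarchical) shapes and
# strides; this flat version only shows the core idea.
def crd2idx(coord, shape, stride):
    assert all(0 <= c < s for c, s in zip(coord, shape))
    return sum(c * d for c, d in zip(coord, stride))

# A 4x8 column-major layout, CuTe's default convention:
# shape (4, 8) with stride (1, 4).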
CuTe DSL kernels often need architecture-aware profiling because the generated kernel structure can look similar across GPU generations while the correct bottleneck diagnosis differs by generation.
Use the bundled optimization guides as follows:
- On sm89 and sm120, prioritize memory throughput, L2 hit rate, occupancy, and fusion opportunities; these targets do not have TMA, TMEM, or cluster features.
- On sm90, inspect whether TMA-style overlap, warpgroup execution, and shared-memory staging are actually visible in the timeline and counters.
- On sm100 and sm103, inspect whether tcgen05 or WGMMA, TMEM, TMA v2, and cluster-capable execution are being used effectively.

Recommended profiling order:
1. Read the matching `smXX-optimization-guide.md` file.
2. Run `nsys` to identify launch gaps, missing overlap, copy or compute imbalance, and end-to-end bottlenecks.
3. Run `ncu` to inspect occupancy, memory throughput, L2 hit rate, register pressure, shared-memory pressure, tensor core utilization, and stall reasons.

Before writing code, answer these questions:
Then work in this order:
- Consult `vipshop/cutlass/media/docs/pythonDSL/` when the question is about control flow, JIT behavior, debugging, AOT, integration, or limitations.
- Study the architecture-matched examples under `vipshop/cutlass/examples/python/CuTeDSL/`.

When tuning the generated kernel, treat the bundled `smXX-optimization-guide.md` files as first-line references for interpreting profiling output rather than relying only on generic CUDA advice.
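The examples directory leans heavily on tiling arithmetic that CuTe expresses with operations like `local_tile` and `zipped_divide`. A plain-Python sketch of the underlying bookkeeping, mapping a CTA index to its tile's start and (possibly partial) extent, is shown below; `tile_extent` is a hypothetical helper, not a DSL API.

```python
# Plain-Python sketch of per-CTA tile bookkeeping: where a CTA's tile
# starts and how many valid elements it covers, including the ragged
# last tile. Illustration only, not a CuTe DSL API.
def tile_extent(cta, tile, total):
    """Start offset and (possibly partial) extent of one CTA's tile."""
    start = cta * tile
    return start, max(0, min(tile, total - start))
```

Keeping this arithmetic explicit while reading the examples makes it easier to see where the DSL's layout operations are doing the same work for you.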
Keep integration guidance generic unless the target repository requires a specific loader or manifest format.
For cache-dit or other repositories:
Related skill: `operator-migration`.

When rewriting an existing operator into CuTe DSL:
Use `cutlass-cpp-kernel` alongside this skill when you need C++ CUTLASS or CuTe source study to understand the original design.
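Boundary predication is a frequent divergence point between a CUDA original and its CuTe DSL port. A plain-Python model of partial-tile predicate coverage is sketched below; it mirrors the role of predicate tensors (such as `cute.elem_less`-style guards) in CuTe DSL examples, but is only an illustration with a hypothetical helper name.

```python
# Plain-Python model of partial-tile predication: each lane's element
# is guarded by a bounds check so out-of-range lanes are masked off
# on loads and stores. Illustration only, not a CuTe DSL API.
def tile_predicates(tile_start, tile_size, extent):
    return [tile_start + i < extent for i in range(tile_size)]
```

When a port only fails on shapes that are not multiples of the tile size, comparing the original kernel's guards against this kind of mask is a fast way to spot missing predicate coverage.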
- If the kernel uses `cp.async`, pipeline stages, or other asynchronous data movement, treat synchronization as a primary suspect. When only specific shapes or pipeline configurations produce bad outputs, first inspect barrier placement, shared-stage reuse, and predicate coverage on partial-tile loads or stores.

Every operator or kernel task completed under this skill must include validation.
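As one concrete shape such validation can take, a minimal tolerance-based comparison harness is sketched below in pure Python; in real use, `ref` would come from the original operator and `out` from the CuTe DSL port, and the tolerances would be chosen per dtype.

```python
# Minimal validation-harness sketch: compare kernel output against a
# reference with absolute and relative tolerances, returning the worst
# excess error. A result <= 0 means every element is within tolerance.
def max_mismatch(out, ref, atol=1e-5, rtol=1e-3):
    worst = float("-inf")
    for o, r in zip(out, ref):
        excess = abs(o - r) - (atol + rtol * abs(r))
        worst = max(worst, excess)
    return worst
```

Reporting the worst excess rather than a bare pass/fail makes tolerance regressions visible across shapes and dtypes.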
Minimum requirements:
Additional requirement for rewrites or migrations:
When you finish a task using this skill, report: