Source extension: .py | Compiler: TileLang JIT -> PTX via TVM | Group: python-jit

Environment Validation

python3 -c "import tilelang; print(tilelang.__version__)"
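The one-liner above stops with a raw traceback when TileLang is missing or its install is broken. A minimal sketch of a check that reports the same information as data, using only the standard library; `check_module` is a hypothetical helper name, not part of TileLang:

```python
import importlib
import importlib.util


def check_module(name):
    """Return availability and version info for a top-level module name."""
    report = {"module": name, "available": False, "version": None, "error": None}
    try:
        # find_spec locates the module without importing it.
        if importlib.util.find_spec(name) is None:
            report["error"] = "module not found"
            return report
        mod = importlib.import_module(name)
    except Exception as exc:  # a broken install can still fail at import time
        report["error"] = repr(exc)
        return report
    report["available"] = True
    # Not every package defines __version__, so fall back to None.
    report["version"] = getattr(mod, "__version__", None)
    return report


if __name__ == "__main__":
    print(check_module("tilelang"))
```

Returning a dictionary instead of printing lets a calling script decide whether a missing module should abort the run.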

BUILD / RUN Templates

build_iter<NNN>.sh (syntax/import check):

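#!/usr/bin/env bash
set -euo pipefail  # stop at the first failing check; fail if SRC is unset
# Step 1: syntax check only; nothing in the file is executed.
python3 -c "import ast; ast.parse(open('$SRC').read()); print('parse OK')"
# Step 2: import check; runs the module's top-level code.
python3 -c "
import importlib.util
spec = importlib.util.spec_from_file_location('kernel', '$SRC')
mod = importlib.util.module_from_spec(spec); spec.loader.exec_module(mod)
print('import OK')
"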

JIT compile errors surface at the first run, not at import, so a passing build script does not prove the kernel compiles. Treat a first-run JIT failure the same as a compile failure (bounded retry budget, then attempt<AAAA>).
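The bounded retry budget can be sketched as a small loop. This is an illustration under assumptions, not the workflow's actual driver: `run_with_budget`, `first_run`, and `repair` are hypothetical names, and what happens once the budget is exhausted (the `attempt<AAAA>` step) is left to the caller.

```python
def run_with_budget(first_run, repair, max_attempts=3):
    """Call first_run() up to max_attempts times, calling repair(exc) between failures.

    Both callables are hypothetical stand-ins: first_run launches the kernel once
    (which is where JIT compilation happens) and repair edits the source.
    """
    errors = []
    for attempt in range(1, max_attempts + 1):
        try:
            result = first_run()
        except Exception as exc:  # a JIT error counts like a compile error
            errors.append(f"attempt {attempt}: {exc!r}")
            if attempt < max_attempts:
                repair(exc)  # no repair after the last allowed attempt
            continue
        return {"ok": True, "attempts": attempt, "result": result, "errors": errors}
    # Budget exhausted: the caller decides how to proceed.
    return {"ok": False, "attempts": max_attempts, "result": None, "errors": errors}
```

Keeping the error log in the return value makes it possible to record why each retry was spent, whether or not the run eventually succeeds.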

run_iter<NNN>.sh:

IDEA Menu

Bottleneck	Ideas
Memory-bound	Increase `block_M` / `block_N`; add `num_stages`
Compute-bound	Use `T.mma` with a larger tile; tune `thread_num`
Latency-bound	Increase `num_stages`; use `T.Pipelined` with prefetch
L2 locality	Try `T.use_swizzle` with different swizzle factors
Shared memory	Pad shared memory (`pad=N`) to avoid bank conflicts
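Larger tiles and more pipeline stages both consume shared memory, so the first rows of the table cannot be pushed indefinitely. A rough sketch of enumerating candidate configurations that still fit, assuming a GEMM-style kernel that stages one A tile and one B tile per pipeline stage, float16 inputs, and a 48 KiB shared memory budget per block. The helper names, the footprint formula, and the budget are all assumptions for illustration, not TileLang APIs:

```python
from itertools import product

DTYPE_BYTES = 2          # assumption: float16 inputs
SMEM_BUDGET = 48 * 1024  # assumption: 48 KiB of shared memory per block


def smem_bytes(block_m, block_n, block_k, num_stages):
    """Estimate shared memory for one A tile and one B tile per pipeline stage."""
    per_stage = (block_m * block_k + block_k * block_n) * DTYPE_BYTES
    return per_stage * num_stages


def candidate_configs(block_ms, block_ns, block_k, stages):
    """Return configs that fit the budget, largest output tile first."""
    fits = []
    for bm, bn, ns in product(block_ms, block_ns, stages):
        used = smem_bytes(bm, bn, block_k, ns)
        if used <= SMEM_BUDGET:
            fits.append({
                "block_M": bm,
                "block_N": bn,
                "block_K": block_k,
                "num_stages": ns,
                "smem_bytes": used,
            })
    # Prefer bigger output tiles, then deeper pipelines.
    fits.sort(key=lambda c: (c["block_M"] * c["block_N"], c["num_stages"]), reverse=True)
    return fits


if __name__ == "__main__":
    for cfg in candidate_configs([64, 128], [64, 128], 32, [2, 3, 4]):
        print(cfg)
```

Each surviving configuration still has to be measured; the filter only removes combinations that would not fit under the stated assumptions.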
