Full pipeline for turning a SYCL/ESIMD GPU kernel into a Python-importable wheel package on Windows with Intel oneAPI 2025.x and conda. Covers every layer of the stack: ESIMD kernel (.cpp/.h) → Windows DLL (icpx) → PyTorch C++ extension (.pyd, CMake) → Python package → wheel (.whl, scikit-build-core). Use this skill whenever the user is working on Intel Arc GPU (Xe2 / BMG / PTL-H) SYCL or ESIMD kernels and wants to expose them to Python, package them as a wheel, set up a build script, debug build failures, or understand how the DLL + .pyd + wheel layers fit together. Also use it when they hit Windows-specific build issues like setvars.bat failing, cmake.exe producing no output, or ur_api.h not found.
```
lgrf_uni/kernels.cpp ──icpx──► my_kernel.dll + my_kernel.lib
                                    │
csrc/entry.cpp                      │ (linked via .lib)
csrc/wrapper.cpp ───cmake──► _ext.cp311-win_amd64.pyd
                                    │
python/my_package/                  │ (copied in)
  __init__.py    ───build──► dist/my_package-0.0.1-cp311-abi3-win_amd64.whl
  _ext.*.pyd
  my_kernel.dll
```
Two compilation passes, always in this order:

1. **DLL pass** — `icpx -fsycl`, AOT-compiled for the target GPU.
2. **.pyd pass** — `icx` (host-only), links against the DLL's `.lib`.

Project layout:

```
my_kernel/
├── CMakeLists.txt
├── pyproject.toml
├── build.bat                  ← all-in-one build script
├── run_build.bat              ← Claude Code launcher (sets conda env, calls build.bat)
├── lgrf_uni/
│   ├── esimd_kernel_api.h     ← dllexport / dllimport macro
│   ├── kernels.cpp            ← extern "C" dispatchers, sycl::queue interop
│   └── single_kernels/        ← header-only kernel implementations
│       └── my_kernel.h
├── csrc/
│   ├── entry.cpp              ← PYBIND11_MODULE registrations
│   ├── my_kernel_wrapper.cpp  ← thin C++ wrapper (Tensor checks → DLL call)
│   └── utils.h                ← get_queue() helper
├── python/my_package/
│   ├── __init__.py            ← DLL preload + _ext import
│   └── version.py
└── test/
    └── test_my_kernel.py
```
**`lgrf_uni/esimd_kernel_api.h`**

```cpp
#pragma once
#ifdef BUILD_ESIMD_KERNEL_LIB
#define ESIMD_KERNEL_API __declspec(dllexport)
#else
#define ESIMD_KERNEL_API __declspec(dllimport)
#endif
```
Define `-DBUILD_ESIMD_KERNEL_LIB` only when compiling the DLL itself, never when compiling code that links against it — on the consumer side the macro must resolve to `dllimport`.
**`lgrf_uni/kernels.cpp`**

```cpp
#include <sycl/sycl.hpp>
#include <sycl/ext/intel/esimd.hpp>
#include "esimd_kernel_api.h"
#include "single_kernels/my_kernel.h"  // header-only ESIMD implementation

extern "C" ESIMD_KERNEL_API void my_kernel(
    void* input, void* output, int N,
    void* sycl_queue_ptr)  // pass PyTorch's queue as void*
{
    sycl::queue& q = *reinterpret_cast<sycl::queue*>(sycl_queue_ptr);
    // launch via q.submit(...) using the header-only kernel
    q.submit([&](sycl::handler& cgh) {
        cgh.parallel_for(sycl::nd_range<1>(...), [=](sycl::nd_item<1> ndi)
            SYCL_ESIMD_KERNEL { my_kernel_impl(..., ndi); });
    }).wait();
}
```
Rules:

- `extern "C"` (no C++ name mangling) + `ESIMD_KERNEL_API` on every exported dispatcher.
- `void* sycl_queue_ptr`, cast to `sycl::queue*` — this is how you share the queue with PyTorch.
- Kernel bodies live in `single_kernels/`; `kernels.cpp` is just dispatching.

Build command:

```bat
icpx kernels.cpp -shared -o my_kernel.dll ^
    -DBUILD_ESIMD_KERNEL_LIB ^
    -fsycl -fsycl-targets=spir64_gen ^
    -Xs "-device ptl-h -options -doubleGRF" ^
    -O3
```
Output: `my_kernel.dll` (runtime) + `my_kernel.lib` (import library for the linker).

Device targets: `ptl-h` = Panther Lake, `xe2-hpg` = BMG. Use `-doubleGRF` for ESIMD kernels that need all 256 GRF registers.
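Before wiring up the .pyd layer, it can save a debugging round-trip to confirm the DLL actually exports the `extern "C"` symbol (forgetting `-DBUILD_ESIMD_KERNEL_LIB` flips the macro to `dllimport` and silently drops the export). A hedged sketch using stdlib `ctypes` — `has_export` is a helper name introduced here, not part of the pipeline, and loading the DLL this way requires the oneAPI runtime DLLs to be resolvable:

```python
# Smoke-test that a freshly built DLL exposes an extern "C" symbol.
import ctypes
import os

def has_export(dll_path: str, symbol: str) -> bool:
    """True if the shared library at dll_path loads and exposes `symbol`."""
    if not os.path.exists(dll_path):
        return False
    try:
        lib = ctypes.CDLL(dll_path)  # loads the DLL into this process
        getattr(lib, symbol)         # raises AttributeError if not exported
        return True
    except (OSError, AttributeError):
        return False

# After a successful icpx build, has_export("my_kernel.dll", "my_kernel")
# should report True from the build directory.
```

`dumpbin /exports my_kernel.dll` from a Developer Command Prompt gives the same answer without loading the DLL.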
**`csrc/utils.h`** — borrow PyTorch's XPU queue

```cpp
#pragma once
#include <torch/extension.h>
#include <c10/xpu/XPUStream.h>

namespace utils {
static inline sycl::queue& get_queue(const torch::Device& device) {
    return c10::xpu::getCurrentXPUStream(device.index()).queue();
}
}  // namespace utils
```
**`csrc/my_kernel_wrapper.cpp`**

```cpp
#include <torch/extension.h>
#include "../lgrf_uni/esimd_kernel_api.h"  // dllimport declarations
#include "utils.h"

// Forward-declare the DLL function (or include a header with the declaration)
extern "C" ESIMD_KERNEL_API void my_kernel(void*, void*, int, void*);
```