You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
First steps to enable SYCL backend in Python Interface (#155)
First implementation steps towards supporting the SYCL backend in the
CUTLASS Python Interface.
The main additions from this PR are:
* Generating a suitable GEMM template and arguments for the CUTLASS 3.x
API and Intel PVC as target.
* Calling DPC++ instead of `nvcc` to compile device and host code.
* Using the DPCTL library to transfer data and launch the kernel via
SYCL.
The support so far focuses on a simple GEMM, epilogues (e.g, with
visitor) are not yet supported.
Compilation is currently only possible with development versions of
DPC++, the `-fsycl-rtc-mode` flag that was added to support CUTLASS
nested parameter classes in free-function kernels as part of this work
is not yet available in releases.
The activation of the SYCL backend via environment variable is a
temporary solution, a follow-up will look into a cleaner solution.
---------
Signed-off-by: Lukas Sommer <[email protected]>
Co-authored-by: Alejandro Acosta <[email protected]>
0 commit comments