How to generate a CUDA C implementation of a kernel with Triton #461
Unanswered
TigerYang414
asked this question in
Q&A
Replies: 2 comments
-
Unfortunately this is not possible. You can get the PTX, though calling it for inference would require more glue code.
-
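Yes, you can, with a caveat. Triton generates a cubin, which you can load from a C file into a CUmodule with the CUDA driver API (for example with cuModuleLoadData) and then execute with cuLaunchKernel. That gives you C host code that launches the kernel, not CUDA C source for the kernel itself.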
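A minimal sketch of that load-and-launch path, for illustration only. Everything specific here is an assumption rather than something from this thread: the function name `launch_add_kernel`, the kernel name `add_kernel`, the argument list, the 1024-element block and the 4 warps are hypothetical. The real kernel name, parameter list, warp count and shared-memory size depend on your kernel and Triton version, so check them against the generated PTX and metadata.

```cpp
// Sketch: load a Triton-generated cubin with the CUDA driver API and launch it.
// All kernel-specific values below are hypothetical placeholders.
#include <cstdio>
#include <fstream>
#include <iterator>
#include <string>
#include <vector>

// Read a whole binary file (e.g. the cubin) into memory; empty on failure.
inline std::vector<char> read_file(const std::string& path) {
    std::ifstream in(path, std::ios::binary);
    return std::vector<char>(std::istreambuf_iterator<char>(in),
                             std::istreambuf_iterator<char>());
}

// Number of program instances needed to cover n elements.
inline unsigned ceil_div(unsigned n, unsigned block) {
    return (n + block - 1) / block;
}

// The driver-API part is only compiled when the CUDA toolkit header is found,
// so the host-side helpers above still build on a machine without CUDA.
#if defined(__has_include)
#if __has_include(<cuda.h>)
#include <cuda.h>

#define CU_CHECK(call)                                                \
    do {                                                              \
        CUresult rc_ = (call);                                        \
        if (rc_ != CUDA_SUCCESS) {                                    \
            const char* msg_ = nullptr;                               \
            cuGetErrorString(rc_, &msg_);                             \
            std::fprintf(stderr, "%s failed: %s\n", #call,            \
                         msg_ ? msg_ : "unknown error");              \
            return false;                                             \
        }                                                             \
    } while (0)

// Launch a hypothetical element-wise kernel out[i] = x[i] + y[i].
// x, y and out are device pointers allocated by the caller.
inline bool launch_add_kernel(const char* cubin_path, CUdeviceptr x,
                              CUdeviceptr y, CUdeviceptr out, int n) {
    std::vector<char> image = read_file(cubin_path);
    if (image.empty()) return false;

    CUdevice dev;
    CUcontext ctx;
    CU_CHECK(cuInit(0));
    CU_CHECK(cuDeviceGet(&dev, 0));
    CU_CHECK(cuDevicePrimaryCtxRetain(&ctx, dev));
    CU_CHECK(cuCtxSetCurrent(ctx));

    CUmodule mod;
    CUfunction fn;
    CU_CHECK(cuModuleLoadData(&mod, image.data()));
    CU_CHECK(cuModuleGetFunction(&fn, mod, "add_kernel"));  // assumed name

    // Assumed parameter order; Triton may drop or add parameters.
    void* args[] = {&x, &y, &out, &n};
    const unsigned block_elems = 1024;   // assumed BLOCK_SIZE
    const unsigned threads = 4 * 32;     // assumed num_warps = 4
    const unsigned shared_bytes = 0;     // take the real value from metadata

    CU_CHECK(cuLaunchKernel(fn, ceil_div(static_cast<unsigned>(n), block_elems),
                            1, 1, threads, 1, 1, shared_bytes,
                            nullptr /* default stream */, args, nullptr));
    CU_CHECK(cuCtxSynchronize());

    CU_CHECK(cuModuleUnload(mod));
    CU_CHECK(cuDevicePrimaryCtxRelease(dev));
    return true;
}
#endif
#endif
```

Link against the driver library (`-lcuda`) when building with the toolkit installed. In an inference server you would normally load the module once at startup and reuse the `CUfunction` for every request instead of reloading it per call.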
-
I need to run the kernel from C++ code for online inference.