-
Notifications
You must be signed in to change notification settings - Fork 61
Open
Description
🚀 The feature, motivation and pitch
torch.linalg.cholesky is used by llm-compressor gptq. Now, it will show below log on xpu:
/home/sdp/jiqing/llm-compressor/src/llmcompressor/modifiers/quantization/gptq/gptq_quantize.py:160: UserWarning: Aten Op fallback from XPU
to CPU happends. This may have performance implications. If need debug the fallback ops please set environment variable `PYTORCH_DEBUG_XPU_
FALLBACK=1` (Triggered internally at /pytorch/build/xpu/ATen/RegisterXPU_0.cpp:54523.)
H = torch.linalg.cholesky(H)
Alternatives
No response
Additional context
No response