Error with --half and onnx-low-memory #382
Labels
model/diffusion
provider/directml
status/planned
issues that have been planned but not started
type/bug
broken features
Milestone
with server on --half and onnx-low-memory disabled, the onnx runtime produces a matrix multiplication error during image generation
This error appears to be caused specifically by the enable_mem_reuse optimization for onnx-runtime. The other optimizations prevented by onnx-low-memory cause no errors. This error only occurs when onnxruntime-directml 1.15.0 is installed; error does not appear with onnxruntime-directml 1.14.1 installed.
Error:
The text was updated successfully, but these errors were encountered: