test_unet.py runs into a regression failure on the int8_fp16 mixed-precision tests when it is exercised locally without the CI setup. The problem is two-fold:
The compiler team needs to figure out and fix the regression that appears when no spec file is used.
Usability of the script: assuming a user just wants to reproduce the GitHub Actions result, the script should use the default spec location instead of relying on an environment variable. I can assign you to this subtask. CC @geomin12
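The usability point above could be handled with a small fallback helper. This is only a sketch under stated assumptions: the issue does not name the environment variable or the default spec path, so `SPEC_ENV_VAR`, `DEFAULT_SPEC`, and `resolve_spec_path` are hypothetical placeholders, not names from test_unet.py.

```python
import os
from pathlib import Path

# Hypothetical names: the issue states neither the variable nor the default path.
SPEC_ENV_VAR = "EXAMPLE_SPEC_FILE"
DEFAULT_SPEC = Path("specs") / "example_spec.mlir"


def resolve_spec_path(env=None, default=DEFAULT_SPEC):
    """Return the spec path from the environment, else the default location."""
    env = os.environ if env is None else env
    # Treat an unset or blank variable as "not provided".
    override = env.get(SPEC_ENV_VAR, "").strip()
    return Path(override) if override else Path(default)
```

With a helper like this, a plain local pytest run would pick up the default spec, while CI could still override the location through the environment.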
Steps to reproduce your issue
To reproduce the failure
Use an MI300 machine, build IREE, and make sure iree-compile and iree-run-module are accessible from PATH.
Run pytest standalone, without fully mimicking the CI steps:
cd iree
pytest ./experimental/regression_suite/shark-test-suite-models/sdxl/test_unet.py -k int8_fp16_rocm -rpfE --log-cli-level=info --timeout=600 --durations=0
Observe the failure and corresponding log
input = None, capture_output = True, timeout = None, check = True
popenargs = (['iree-run-module', '--device=hip', '--module=/root/iree/sdxl_punet_int8_fp16_vmfbs/punet_fp16.rocm_gfx942.vmfb', '--...fp16/punet_weights.irpa', '--input=1x4x128x128xf16=@/root/iree/artifacts/sdxl_punet_int8/inference_input.0.bin', ...],) kwargs = {'cwd': PosixPath('/root/iree/sdxl_punet_int8_fp16_vmfbs'), 'stderr': -1, 'stdout': -1}, process = <Popen: returncode: 1 args: ['iree-run-module', '--device=hip', '--module=/r...>
stdout = b'EXEC @main\n[FAILED] result[0]: element at index 0 (0.0032959) does not match the expected (0.0914307); expected tha......][...][...][...][...][...][...][...][...][...][...][...][...][...][...][...][...][...][...][...][...][...][...]]]\n' stderr = b'', retcode = 1
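The reproduction steps and the log above can be checked with a small helper. This is a rough sketch, not part of the test suite: `missing_tools` and `parse_mismatch` are hypothetical names, and the regular expression is derived only from the single mismatch line shown in this log, so other iree-run-module output may not match it.

```python
import re
import shutil

# Tools the issue says must be reachable from PATH.
REQUIRED_TOOLS = ("iree-compile", "iree-run-module")

# Based solely on the mismatch line in the log above (an assumption about the format).
MISMATCH_RE = re.compile(
    r"element at index (\d+) \(([^)]+)\) does not match the expected \(([^)]+)\)"
)


def missing_tools(which=shutil.which):
    """Return the required tools that cannot be found on PATH."""
    return [tool for tool in REQUIRED_TOOLS if which(tool) is None]


def parse_mismatch(stdout):
    """Extract (index, actual, expected) from the failure output, or None."""
    match = MISMATCH_RE.search(stdout)
    if match is None:
        return None
    index, actual, expected = match.groups()
    return int(index), float(actual), float(expected)
```

Calling `missing_tools()` before pytest catches a missing build early, and `parse_mismatch(stdout.decode())` on the captured output makes the size of the numerical gap easy to see.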
To fix it
Comment out the lines below:
iree/experimental/regression_suite/shark-test-suite-models/sdxl/test_unet.py
Lines 294 to 304 in 8fab35c
Replace them with the right spec location; e.g., in my Docker container, it is:
Then run pytest again and observe that the unit tests pass with matching values.
What component(s) does this issue relate to?
No response
Version information
latest: 5767be3
Additional context
No response