Enable Nvidia's ModelOpt fp8 quantized models #3573
pr-test.yml
on: pull_request
Matrix: unit-test-backend-1-gpu
unit-test-frontend
3m 40s
unit-test-backend-2-gpu
12m 6s
performance-test-1-gpu-part-1
2m 11s
performance-test-1-gpu-part-2
14m 17s
performance-test-2-gpu
10m 54s
accuracy-test-1-gpu
11m 52s
accuracy-test-2-gpu
6m 53s
finish
0s
Annotations
13 errors and 1 warning
performance-test-1-gpu-part-1
Process completed with exit code 1.
|
unit-test-frontend
Process completed with exit code 1.
|
unit-test-backend-1-gpu (0-6)
Process completed with exit code 1.
|
unit-test-backend-1-gpu (30-100)
The job was canceled because "_0-6" failed.
|
unit-test-backend-1-gpu (30-100)
The operation was canceled.
|
unit-test-backend-1-gpu (16-23)
The job was canceled because "_0-6" failed.
|
unit-test-backend-1-gpu (16-23)
The operation was canceled.
|
unit-test-backend-1-gpu (23-30)
The job was canceled because "_0-6" failed.
|
unit-test-backend-1-gpu (23-30)
The operation was canceled.
|
unit-test-backend-2-gpu
The action 'Evaluate data parallelism accuracy (DP=2)' has timed out after 10 minutes.
|
unit-test-backend-1-gpu (6-16)
The job was canceled because "_0-6" failed.
|
accuracy-test-1-gpu
Process completed with exit code 1.
|
performance-test-1-gpu-part-2
The action 'Benchmark offline throughput (w/o RadixAttention)' has timed out after 10 minutes.
|
unit-test-backend-1-gpu (6-16)
Runner ci3-gpu-0 did not respond to a cancelation request with 00:05:00.
|