Skip to content

Enable Nvidia's ModelOpt fp8 quantized models #3573

Enable Nvidia's ModelOpt fp8 quantized models

Enable Nvidia's ModelOpt fp8 quantized models #3573

Re-run triggered January 2, 2025 02:20
Status Failure
Total duration 25m 14s
Artifacts

pr-test.yml

on: pull_request
Matrix: unit-test-backend-1-gpu
unit-test-frontend
3m 40s
unit-test-frontend
unit-test-backend-2-gpu
12m 6s
unit-test-backend-2-gpu
performance-test-1-gpu-part-1
2m 11s
performance-test-1-gpu-part-1
performance-test-1-gpu-part-2
14m 17s
performance-test-1-gpu-part-2
performance-test-2-gpu
10m 54s
performance-test-2-gpu
accuracy-test-1-gpu
11m 52s
accuracy-test-1-gpu
accuracy-test-2-gpu
6m 53s
accuracy-test-2-gpu
Fit to window
Zoom out
Zoom in

Annotations

13 errors and 1 warning
performance-test-1-gpu-part-1
Process completed with exit code 1.
unit-test-frontend
Process completed with exit code 1.
unit-test-backend-1-gpu (0-6)
Process completed with exit code 1.
unit-test-backend-1-gpu (30-100)
The job was canceled because "_0-6" failed.
unit-test-backend-1-gpu (30-100)
The operation was canceled.
unit-test-backend-1-gpu (16-23)
The job was canceled because "_0-6" failed.
unit-test-backend-1-gpu (16-23)
The operation was canceled.
unit-test-backend-1-gpu (23-30)
The job was canceled because "_0-6" failed.
unit-test-backend-1-gpu (23-30)
The operation was canceled.
unit-test-backend-2-gpu
The action 'Evaluate data parallelism accuracy (DP=2)' has timed out after 10 minutes.
unit-test-backend-1-gpu (6-16)
The job was canceled because "_0-6" failed.
accuracy-test-1-gpu
Process completed with exit code 1.
performance-test-1-gpu-part-2
The action 'Benchmark offline throughput (w/o RadixAttention)' has timed out after 10 minutes.
unit-test-backend-1-gpu (6-16)
Runner ci3-gpu-0 did not respond to a cancelation request with 00:05:00.