YOLO V8 to ONNX Quantized model conversion compatibility #80
Comments
Instead of ONNX PTQ, you can try QAT (quantization-aware training) using the PyTorch flow. There is a tutorial at https://github.com/amd/RyzenAI-SW/tree/main/tutorial/yolov8_e2e that has a QAT flow.
From the code snippet, it looks like I can train my custom model on my own dataset to produce my object detection model using the command below? yolo detect train data="datasets/coco.yaml" model=${WEIGHTS} pretrained=True sync_bn=True It seems this trains the model on the local machine; since I don't have a GPU, would it also run on Colab? Thanks
Any idea which package I need to install to solve SyntaxError: 'nndct_convert_sigmoid_to_hsigmoid' is not a valid YOLO argument? Thanks
Hi @dadanugm, the instructions in the tutorial ask you to use Docker. Are you running inside the Docker container?
Hi @uday610. Thanks for the pointer! I was able to resolve the syntax error by running it in Docker, but hit another error after that: it seems the code requires CUDA. Another error also pops up when I run run_qat.sh. Thanks
Please run the PTQ first; then you should be able to get the JSON file. You can use the quantized model to run QAT afterwards.
Hi @fanz-xlnx. Thanks for the input. Any idea how to bypass raise AssertionError("Torch not compiled with CUDA enabled")? This blocks me from running run_ptq. I don't have a CUDA/NVIDIA GPU; my machine is an AMD Ryzen 9 7940HS with Radeon 780M Graphics. I would be happy if I could use it in place of CUDA for training. Thanks
I got a GPU (NVIDIA GTX 1070 with CUDA 12.0) to test the Docker image (https://hub.docker.com/r/amdih/ryzen-ai-pytorch), but AssertionError: Torch not compiled with CUDA enabled still pops up. I thought installing a CUDA-compatible PyTorch build (1.12.1+cu113) would resolve the issue, but it hits other errors. I would like to know what the real issue is here: even though the Docker container runs on the GPU, I still can't run the Ryzen AI environment. Do I need a specific GPU to run the environment? Thanks.
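The "Torch not compiled with CUDA enabled" assertion means the PyTorch build installed inside the container is CPU-only, regardless of what GPU the host has; any script that calls .cuda() unconditionally will hit it. A minimal sketch of the usual device guard (this is a general PyTorch pattern, not necessarily how the tutorial scripts select the device):

```python
import torch

# A CPU-only PyTorch wheel makes torch.cuda.is_available() return False,
# so this guard falls back to the CPU instead of asserting.
device = torch.device("cuda" if torch.cuda.is_available() else "cpu")

# Move tensors/models explicitly instead of calling .cuda() directly.
model_input = torch.zeros(1, 3, 640, 640).to(device)
print(device.type)
```

If the tutorial scripts hard-code .cuda(), they would need a CUDA-enabled PyTorch wheel (or an edit along these lines) to run on the machines described above.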
Thanks for the updated info. |
Hi everyone.
I would like to discuss quantized object-detection models for the AMD Ryzen AI IPU. I need input from AMD on how to produce a trained model suitable for the IPU, so I can use the IPU for inference.
I have a pre-trained YOLOv8 model, converted to an ONNX model using:
model.export(format="onnx") # export the model to ONNX format
and then I added pre-processing for the model.
add_ppp.yolo_detection(input_model_file, output_model_file, "jpg", onnx_opset=18)
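One common source of accuracy loss at this step is a mismatch between the pre-processing baked into the graph and the pre-processing YOLOv8 itself uses: Ultralytics resizes with an aspect-ratio-preserving letterbox, and if the model instead receives a plain stretch-resize, detections degrade. A minimal sketch of the letterbox geometry for checking this (function name and 640x640 input size are my assumptions, not taken from add_ppp):

```python
def letterbox_params(src_w, src_h, dst=640):
    """Scale and padding that map a src_w x src_h image into a dst x dst
    square while preserving aspect ratio (YOLO-style letterbox)."""
    scale = min(dst / src_w, dst / src_h)
    new_w, new_h = round(src_w * scale), round(src_h * scale)
    pad_x = (dst - new_w) / 2   # horizontal border added on each side
    pad_y = (dst - new_h) / 2   # vertical border added on each side
    return scale, pad_x, pad_y

# A 1280x720 image is scaled by 0.5 to 640x360, then padded 140 px
# top and bottom to reach 640x640.
print(letterbox_params(1280, 720))  # → (0.5, 0.0, 140.0)
```

Comparing these values against what the inserted pre-processing actually does (e.g. by inspecting the exported graph in Netron) is one way to rule this mismatch out.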
and running the ONNX inference:
session = ort.InferenceSession(str(onnx_model_file), providers=providers, sess_options=session_options)
But the inference results from the converted ONNX model are much poorer than the YOLOv8 inference results, and after ONNX quantization the detections get even worse.
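To quantify "much poorer" rather than eyeballing the images, one option is to match detections from the two models by intersection-over-union and count agreements. A minimal IoU helper (pure Python, boxes as (x1, y1, x2, y2) corners — a common convention, though the actual output layout of the exported model should be verified):

```python
def iou(box_a, box_b):
    """Intersection-over-union of two axis-aligned boxes (x1, y1, x2, y2)."""
    ix1, iy1 = max(box_a[0], box_b[0]), max(box_a[1], box_b[1])
    ix2, iy2 = min(box_a[2], box_b[2]), min(box_a[3], box_b[3])
    inter = max(0, ix2 - ix1) * max(0, iy2 - iy1)
    area_a = (box_a[2] - box_a[0]) * (box_a[3] - box_a[1])
    area_b = (box_b[2] - box_b[0]) * (box_b[3] - box_b[1])
    return inter / (area_a + area_b - inter)

print(iou((0, 0, 10, 10), (5, 5, 15, 15)))  # ≈ 0.143
```

Detections from the original and the converted model that match above some IoU threshold (0.5 is typical) can then be counted per image to measure how much the conversion and quantization actually cost.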
onnx 1.16.1
onnxruntime 1.17.3
Ultralytics 8.2.1
I would like recommendations on how to train the model, or convert it to ONNX, so that it is highly compatible with the IPU.
Thanks.
[image: YOLOv8 inference result]
[image: ONNX-converted model inference result]