Name		Name	Last commit message	Last commit date
parent directory ..
220-yolov5-accuracy-check-and-quantization.ipynb		220-yolov5-accuracy-check-and-quantization.ipynb
README.md		README.md

README.md

Quantize the Ultralytics YOLOv5 model and check accuracy using the OpenVINO POT API

This tutorial demonstrates step-by-step how to perform model quantization using the OpenVINO Post-Training Optimization Tool (POT), compare model accuracy between the FP32 precision and quantized INT8 precision models and run a demo of model inference based on sample code from Ultralytics Yolov5 with the OpenVINO backend.

Notebook Contents

The notebook uses Ultralytics Yolov5 to obtain the YOLOv5-m model in OpenVINO Intermediate Representation (IR) format. Then, the OpenVINO Post-Training Optimization Tool (POT) API is used to quantize the model based on Non-Max Suppression (NMS) processing provided by Ultralytics. To ensure minimal accuracy loss, the accuracy is compared between the FP32 model and the INT8 model quantized by POT using "DefaultQuantization" algorithm. Finally, the code sample detect.py from Ultralytics is used to perform inference the INT8 model and check performance using OpenVINO with sync API enabled.

Installation Instructions

If you have not installed all required dependencies, follow the Installation Guide.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

220-yolov5-accuracy-check-and-quantization

220-yolov5-accuracy-check-and-quantization

README.md

Quantize the Ultralytics YOLOv5 model and check accuracy using the OpenVINO POT API

Notebook Contents

Installation Instructions

Files

220-yolov5-accuracy-check-and-quantization

Directory actions

More options

Directory actions

More options

Latest commit

History

220-yolov5-accuracy-check-and-quantization

Folders and files

parent directory

README.md

Quantize the Ultralytics YOLOv5 model and check accuracy using the OpenVINO POT API

Notebook Contents

Installation Instructions