v1.13.0: 4-bit quantization, stateful models, Whisper

echarlaix released this 25 Jan 16:48

OpenVINO

Weight-only 4-bit quantization

Add weight only 4-bit quantization support by @AlexKoff88 in #469

optimum-cli export openvino --model gpt2 --weight-format int4_sym_g128 ov_model
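
The weight format name int4_sym_g128 reads as symmetric 4-bit quantization with a group size of 128: the weights are split into groups of 128 values and each group gets its own scale. The sketch below illustrates that general technique in plain Python. It is not the code the exporter runs; the function names, the choice of scale (largest absolute value in the group mapped to 7), and the flat-list layout are all assumptions made for illustration.

```python
def quantize_int4_sym(weights, group_size=128):
    """Illustrative symmetric 4-bit group-wise quantization (hypothetical helper).

    Returns integer codes in [-8, 7] and one float scale per group.
    """
    if len(weights) % group_size != 0:
        raise ValueError("number of weights must be a multiple of group_size")
    codes, scales = [], []
    for start in range(0, len(weights), group_size):
        group = weights[start:start + group_size]
        max_abs = max(abs(w) for w in group)
        # Assumption: map the largest magnitude in the group to the code 7.
        scale = max_abs / 7.0 if max_abs > 0 else 1.0
        scales.append(scale)
        for w in group:
            # Round to the nearest code and clamp to the signed 4-bit range.
            codes.append(max(-8, min(7, round(w / scale))))
    return codes, scales


def dequantize_int4_sym(codes, scales, group_size=128):
    """Recover approximate float weights from codes and per-group scales."""
    return [c * scales[i // group_size] for i, c in enumerate(codes)]
```

With this scheme the reconstruction error of each weight is at most half of its group's scale, which is why smaller groups (more scales) generally preserve accuracy better at the cost of extra storage.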

Stateful

Add support for stateful models by @eaidova in #493
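
As a rough illustration of what "stateful" means for a decoder, the toy sketch below contrasts a stateless step, where the caller passes the cache in and gets it back on every call, with a stateful one, where the cache lives inside the object. The classes are hypothetical and the "model" just sums the tokens seen so far; this is not the optimum-intel or OpenVINO API.

```python
class StatelessDecoder:
    """Toy decoder: the caller owns the cache and passes it on every step."""

    def step(self, token, past):
        past = past + [token]      # new cache returned to the caller
        return sum(past), past     # stand-in for "output depends on history"


class StatefulDecoder:
    """Toy decoder: the cache is kept internally between steps."""

    def __init__(self):
        self._cache = []

    def step(self, token):
        self._cache.append(token)  # cache never leaves the object
        return sum(self._cache)

    def reset(self):
        """Clear the internal state before starting a new sequence."""
        self._cache.clear()
```

Both variants compute the same outputs; the stateful one avoids moving the cache across the call boundary at each step, but needs an explicit reset between sequences.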

New architectures

Whisper

Add support for export and inference for whisper models by @eaidova in #470

Contributors

AlexKoff88 and eaidova
