fp16 and int8 support for vulkan backend #4785

vjayd · 2023-06-07T07:32:50Z


Are there plans to run int8 quantized models on the Vulkan backend? Generative AI models in particular are very slow and need a much faster path on Vulkan devices.

nihui (Maintainer) · 2023-07-06T07:13:15Z

Yes, but we will wait for upstream glslang to support the int8 dot product extension.
Without this extension, int8 would be very slow.
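
To illustrate why the extension matters, here is a minimal C++ sketch of what a packed 4x8 signed dot product with accumulate computes. It is a CPU-side emulation for illustration only, not ncnn or glslang code, and the helper names `pack4x8` and `dot4x8_acc_emulated` are hypothetical. Without a hardware dot product, a shader would likely have to spell out roughly these unpack, multiply, and add steps for every group of four int8 values, which is presumably where the slowdown comes from.

```cpp
#include <cstdint>

// Hypothetical helper: pack four signed 8-bit values into one 32-bit word,
// lowest byte first.
static uint32_t pack4x8(int8_t a, int8_t b, int8_t c, int8_t d)
{
    return  static_cast<uint32_t>(static_cast<uint8_t>(a))
         | (static_cast<uint32_t>(static_cast<uint8_t>(b)) << 8)
         | (static_cast<uint32_t>(static_cast<uint8_t>(c)) << 16)
         | (static_cast<uint32_t>(static_cast<uint8_t>(d)) << 24);
}

// Hypothetical scalar emulation: unpack each byte lane, sign-extend it,
// multiply the lanes pairwise, and add the products to a 32-bit accumulator.
static int32_t dot4x8_acc_emulated(uint32_t a, uint32_t b, int32_t acc)
{
    for (int i = 0; i < 4; i++)
    {
        int8_t ai = static_cast<int8_t>(static_cast<uint8_t>((a >> (8 * i)) & 0xFF));
        int8_t bi = static_cast<int8_t>(static_cast<uint8_t>((b >> (8 * i)) & 0xFF));
        acc += static_cast<int32_t>(ai) * static_cast<int32_t>(bi);
    }
    return acc;
}
```

A device with an integer dot product instruction can typically produce the same result in a single operation per packed word, so the inner loop of an int8 convolution or matrix multiply does far less work per element.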

1 reply

Mek101 Jul 14, 2023

Do you mean https://registry.khronos.org/vulkan/specs/1.3-extensions/man/html/VK_KHR_shader_integer_dot_product.html ?
