Block size = 32 assertion fails #110

rukshankr · 2024-06-17T07:11:02Z

I have tried with both LLaMA and VILA models.

Both give this when ./chat is run:
../kernels/avx/matmul_avx_int4.cc:701: void matmul::MatmulOperator::mat_mul_accelerator_int4_fast_no_offset(const matmul_params*): Assertion params->block_size == 32' failed. Aborted (core dumped)

When I print the block_size parameter that comes to the above functions it says 128.
does anyone know why this happens? How can I define block size as 32?

The text was updated successfully, but these errors were encountered:

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Block size = 32 assertion fails #110

Block size = 32 assertion fails #110

rukshankr commented Jun 17, 2024 •

edited

Loading

Block size = 32 assertion fails #110

Block size = 32 assertion fails #110

Comments

rukshankr commented Jun 17, 2024 • edited Loading

rukshankr commented Jun 17, 2024 •

edited

Loading