Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Block size = 32 assertion fails #110

Open
rukshankr opened this issue Jun 17, 2024 · 0 comments
Open

Block size = 32 assertion fails #110

rukshankr opened this issue Jun 17, 2024 · 0 comments

Comments

@rukshankr
Copy link

rukshankr commented Jun 17, 2024

I have tried with both LLaMA and VILA models.

Both give this when ./chat is run:
../kernels/avx/matmul_avx_int4.cc:701: void matmul::MatmulOperator::mat_mul_accelerator_int4_fast_no_offset(const matmul_params*): Assertion params->block_size == 32' failed. Aborted (core dumped)

When I print the block_size parameter that comes to the above functions it says 128.
does anyone know why this happens? How can I define block size as 32?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant