Skip to content

updated to quantize k_pe_bmm for MLA

fdbfefe
Select commit
Loading
Failed to load commit list.
Open

Support for KV cache quantization for MLA Attention vLLM fakequant #714

updated to quantize k_pe_bmm for MLA
fdbfefe
Select commit
Loading
Failed to load commit list.