Releases: gpustack/gguf-parser-go
Releases · gpustack/gguf-parser-go
v0.13.16
fix: offload clipper to vram0 Signed-off-by: thxCode <[email protected]>
v0.13.15
refactor: estimate partial offloading Signed-off-by: thxCode <[email protected]>
v0.13.14
refactor: estimate moe Signed-off-by: thxCode <[email protected]>
v0.13.13
fix: wrong output layer offload at zero ts input Signed-off-by: thxCode <[email protected]>
v0.13.12
refactor: estimate Signed-off-by: thxCode <[email protected]>
v0.13.11
docs: readme Signed-off-by: thxCode <[email protected]>
v0.13.10
fix: calculation error of projector zero offloading Signed-off-by: thxCode <[email protected]>
v0.13.9
feat: support offloading sd to multi devs Signed-off-by: thxCode <[email protected]>
v0.13.8
refactor: support qk_m distributable Signed-off-by: thxCode <[email protected]>
v0.13.7
refactor: estimate Signed-off-by: thxCode <[email protected]>