Bump llama.cpp to 74ade52 (b9672), release v0.8.26 by nyo16 · Pull Request #58 · nyo16/llama_cpp_ex

nyo16 · 2026-06-16T23:40:57Z

Summary

Updates the vendor/llama.cpp submodule from 597b6672e (b9621) to
74ade5274 (b9672, latest release tag) — 51 commits — and cuts v0.8.26.

No NIF changes were required. Every public header the binding compiles
against is unchanged across the range (zero diff): include/llama.h,
ggml/include/ggml.h, ggml/include/ggml-backend.h, common/chat.h,
common/speculative.h, common/json-schema-to-grammar.h,
common/sampling.h, and common/common.h. Even common_params, which
gained a field in v0.8.24, is untouched this time.
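The zero-diff claim can be re-checked locally with plain git. The following is a minimal sketch, not the procedure the author used: the helper name `headers_unchanged` is hypothetical, and it assumes the submodule is checked out with both commits available.

```shell
# Hypothetical helper: succeeds only if none of the listed paths differ
# between two commits of the repository at $1.
headers_unchanged() {
  repo=$1; old=$2; new=$3
  shift 3
  # --quiet suppresses output and implies --exit-code:
  # status 0 means the given paths are identical in both commits.
  git -C "$repo" diff --quiet "$old" "$new" -- "$@"
}

# Example call (run from the llama_cpp_ex repository root):
#   headers_unchanged vendor/llama.cpp 597b6672e 74ade5274 \
#     include/llama.h ggml/include/ggml.h ggml/include/ggml-backend.h \
#     common/chat.h common/speculative.h common/json-schema-to-grammar.h \
#     common/sampling.h common/common.h && echo "headers unchanged"
```

Paths are given relative to the submodule root because `git -C` changes into that directory before running the diff.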

What's in the 51 commits

model/vocab — Cohere2-MoE arch support + vocab + dedicated chat parser (#24260, #24601, #24615)
speculative — EAGLE3 backend sampling (#24655), acceptance-rate spec metrics (#24536)
common/chat — LFM2 tool-call double-escape fix (#24667), hardened peg parsing (#24329), grammar-generator + whitespace fixes (#24653, #24624)
jinja (chat-template engine) — filter aliases, negative-step slices, empty-arg split/replace fixes (#24606, #24580, #24574)
mtmd — n_tokens miscount fix (#24656), post-decode callback (#24645)
backends — CUDA GGML_OP_REPEAT F32/F16 (#24533); Metal bf16 repeat (#24638); many Vulkan, SYCL, and WebGPU improvements
webui / vendor / ci / docs — BoringSSL 0.20260616.0 (#24693) and assorted UI / build fixes

Versioning note

v0.8.25 was tagged & released (Hub HTTP/HTTPS proxy support, #57) without a
version/CHANGELOG bump, so @version was still 0.8.24 and the changelog had
a gap. This PR therefore:

advances @version 0.8.24 → 0.8.26 (the v0.8.25 tag is already published), and
backfills the missing ## v0.8.25 CHANGELOG entry so the history is contiguous.
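A gap like the v0.8.25 one (tag published while @version still read 0.8.24) could be caught by comparing the version attribute with the tag before releasing. This is only a sketch under assumptions: the function name `version_matches_tag` is hypothetical, and it assumes the version is declared on a line of the form `@version "0.8.26"` in mix.exs, which this PR does not show.

```shell
# Hypothetical check: does the @version attribute in a mix.exs-style file
# match a release tag such as v0.8.26?
# Assumes a line like:  @version "0.8.26"
version_matches_tag() {
  file=$1; tag=$2
  # Pull the quoted string out of the first line that sets @version.
  version=$(sed -n 's/^[[:space:]]*@version[[:space:]]*"\([^"]*\)".*/\1/p' "$file" | head -n 1)
  # Tags carry a leading "v"; the attribute does not.
  [ "v$version" = "$tag" ]
}

# Example call:
#   version_matches_tag mix.exs v0.8.26 || echo "version and tag disagree"
```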

Verification

mix test — 158 passed, 4 skipped (against a force-rebuilt NIF, dev + test envs)
mix test --include smoke — 7/7 pass end-to-end on real GGUF models:
generation, streaming, chat templates, JSON-schema grammar, raw GBNF, embeddings
mix format --check-formatted — clean
mix dialyzer — 0 errors
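The four checks above can be chained so that the first failure stops the run. A minimal sketch, assuming it is run from the project root; the wrapper name `run_checks` is hypothetical, and the forced NIF rebuild is left out because the PR does not say which command was used for it.

```shell
# Hypothetical wrapper around the four checks listed in this PR.
# Stops at the first failing command and returns its exit status.
run_checks() {
  mix test &&
  mix test --include smoke &&
  mix format --check-formatted &&
  mix dialyzer
}

# Example call:
#   run_checks && echo "all checks passed"
```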

checksum.exs is intentionally untouched — CI regenerates it after the release tag is pushed.


Update the vendor/llama.cpp submodule from 597b6672e (b9621) to 74ade5274 (b9672), 51 commits. No NIF changes required: every header the binding compiles against (include/llama.h, ggml.h, ggml-backend.h, common/chat.h, speculative.h, json-schema-to-grammar.h, sampling.h, common.h) is unchanged.

Verified end-to-end against the freshly built NIF: 158 tests + 7 smoke tests pass (generation, streaming, chat templates, JSON-schema grammar, raw GBNF, embeddings), formatting clean, Dialyzer 0 errors.

Also backfill the missing v0.8.25 CHANGELOG entry (Hub HTTP/HTTPS proxy support, #57), which shipped as a tag without a changelog/version bump, and advance @version 0.8.24 -> 0.8.26 accordingly.

nyo16 merged commit b772813 into master Jun 16, 2026
4 checks passed

nyo16 deleted the bump-llama-cpp-b9672 branch June 16, 2026 23:53
