Skip to content

Bump llama.cpp to 74ade52 (b9672), release v0.8.26#58

Merged
nyo16 merged 1 commit into
masterfrom
bump-llama-cpp-b9672
Jun 16, 2026
Merged

Bump llama.cpp to 74ade52 (b9672), release v0.8.26#58
nyo16 merged 1 commit into
masterfrom
bump-llama-cpp-b9672

Conversation

@nyo16

@nyo16 nyo16 commented Jun 16, 2026

Copy link
Copy Markdown
Owner

Summary

Updates the vendor/llama.cpp submodule from 597b6672e (b9621) to
74ade5274 (b9672, latest release tag) — 51 commits — and cuts v0.8.26.

No NIF changes were required. Every public header the binding compiles
against is unchanged across the range (zero diff): include/llama.h,
ggml/include/ggml.h, ggml/include/ggml-backend.h, common/chat.h,
common/speculative.h, common/json-schema-to-grammar.h,
common/sampling.h, and common/common.h. Even common_params, which
gained a field in v0.8.24, is untouched this time.

What's in the 51 commits

  • model/vocab — Cohere2-MoE arch support + vocab + dedicated chat parser (#24260, #24601, #24615)
  • speculative — EAGLE3 backend sampling (#24655), acceptance-rate spec metrics (#24536)
  • common/chat — LFM2 tool-call double-escape fix (#24667), hardened peg parsing (#24329), grammar-generator + whitespace fixes (#24653, #24624)
  • jinja (chat-template engine) — filter aliases, negative-step slices, empty-arg split/replace fixes (#24606, #24580, #24574)
  • mtmdn_tokens miscount fix (#24656), post-decode callback (#24645)
  • backends — CUDA GGML_OP_REPEAT F32/F16 (#24533); Metal bf16 repeat (#24638); many Vulkan, SYCL, and WebGPU improvements
  • webui / vendor / ci / docs — BoringSSL 0.20260616.0 (#24693) and assorted UI / build fixes

Versioning note

v0.8.25 was tagged & released (Hub HTTP/HTTPS proxy support, #57) without a
version/CHANGELOG bump, so @version was still 0.8.24 and the changelog had
a gap. This PR therefore:

  • advances @version 0.8.24 → 0.8.26 (the v0.8.25 tag is already published), and
  • backfills the missing ## v0.8.25 CHANGELOG entry so the history is contiguous.

Verification

  • mix test — 158 passed, 4 skipped (against a force-rebuilt NIF, dev + test envs)
  • mix test --include smoke7/7 pass end-to-end on real GGUF models:
    generation, streaming, chat templates, JSON-schema grammar, raw GBNF, embeddings
  • mix format --check-formatted — clean
  • mix dialyzer — 0 errors

checksum.exs is intentionally untouched — CI regenerates it after the release tag is pushed.

Update the vendor/llama.cpp submodule from 597b6672e (b9621) to 74ade5274
(b9672), 51 commits. No NIF changes required: every header the binding
compiles against (include/llama.h, ggml.h, ggml-backend.h, common/chat.h,
speculative.h, json-schema-to-grammar.h, sampling.h, common.h) is unchanged.

Verified end-to-end against the freshly built NIF: 158 tests + 7 smoke
tests pass (generation, streaming, chat templates, JSON-schema grammar,
raw GBNF, embeddings), formatting clean, Dialyzer 0 errors.

Also backfill the missing v0.8.25 CHANGELOG entry (Hub HTTP/HTTPS proxy
support, #57), which shipped as a tag without a changelog/version bump,
and advance @Version 0.8.24 -> 0.8.26 accordingly.
@nyo16 nyo16 merged commit b772813 into master Jun 16, 2026
4 checks passed
@nyo16 nyo16 deleted the bump-llama-cpp-b9672 branch June 16, 2026 23:53
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant