Skip to content

Update dependency ggml-org/llama.cpp to v9553 - autoclosed#220

Closed
renovate[bot] wants to merge 1 commit into
mainfrom
renovate/ggml-org-llama.cpp-9553.x
Closed

Update dependency ggml-org/llama.cpp to v9553 - autoclosed#220
renovate[bot] wants to merge 1 commit into
mainfrom
renovate/ggml-org-llama.cpp-9553.x

Conversation

@renovate

@renovate renovate Bot commented Jun 8, 2026

Copy link
Copy Markdown
Contributor

ℹ️ Note

This PR body was truncated due to platform limits.

This PR contains the following updates:

Package Update Change
ggml-org/llama.cpp major b9066b9553

Release Notes

ggml-org/llama.cpp (ggml-org/llama.cpp)

vb9553

Compare Source

Details

common : relax sampler name matching (#​23744)

  • common : relax sampler name matching

Currently, in some cases, the alternative names for samplers (like
top-k and min-p instead of the canonical top_k and min_p) are
not always recognized by the common_sampler_types_from_names function
in common/sampling.cpp.

This PR changes the signature of this function to remove the bool allow_alt_names flag, and removes all occurences of the flag from call
sites. Therefore, the function will now always match all known names.

I also changed the logic of the function to unconditionally check the
provided sampler names against both the canonical and alternative names,
and to be case-insensitive.

This fixes an issue I was seeing wherein samplers specified in the
llama-server UI were not recognized as valid when the alternative
names were used.

  • add more alt names

  • cont. fix

  • cast to unsigned char for correctness

  • common : unify sampler name mapping

  • annotate canonical vs. alt sampler name mappings per @​CISC

  • Update common/sampling.cpp

Co-authored-by: Sigbjørn Skjæret sigbjorn.skjaeret@scala.com

  • common : auto-generate sampler name aliases per @​ngxson

  • use merged map for matching

  • use .merge instead of iterating

  • nit: simplify comment

  • nit: use insert everywhere, not index assignment


Co-authored-by: Sigbjørn Skjæret sigbjorn.skjaeret@scala.com

macOS/iOS:

Linux:

Android:

Windows:

openEuler:

  • DISABLED
  • openEuler x86 (310p)
  • openEuler x86 (910b, ACL Graph)
  • openEuler aarch64 (310p)
  • openEuler aarch64 (910b, ACL Graph)

UI:

vb9551

Compare Source

Details

kv-cache : avoid kv cells copies (#​24277)

macOS/iOS:

Linux:

Android:

Windows:

openEuler:

  • DISABLED
  • openEuler x86 (310p)
  • openEuler x86 (910b, ACL Graph)
  • openEuler aarch64 (310p)
  • openEuler aarch64 (910b, ACL Graph)

UI:

vb9550

Compare Source

Details

kv-cache: follow the source cache size when sharing cells (#​24267)

A fitted target context can end up smaller than the draft default, the
oversized assistant views then overflow the shared K/V tensors and trip
the ggml_view_4d size assert during graph reserve.

macOS/iOS:

Linux:

Android:

Windows:

openEuler:

  • DISABLED
  • openEuler x86 (310p)
  • openEuler x86 (910b, ACL Graph)
  • openEuler aarch64 (310p)
  • openEuler aarch64 (910b, ACL Graph)

UI:

vb9549

Compare Source

Details

llama : add Gemma4 MTP (#​23398)

macOS/iOS:

Linux:

Android:

Windows:

openEuler:

  • DISABLED
  • openEuler x86 (310p)
  • openEuler x86 (910b, ACL Graph)
  • openEuler aarch64 (310p)
  • openEuler aarch64 (910b, ACL Graph)

UI:

vb9548

Compare Source

Details

spec : fix vocab compatibility check (#​24256)

macOS/iOS:

Linux:

Android:

Windows:

openEuler:

  • DISABLED
  • openEuler x86 (310p)
  • openEuler x86 (910b, ACL Graph)
  • openEuler aarch64 (310p)
  • openEuler aarch64 (910b, ACL Graph)

UI:

vb9547

Compare Source

Details

arg: Skip mmproj download when user supplied mmproj (#​24239)

macOS/iOS:

Linux:

Android:

Windows:

openEuler:

  • DISABLED
  • openEuler x86 (310p)
  • openEuler x86 (910b, ACL Graph)
  • openEuler aarch64 (310p)
  • openEuler aarch64 (910b, ACL Graph)

UI:

vb9544

Compare Source

Details

common/chat : fix LFM2/LFM2.5 reasoning round-trip and leak (#​24234)

  • common/chat : fix LFM2 reasoning round-trip and stray leak
  • Gate by reasoning format and whether the template supports

macOS/iOS:

Linux:

Android:

Windows:

openEuler:

  • DISABLED
  • openEuler x86 (310p)
  • openEuler x86 (910b, ACL Graph)
  • openEuler aarch64 (310p)
  • openEuler aarch64 (910b, ACL Graph)

UI:

vb9543

Compare Source

Details

mtmd: support "frame merge" for qwen-vl-based models (#​21858)

  • feat: add video support for Qwen3.5

  • various clean up

  • revise the design

  • fix llava-uhd case

  • nits

  • nits 2


Co-authored-by: andrewmd5 <1297077+andrewmd5@​users.noreply.github.com>

macOS/iOS:

Linux:

Android:

Windows:

openEuler:

  • DISABLED
  • openEuler x86 (310p)
  • openEuler x86 (910b, ACL Graph)
  • openEuler aarch64 (310p)
  • openEuler aarch64 (910b, ACL Graph)

UI:

vb9542

Compare Source

Details

completion : remove useless statics (#​24226)

Signed-off-by: Adrien Gallouët angt@huggingface.co

macOS/iOS:

Linux:

Android:

Windows:

openEuler:

  • DISABLED
  • openEuler x86 (310p)
  • openEuler x86 (910b, ACL Graph)
  • openEuler aarch64 (310p)
  • openEuler aarch64 (910b, ACL Graph)

UI:

vb9541

Compare Source

Details

completion : fix format specifier in LOG_INF (#​24213)

Signed-off-by: Adrien Gallouët angt@huggingface.co

macOS/iOS:

Linux:

Android:

Windows:

openEuler:

  • DISABLED
  • openEuler x86 (310p)
  • openEuler x86 (910b, ACL Graph)
  • openEuler aarch64 (310p)
  • openEuler aarch64 (910b, ACL Graph)

UI:

vb9538

Compare Source

Details

model : rename local n_layer_all variable (#​24209)

macOS/iOS:

Linux:

Android:

Windows:

openEuler:

  • DISABLED
  • openEuler x86 (310p)
  • openEuler x86 (910b, ACL Graph)
  • openEuler aarch64 (310p)
  • openEuler aarch64 (910b, ACL Graph)

UI:

vb9537

Compare Source

Details

context : fix off-by-one comparisons to n_gpu_layers (#​24208)

macOS/iOS:

Linux:

Android:

Windows:

openEuler:

  • DISABLED
  • openEuler x86 (310p)
  • openEuler x86 (910b, ACL Graph)
  • openEuler aarch64 (310p)
  • openEuler aarch64 (910b, ACL Graph)

UI:

vb9536

Compare Source

Details

opencl: improve get_rows, cpy, concat and q6_k flat gemv (#​24160)

  • opencl: allow multiple workgroups for large rows

  • opencl: improve small cpy

  • opencl: packed concat for small input

  • opencl: tweak flat q6_K gemv, increase N_DST and remap threads

macOS/iOS:

Linux:

Android:

Windows:

openEuler:

  • DISABLED
  • openEuler x86 (310p)
  • openEuler x86 (910b, ACL Graph)
  • openEuler aarch64 (310p)
  • openEuler aarch64 (910b, ACL Graph)

UI:

vb9535

Compare Source

Details

common/chat : unify and fix LFM2/LFM2.5 tool parser (#​24178)

macOS/iOS:

Linux:

Android:

Windows:

openEuler:

  • DISABLED
  • openEuler x86 (310p)
  • openEuler x86 (910b, ACL Graph)
  • openEuler aarch64 (310p)
  • openEuler aarch64 (910b, ACL Graph)

UI:

vb9534

Compare Source

Details

vulkan: add fwht support for Intel with shmem reduction (#​23964)

  • vulkan: add fwht support for Intel with shmem reduction

  • don't use N as workgroup size

  • disable subgroup shuffle on MoltenVK AMD

  • disable fwht shader on Intel Windows due to driver bug

macOS/iOS:

Linux:

Android:

Windows:

openEuler:

  • DISABLED
  • openEuler x86 (310p)
  • openEuler x86 (910b, ACL Graph)
  • openEuler aarch64 (310p)
  • openEuler aarch64 (910b, ACL Graph)

UI:

vb9533

Compare Source

Details

model: fix build failed (#​24193)

macOS/iOS:

Linux:

Note

PR body was truncated to here.


Configuration

📅 Schedule: (UTC)

  • Branch creation
    • At any time (no schedule defined)
  • Automerge
    • At any time (no schedule defined)

🚦 Automerge: Disabled by config. Please merge this manually once you are satisfied.

Rebasing: Whenever PR becomes conflicted, or you tick the rebase/retry checkbox.

🔕 Ignore: Close this PR and you won't be reminded about this update again.


  • If you want to rebase/retry this PR, check this box

This PR was generated by Mend Renovate. View the repository job log.

@renovate renovate Bot changed the title Update dependency ggml-org/llama.cpp to v9553 Update dependency ggml-org/llama.cpp to v9553 - autoclosed Jun 8, 2026
@renovate renovate Bot closed this Jun 8, 2026
@renovate renovate Bot deleted the renovate/ggml-org-llama.cpp-9553.x branch June 8, 2026 12:00
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

0 participants