Pull requests: ggml-org/llama.cpp
compare-commits.sh: support both llama-bench and test-backend-ops
python
python script changes
script
Script related
#14392
opened Jun 26, 2025 by
yeahdongcn
llama : return mistral-v7-tekken as default template only
#14390
opened Jun 26, 2025 by
CISC
Add conv2d cpu2
ggml
changes relating to the ggml tensor library for machine learning
#14388
opened Jun 26, 2025 by
am17an
metal : add special-case mat-vec mul for ne00 == 4
Apple Metal
https://en.wikipedia.org/wiki/Metal_(API)
ggml
changes relating to the ggml tensor library for machine learning
testing
Everything test related
#14385
opened Jun 26, 2025 by
ggerganov
metal : batch rows copy in a single threadgroup
Apple Metal
https://en.wikipedia.org/wiki/Metal_(API)
ggml
changes relating to the ggml tensor library for machine learning
#14384
opened Jun 26, 2025 by
ggerganov
ggml-cpu: Build variant targeting Neoverse-V2
ggml
changes relating to the ggml tensor library for machine learning
#14380
opened Jun 25, 2025 by
ckastner
vulkan: handle noncontig in the final case of ggml_vk_get_cpy_pipeline
ggml
changes relating to the ggml tensor library for machine learning
Vulkan
Issues specific to the Vulkan backend
#14378
opened Jun 25, 2025 by
jeffbolznv
webui: preserve partial content when streaming errors occur
examples
server
#14374
opened Jun 25, 2025 by
Aaryan-549
5 of 8 tasks
Q2k interleaving implementation - x86/x64 SIMD
ggml
changes relating to the ggml tensor library for machine learning
#14373
opened Jun 25, 2025 by
Srihari-mcw
test-backend-ops: add support for specifying output format
testing
Everything test related
#14368
opened Jun 25, 2025 by
yeahdongcn
vulkan: Add fusion support for RMS_NORM+MUL
ggml
changes relating to the ggml tensor library for machine learning
testing
Everything test related
Vulkan
Issues specific to the Vulkan backend
#14366
opened Jun 24, 2025 by
jeffbolznv
Draft
ggml : add pointer to attach user data
ggml
changes relating to the ggml tensor library for machine learning
#14365
opened Jun 24, 2025 by
koush
llama : add high-throughput mode
Apple Metal
https://en.wikipedia.org/wiki/Metal_(API)
examples
ggml
changes relating to the ggml tensor library for machine learning
CUDA: add bf16 and f32 support to cublas_mul_mat_batched
ggml
changes relating to the ggml tensor library for machine learning
Nvidia GPU
Issues specific to Nvidia GPUs
testing
Everything test related
#14361
opened Jun 24, 2025 by
am17an
build: refine toplevel .gitignore
script
Script related
#14355
opened Jun 24, 2025 by
zhouwg
1 task done
Add script to test op perf and compare
python
python script changes
script
Script related
#14354
opened Jun 24, 2025 by
yeahdongcn
vulkan: Increase workgroup size for GLU, for performance
ggml
changes relating to the ggml tensor library for machine learning
Vulkan
Issues specific to the Vulkan backend
#14345
opened Jun 23, 2025 by
jeffbolznv
Make the shell scripts cross-platform
devops
improvements to build systems and github actions
examples
script
Script related
server
SYCL
https://en.wikipedia.org/wiki/SYCL - GPU programming language
testing
Everything test related
#14341
opened Jun 23, 2025 by
vedranmiletic
vulkan: lock accesses of pinned_memory vector
ggml
changes relating to the ggml tensor library for machine learning
Vulkan
Issues specific to the Vulkan backend
#14333
opened Jun 22, 2025 by
jeffbolznv
Fix appearance of the chats list context menu for the browser Safari
examples
server
#14322
opened Jun 22, 2025 by
rntk