-
Notifications
You must be signed in to change notification settings - Fork 12k
Pull requests: ggml-org/llama.cpp
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
context : fix pos_min initialization upon decode error
#14008
opened Jun 4, 2025 by
ggerganov
Loading…
Fix CUDA build failure on AutoDL cloud platforms
devops
improvements to build systems and github actions
#14005
opened Jun 4, 2025 by
pockers21
Loading…
opencl: preliminary support for Q4_0 mul_mat_id using matvec
ggml
changes relating to the ggml tensor library for machine learning
[CANN]:Replace aclrtMemsetSync with InplaceZero operator for zero tensor creation
Ascend NPU
issues specific to Ascend NPUs
ggml
changes relating to the ggml tensor library for machine learning
#14002
opened Jun 4, 2025 by
luyhcsu
Loading…
vulkan: Enable VK_KHR_cooperative_matrix extension for Intel Xe2 GPUs
ggml
changes relating to the ggml tensor library for machine learning
Vulkan
Issues specific to the Vulkan backend
#14001
opened Jun 4, 2025 by
rillomas
Loading…
kv-cache : refactor the update/defrag mechanism
#13988
opened Jun 3, 2025 by
ggerganov
Loading…
1 task done
chore(server): split context-server to its own file
examples
server
#13987
opened Jun 3, 2025 by
mudler
Loading…
llama : allow building all tests on windows when not using shared libs
devops
improvements to build systems and github actions
testing
Everything test related
#13980
opened Jun 2, 2025 by
slaren
Loading…
sycl: GGML_SYCL_DISABLE_OPT on by default for all Intel Devices
documentation
Improvements or additions to documentation
ggml
changes relating to the ggml tensor library for machine learning
SYCL
https://en.wikipedia.org/wiki/SYCL - GPU programming language
#13973
opened Jun 2, 2025 by
ShanoToni
Loading…
ci: add LoongArch cross-compile build
devops
improvements to build systems and github actions
#13944
opened May 31, 2025 by
wojiushixiaobai
Loading…
llama : support multiple classifier outputs and labels
examples
#13940
opened May 31, 2025 by
CISC
Loading…
vulkan: automatically deduce size of push constants
ggml
changes relating to the ggml tensor library for machine learning
Vulkan
Issues specific to the Vulkan backend
#13936
opened May 31, 2025 by
jeffbolznv
Loading…
chat
: improve llama 3.x handling of <|python_tag|> (+ allow --special combo)
testing
[CANN]Support Acl Graph
ggml
changes relating to the ggml tensor library for machine learning
#13915
opened May 30, 2025 by
noemotiovon
•
Draft
[Ascend NPU] Enable labeler
devops
improvements to build systems and github actions
#13914
opened May 30, 2025 by
shink
Loading…
remove WIP since PR has been merged
documentation
Improvements or additions to documentation
#13912
opened May 30, 2025 by
pepijndevos
Loading…
convert: add eagle2 draft arch
python
python script changes
#13908
opened May 30, 2025 by
pockers21
Loading…
ci(intel): venv for python & pip installation for intel docker
devops
improvements to build systems and github actions
#13898
opened May 29, 2025 by
Thammachart
Loading…
ggml-cpu : split arch-specific implementations
ggml
changes relating to the ggml tensor library for machine learning
#13892
opened May 29, 2025 by
xctan
Loading…
Previous Next
ProTip!
Exclude everything labeled
bug
with -label:bug.