-
Notifications
You must be signed in to change notification settings - Fork 12.6k
Pull requests: ggml-org/llama.cpp
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
CANN: Add fused FFN op
Ascend NPU
issues specific to Ascend NPUs
ggml
changes relating to the ggml tensor library for machine learning
CANN: Add broadcast for softmax and FA
Ascend NPU
issues specific to Ascend NPUs
ggml
changes relating to the ggml tensor library for machine learning
testing
Everything test related
introduce how to build with Vulkan on Raspbian OS
documentation
Improvements or additions to documentation
#15206
opened Aug 10, 2025 by
MaoJianwei
Loading…
webui: prettify styling
examples
server
#15201
opened Aug 9, 2025 by
olegshulyakov
Loading…
11 tasks done
server: implementation of v1/completions echo logprobs support
examples
server
#15189
opened Aug 9, 2025 by
fo40225
Loading…
ggml-rpc: chunk send()/recv() to avoid EINVAL for very large tensors over RPC (macOS & others)
ggml
changes relating to the ggml tensor library for machine learning
#15188
opened Aug 9, 2025 by
Tak-RS
Loading…
ggml: add changes relating to the ggml tensor library for machine learning
testing
Everything test related
conv3d
op
ggml
#15182
opened Aug 8, 2025 by
rmatif
Loading…
server : implement /api/version endpoint for ollama compatibility (#15167 )
examples
server
#15177
opened Aug 8, 2025 by
albert-polak
Loading…
GPT-OSS: parse commentary tool calls; handle glued 'json'; add unit tests (#15102)
testing
Everything test related
#15158
opened Aug 7, 2025 by
Nerexis
Loading…
sycl: Fix and disable more configurations of mul_mat
ggml
changes relating to the ggml tensor library for machine learning
SYCL
https://en.wikipedia.org/wiki/SYCL - GPU programming language
#15151
opened Aug 7, 2025 by
Rbiessy
Loading…
kleidiai: fix unsigned overflow bug
ggml
changes relating to the ggml tensor library for machine learning
#15150
opened Aug 7, 2025 by
chaxu01
Loading…
chat : Avoid partial reasoning tags in response content
testing
Everything test related
#15149
opened Aug 7, 2025 by
p1-0tr
Loading…
SVE support for exponential functions
ggml
changes relating to the ggml tensor library for machine learning
#15145
opened Aug 7, 2025 by
s-goto-11
Loading…
OpenCL: allow mixed f16/f32 add
ggml
changes relating to the ggml tensor library for machine learning
OpenCL
Issues specific to the OpenCL backend
#15140
opened Aug 6, 2025 by
rmatif
Loading…
CUDA: Optimize changes relating to the ggml tensor library for machine learning
Nvidia GPU
Issues specific to Nvidia GPUs
testing
Everything test related
reduce_rows_f32
kernel, leading up to 25x perf improvement on kernel-level and 10% perf increase for Gemma3n
ggml
#15132
opened Aug 6, 2025 by
ORippler
Loading…
Add T5Gemma support #14940
python
python script changes
#15123
opened Aug 6, 2025 by
baonudesifeizhai
Loading…
Previous Next
ProTip!
Exclude everything labeled
bug
with -label:bug.