Skip to content

Pull requests: ggml-org/llama.cpp

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

CANN: Add fused FFN op Ascend NPU issues specific to Ascend NPUs ggml changes relating to the ggml tensor library for machine learning
#15209 opened Aug 10, 2025 by hipudding Draft
CANN: Add broadcast for softmax and FA Ascend NPU issues specific to Ascend NPUs ggml changes relating to the ggml tensor library for machine learning testing Everything test related
#15208 opened Aug 10, 2025 by hipudding Draft
introduce how to build with Vulkan on Raspbian OS documentation Improvements or additions to documentation
#15206 opened Aug 10, 2025 by MaoJianwei Loading…
webui: prettify styling examples server
#15201 opened Aug 9, 2025 by olegshulyakov Loading…
11 tasks done
fix: llama_memory_seq_rm(mem, -1, ...) examples
#15200 opened Aug 9, 2025 by leok7v Loading…
ggml-rpc: chunk send()/recv() to avoid EINVAL for very large tensors over RPC (macOS & others) ggml changes relating to the ggml tensor library for machine learning
#15188 opened Aug 9, 2025 by Tak-RS Loading…
common : add GLM-4.5 tool calling support
#15186 opened Aug 8, 2025 by dhandhalyabhavik Loading…
ggml: add conv3d op ggml changes relating to the ggml tensor library for machine learning testing Everything test related
#15182 opened Aug 8, 2025 by rmatif Loading…
gpt-oss: implement harmony parsing examples server testing Everything test related
#15181 opened Aug 8, 2025 by aldehir Loading…
kv-cache : log (debug) all streams in find_slot
#15176 opened Aug 8, 2025 by danbev Loading…
server : enable -td and -tbd parameters examples server
#15172 opened Aug 8, 2025 by CISC Loading…
MoE Expert manipulation args
#15165 opened Aug 8, 2025 by kooshi Loading…
tool-call: Qwen3 Coder chat format support testing Everything test related
#15162 opened Aug 8, 2025 by ochafik Draft
Fix prompt cache
#15160 opened Aug 7, 2025 by 708-145 Loading…
sycl: Fix and disable more configurations of mul_mat ggml changes relating to the ggml tensor library for machine learning SYCL https://en.wikipedia.org/wiki/SYCL - GPU programming language
#15151 opened Aug 7, 2025 by Rbiessy Loading…
kleidiai: fix unsigned overflow bug ggml changes relating to the ggml tensor library for machine learning
#15150 opened Aug 7, 2025 by chaxu01 Loading…
chat : Avoid partial reasoning tags in response content testing Everything test related
#15149 opened Aug 7, 2025 by p1-0tr Loading…
SVE support for exponential functions ggml changes relating to the ggml tensor library for machine learning
#15145 opened Aug 7, 2025 by s-goto-11 Loading…
OpenCL: allow mixed f16/f32 add ggml changes relating to the ggml tensor library for machine learning OpenCL Issues specific to the OpenCL backend
#15140 opened Aug 6, 2025 by rmatif Loading…
CUDA: Optimize reduce_rows_f32 kernel, leading up to 25x perf improvement on kernel-level and 10% perf increase for Gemma3n ggml changes relating to the ggml tensor library for machine learning Nvidia GPU Issues specific to Nvidia GPUs testing Everything test related
#15132 opened Aug 6, 2025 by ORippler Loading…
Add T5Gemma support #14940 python python script changes
#15123 opened Aug 6, 2025 by baonudesifeizhai Loading…
ProTip! Exclude everything labeled bug with -label:bug.