-
Notifications
You must be signed in to change notification settings - Fork 13.3k
Pull requests: ggml-org/llama.cpp
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
webui: remove client-side context pre-check and rely on backend for limits
examples
server
#16506
opened Oct 10, 2025 by
ServeurpersoCom
Loading…
fix: add remark plugin to render raw HTML as literal text
examples
server
#16505
opened Oct 10, 2025 by
ServeurpersoCom
Loading…
Switch to using Ubuntu 25.10 vulkan/mesa
devops
improvements to build systems and github actions
#16497
opened Oct 10, 2025 by
ericcurtin
Loading…
metal : fix mul-mm condition + fix mul-mv permuted kernels
Apple Metal
https://en.wikipedia.org/wiki/Metal_(API)
ggml
changes relating to the ggml tensor library for machine learning
#16494
opened Oct 10, 2025 by
ggerganov
Loading…
CUDA: faster tile FA, add oob checks, more HSs
ggml
changes relating to the ggml tensor library for machine learning
Nvidia GPU
Issues specific to Nvidia GPUs
python
python script changes
#16492
opened Oct 9, 2025 by
JohannesGaessler
Loading…
Remove Legacy Copy-OP Pointer Indirection Code
ggml
changes relating to the ggml tensor library for machine learning
Nvidia GPU
Issues specific to Nvidia GPUs
#16485
opened Oct 9, 2025 by
anavp-nvidia
Loading…
Add AfmoeForCausalLM support
python
python script changes
#16477
opened Oct 8, 2025 by
bartowski1182
•
Draft
fix: convert_hf_to_gguf - change Jamba non-sentencepiece mode (tokeni…
python
python script changes
#16470
opened Oct 8, 2025 by
amirai21
Loading…
vulkan: Add State Space Model (SSM) Operations Support
ggml
changes relating to the ggml tensor library for machine learning
Vulkan
Issues specific to the Vulkan backend
#16463
opened Oct 7, 2025 by
giuseppe
Loading…
Add hipblasLt implementation for batched gemm to improve performance for CDNA3 only
ggml
changes relating to the ggml tensor library for machine learning
Nvidia GPU
Issues specific to Nvidia GPUs
#16457
opened Oct 7, 2025 by
peizhang56
Loading…
vulkan: Handle FA with all -inf mask values
ggml
changes relating to the ggml tensor library for machine learning
Vulkan
Issues specific to the Vulkan backend
#16447
opened Oct 6, 2025 by
jeffbolznv
Loading…
Metal Pool 1D Kernel
Apple Metal
https://en.wikipedia.org/wiki/Metal_(API)
ggml
changes relating to the ggml tensor library for machine learning
testing
Everything test related
#16429
opened Oct 5, 2025 by
ThoreKoritzius
Loading…
fix: add generic fallback to detect trailing <think> tags in Jinja templates and handle forced-open reasoning blocks
testing
Everything test related
#16426
opened Oct 4, 2025 by
ServeurpersoCom
•
Draft
server / ranking : add sorting and management of top_n
examples
server
#16403
opened Oct 3, 2025 by
YannFollet
Loading…
Add ARANGE Operator to SYCL Backend (Small & Focused Changes)
ggml
changes relating to the ggml tensor library for machine learning
SYCL
https://en.wikipedia.org/wiki/SYCL - GPU programming language
#16362
opened Sep 30, 2025 by
GittyBurstein
Loading…
feat: render user content as markdown option
examples
server
#16358
opened Sep 30, 2025 by
ServeurpersoCom
Loading…
SYCL SET operator optimized for F32 tensors
ggml
changes relating to the ggml tensor library for machine learning
SYCL
https://en.wikipedia.org/wiki/SYCL - GPU programming language
#16350
opened Sep 30, 2025 by
GittyBurstein
Loading…
Previous Next
ProTip!
Follow long discussions with comments:>50.