Skip to content

Pull requests: ggml-org/llama.cpp

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

common : add minimalist multi-thread progress bar
#17602 opened Nov 29, 2025 by angt Loading…
cuda : add error checking for cudaMemcpyAsync in argsort (#12836) ggml changes relating to the ggml tensor library for machine learning Nvidia GPU Issues specific to Nvidia GPUs
#17599 opened Nov 29, 2025 by Mahekk357 Loading…
clip: fix nb calculation for qwen3-vl examples
#17594 opened Nov 29, 2025 by ngxson Loading…
Feature/kimi linear support ggml changes relating to the ggml tensor library for machine learning model Model specific Nvidia GPU Issues specific to Nvidia GPUs python python script changes
#17592 opened Nov 29, 2025 by cacaview Loading…
Override SSM_A op for Qwen3 Next to reduce splits model Model specific
#17587 opened Nov 29, 2025 by pwilkin Loading…
Improve Qwen3-Next Speed model Model specific
#17585 opened Nov 29, 2025 by lovedheart Draft
Add support for CUMSUM and TRI for CUDA. ggml changes relating to the ggml tensor library for machine learning Nvidia GPU Issues specific to Nvidia GPUs testing Everything test related
#17584 opened Nov 28, 2025 by pwilkin Loading…
cmake: fix macOS build with -DGGML_BACKEND_DL=ON ggml changes relating to the ggml tensor library for machine learning
#17581 opened Nov 28, 2025 by giladgd Loading…
Add safetensors support
#17580 opened Nov 28, 2025 by ericcurtin Draft
Add PagedAttention support (experimental, CUDA only) ggml changes relating to the ggml tensor library for machine learning Nvidia GPU Issues specific to Nvidia GPUs
#17579 opened Nov 28, 2025 by ericcurtin Loading…
model: LFM2-VL fixes examples ggml changes relating to the ggml tensor library for machine learning Nvidia GPU Issues specific to Nvidia GPUs testing Everything test related
#17577 opened Nov 28, 2025 by tdakhran Loading…
HIP: enable WMMA-MMQ INT kernels for RDNA 3 ggml changes relating to the ggml tensor library for machine learning Nvidia GPU Issues specific to Nvidia GPUs
#17576 opened Nov 28, 2025 by jiachengjason Draft
mtmd: support dots.ocr examples python python script changes
#17575 opened Nov 28, 2025 by ngxson Draft
[SYCL] enhance argsort for UT ggml changes relating to the ggml tensor library for machine learning SYCL https://en.wikipedia.org/wiki/SYCL - GPU programming language
#17573 opened Nov 28, 2025 by NeoZhangJianyu Loading…
Server: Change Invalid Schema from Server Error (500) to User Error (400) examples python python script changes server testing Everything test related
#17572 opened Nov 28, 2025 by chadvoegele Loading…
ggml-hexagon: fix rope failure at test-backend-ops ggml changes relating to the ggml tensor library for machine learning
#17565 opened Nov 28, 2025 by chraac Loading…
CANN: The Ger operator of OUT_PROD is not supported on the 310p device Ascend NPU issues specific to Ascend NPUs ggml changes relating to the ggml tensor library for machine learning
#17563 opened Nov 28, 2025 by TianHao324 Loading…
New llama-run examples server
#17554 opened Nov 27, 2025 by ericcurtin Loading…
cmake : add option to build and link LibreSSL
#17552 opened Nov 27, 2025 by angt Loading…
ggml-cpu: Add operator-level execution time profiling ggml changes relating to the ggml tensor library for machine learning
#17544 opened Nov 27, 2025 by kimminsu38oo Loading…
CANN: add support for partial RoPE and Vision mode Ascend NPU issues specific to Ascend NPUs ggml changes relating to the ggml tensor library for machine learning
#17543 opened Nov 27, 2025 by noemotiovon Loading…
ProTip! Filter pull requests by the default branch with base:master.