Releases: CodeLinaro/llama.cpp
Releases · CodeLinaro/llama.cpp
b3409
CONTRIBUTING.md : remove mention of noci (#8541)
b3398
ggml : suppress unknown pragma 'GCC' on windows (#8460) This commit adds a macro guard to pragma GCC to avoid the following warning on windows: ```console C:\llama.cpp\ggml\src\ggml-aarch64.c(17,9): warning C4068: unknown pragma 'GCC' [C:\lama.cpp\build\ggml\src\ggml.vcxproj] ```
b3373
CUDA: optimize and refactor MMQ (#8416) * CUDA: optimize and refactor MMQ * explicit q8_1 memory layouts, add documentation
b3368
Name Migration: Build the deprecation-warning 'main' binary every tim…
b3263
Add MiniCPM, Deepseek V2 chat template + clean up `llama_chat_apply_t…
b3224
Gguf dump start data offset via --data-offset and some extra refactor…
b3145
rpc : fix ggml_backend_rpc_supports_buft() (#7918)
b3087
common : refactor cli arg parsing (#7675) * common : gpt_params_parse do not print usage * common : rework usage print (wip) * common : valign * common : rework print_usage * infill : remove cfg support * common : reorder args * server : deduplicate parameters ggml-ci * common : add missing header ggml-ci * common : remote --random-prompt usages ggml-ci * examples : migrate to gpt_params ggml-ci * batched-bench : migrate to gpt_params * retrieval : migrate to gpt_params * common : change defaults for escape and n_ctx * common : remove chatml and instruct params ggml-ci * common : passkey use gpt_params
b3060
convert-hf : Handle NotImplementedError in convert-hf-to-gguf (#7660)
b3029
[SYCL] Align GEMM dispatch (#7566) * align GEMM dispatch