Releases · CodeLinaro/llama.cpp
b4450
fix: add missing msg in static_assert (#11143)
Signed-off-by: hydai <[email protected]>
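For context: C++ only made the message argument of `static_assert` optional in C++17, so builds targeting older standards fail when it is omitted. A minimal sketch of the pattern the fix restores (condition and message here are illustrative):

```cpp
#include <cstdint>

// Pre-C++17, the second argument is mandatory; omitting it is a
// compile error under -std=c++11/14.
static_assert(sizeof(std::uint64_t) == 8, "uint64_t must be 8 bytes");
```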
b4382
rpc-server : add support for the SYCL backend (#10934)
b4324
Opt class for positional argument handling (#10508)
Added support for positional arguments `model` and `prompt`, plus the ability to download models via strings such as:
    llama-run llama3
    llama-run ollama://granite-code
    llama-run ollama://granite-code:8b
    llama-run hf://QuantFactory/SmolLM-135M-GGUF/SmolLM-135M.Q2_K.gguf
    llama-run huggingface://bartowski/SmolLM-1.7B-Instruct-v0.2-GGUF/SmolLM-1.7B-Instruct-v0.2-IQ3_M.gguf
    llama-run https://example.com/some-file1.gguf
    llama-run some-file2.gguf
    llama-run file://some-file3.gguf
Signed-off-by: Eric Curtin <[email protected]>
b4302
ggml: load all backends from a user-provided search path (#10699)
* feat: load all backends from a user-provided search path
* fix: Windows search path
* refactor: rename `ggml_backend_load_all_in_search_path` to `ggml_backend_load_all_from_path`
* refactor: rename `search_path` to `dir_path`
* fix: change `NULL` to `nullptr`
Co-authored-by: Diego Devesa <[email protected]>
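A minimal sketch of calling the renamed entry point, declared in `ggml-backend.h`; the directory path is an assumption for illustration:

```cpp
#include <cstdio>
#include "ggml-backend.h"

int main() {
    // Load every ggml backend shared library found in this directory
    // (illustrative path; point it at your build's backend .so/.dll files).
    ggml_backend_load_all_from_path("/opt/llama.cpp/backends");

    // Enumerate whatever was registered.
    for (size_t i = 0; i < ggml_backend_reg_count(); ++i) {
        printf("backend: %s\n", ggml_backend_reg_name(ggml_backend_reg_get(i)));
    }
    return 0;
}
```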
b4301
vulkan: request round-to-even for fp16 in im2col/rope_head (#10767)
Vulkan doesn't mandate a specific rounding mode, but the shader_float_controls feature allows a rounding mode to be requested if the implementation supports it.
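A hedged sketch of the corresponding capability check on the host side, assuming Vulkan 1.2's `VkPhysicalDeviceFloatControlsProperties` (the helper name is made up):

```cpp
#include <vulkan/vulkan.h>

// Returns true if the device supports requesting round-to-even (RTE)
// for 16-bit floats, i.e. the shader may declare RoundingModeRTE.
bool supports_fp16_rte(VkPhysicalDevice dev) {
    VkPhysicalDeviceFloatControlsProperties fc = {};
    fc.sType = VK_STRUCTURE_TYPE_PHYSICAL_DEVICE_FLOAT_CONTROLS_PROPERTIES;

    VkPhysicalDeviceProperties2 props = {};
    props.sType = VK_STRUCTURE_TYPE_PHYSICAL_DEVICE_PROPERTIES_2;
    props.pNext = &fc;

    vkGetPhysicalDeviceProperties2(dev, &props);
    return fc.shaderRoundingModeRTEFloat16 == VK_TRUE;
}
```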
b4291
server : fix format_infill (#10724)
Fixes format_infill in the server; also renames internals and updates the tests, switching the test model and including a test_invalid_input_extra_req case.
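For orientation, a hedged sketch of the request shape that format_infill consumes on the server's `/infill` endpoint; the field names follow the server API referenced by test_invalid_input_extra_req, the values are illustrative, and nlohmann::json is used since the server itself depends on it:

```cpp
#include <iostream>
#include <nlohmann/json.hpp>

int main() {
    // Illustrative /infill body: prefix/suffix around the cursor, plus
    // input_extra chunks of additional context (the field validated by
    // test_invalid_input_extra_req).
    nlohmann::json req = {
        {"input_prefix", "def fib(n):\n    "},
        {"input_suffix", "\n    return fib(n - 1) + fib(n - 2)\n"},
        {"input_extra", nlohmann::json::array({
            {{"filename", "util.py"}, {"text", "def memoize(f):\n    ...\n"}}
        })},
    };
    std::cout << req.dump(2) << std::endl;
}
```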
b4267
Update deprecation-warning.cpp (#10619)
Fixed path separator handling for cross-platform support (Windows file systems).
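The portable pattern behind such a fix is usually to search for both separators instead of only `/` (a generic sketch, not the literal patch):

```cpp
#include <string>

// Return the file-name portion of a path, accepting both '/' and '\\'
// so the same code behaves on Windows and POSIX file systems.
static std::string path_basename(const std::string & path) {
    const size_t pos = path.find_last_of("/\\");
    return pos == std::string::npos ? path : path.substr(pos + 1);
}
```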
b4255
vulkan: optimize and reenable split_k (#10637)
Use vector loads when possible in mul_mat_split_k_reduce. Use split_k when there aren't enough workgroups to fill the shaders.
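The second heuristic amounts to a host-side decision along these lines (an illustrative sketch with made-up names and an assumed cap, not the actual dispatch code):

```cpp
#include <algorithm>
#include <cstdint>

// If the M x N tile grid alone cannot occupy every compute unit, split
// the K dimension across extra workgroups; their partial sums are then
// combined in a reduction pass (mul_mat_split_k_reduce above).
static uint32_t pick_split_k(uint32_t m_tiles, uint32_t n_tiles,
                             uint32_t compute_units) {
    const uint32_t workgroups = m_tiles * n_tiles;
    if (workgroups >= compute_units) {
        return 1; // grid already fills the device; no split needed
    }
    // The cap of 4 is an arbitrary illustrative limit.
    return std::min(4u, compute_units / std::max(workgroups, 1u));
}
```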
b4242
llama : add enum for built-in chat templates (#10623)
* llama : add enum for supported chat templates
* use "built-in" instead of "supported"
* arg: print list of built-in templates
* fix test
* update server README
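A hedged sketch of the pattern (identifiers are illustrative, not the actual llama.cpp symbols, and the real enum covers far more templates):

```cpp
#include <string>

// Map a template name, e.g. from --chat-template, to an enum value so
// the formatting code can switch on it instead of comparing strings.
enum class chat_template {
    CHATML,
    LLAMA2,
    ZEPHYR,
    UNKNOWN,
};

static chat_template chat_template_from_name(const std::string & name) {
    if (name == "chatml") return chat_template::CHATML;
    if (name == "llama2") return chat_template::LLAMA2;
    if (name == "zephyr") return chat_template::ZEPHYR;
    return chat_template::UNKNOWN;
}
```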
b4226
ggml : move AMX to the CPU backend (#10570)
Co-authored-by: Georgi Gerganov <[email protected]>