Releases: CodeLinaro/llama.cpp
b5797
b5752
batch : fix check for empty sequences in memory (#14364)
* batch : fix check for empty sequences in memory
* cont : reuse the var
b5689
cmake: remove shader-gen step-targets from ggml-vulkan (#14226)
* Remove step-targets from vulkan-shaders-gen
* Unset DESTDIR when building vulkan-shaders-gen
b5686
common : suggest --jinja when autodetection fails (#14222)
b5627
llama : support GEGLU for jina-bert-v2 (#14090)
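For context on the GEGLU entry above: GEGLU is a gated feed-forward variant in which the up-projection output is split into two halves, one half is passed through GELU, and the result gates the other half elementwise. The sketch below is an illustrative stand-in, not llama.cpp's actual kernel; the tanh-approximation GELU and the half/half split convention are assumptions.

```cpp
#include <cmath>
#include <cstddef>
#include <vector>

// GELU, tanh approximation (assumed variant for this sketch).
static float gelu(float x) {
    return 0.5f * x * (1.0f + std::tanh(0.79788456f * (x + 0.044715f * x * x * x)));
}

// GEGLU sketch: split the FFN up-projection output `up` into two halves,
// apply GELU to the first half, and multiply elementwise with the second.
static std::vector<float> geglu(const std::vector<float> & up) {
    const size_t n = up.size() / 2;
    std::vector<float> out(n);
    for (size_t i = 0; i < n; ++i) {
        out[i] = gelu(up[i]) * up[n + i];
    }
    return out;
}
```

In a real model the two halves come from separate learned projections; the split here is just a compact way to show the gating.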
b5548
CUDA: fix typo in FlashAttention code (#13926)
b5460
release : fix windows hip release (#13707)
* release : fix windows hip release
* make single hip release with multiple targets
b5255
ci: fix cross-compile sync issues (#12804)
b5098
convert : ability to lazy-load safetensors remotely without downloadi…
b5022
opencl : fix memory allocation size (#12649)
issue: https://github.com/CodeLinaro/llama.cpp/pull/17#issuecomment-2760611283
This patch ensures that the requested memory allocation size does not exceed the maximum allocation size supported by the OpenCL device.
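To illustrate the idea behind the OpenCL fix above: OpenCL devices report a per-buffer limit via `clGetDeviceInfo(dev, CL_DEVICE_MAX_MEM_ALLOC_SIZE, ...)`, and requests must be capped to it. The snippet below is a minimal pure-C++ sketch of that capping logic, not the patch itself; the function name and the idea of passing the queried limit in as a parameter are assumptions for illustration.

```cpp
#include <algorithm>
#include <cstdint>

// Sketch: cap a requested buffer size at the device's maximum single
// allocation. In real OpenCL code, `max_alloc` would be the cl_ulong value
// returned by clGetDeviceInfo for CL_DEVICE_MAX_MEM_ALLOC_SIZE.
static uint64_t clamp_alloc_size(uint64_t requested, uint64_t max_alloc) {
    return std::min(requested, max_alloc);
}
```

Allocating above this limit is undefined per the OpenCL spec even when total device memory is larger, which is why the cap is applied per allocation rather than against total memory.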