Releases · CodeLinaro/llama.cpp

28 May 00:47

852aafb

b3014

update HIP_UMA #7399 (#7414)

* update HIP_UMA #7399

add use of hipMemAdviseSetCoarseGrain when LLAMA_HIP_UMA is enable.
- get x2 on prompte eval and x1.5 on token gen with rocm6.0 on ryzen 7940HX iGPU (780M/gfx1103)

* simplify code, more consistent style

---------

Co-authored-by: slaren <[email protected]>

Assets 21

25 May 05:35

github-actions

b2993

902184d

b2993

fix missing slash in `fs_get_cache_directory()` (#7503)

* fix missing slash in fs_get_cache_directory()

* use LOCALAPPDATA for fs_get_cache_directory()

* better code style

Assets 21

24 May 22:05

github-actions

b2990

d041d2c

b2990

flake.lock: Update (#7232)

Flake lock file updates:

• Updated input 'flake-parts':
    'github:hercules-ci/flake-parts/e5d10a24b66c3ea8f150e47dfdb0416ab7c3390e?narHash=sha256-yzcRNDoyVP7%2BSCNX0wmuDju1NUCt8Dz9%2BlyUXEI0dbI%3D' (2024-05-02)
  → 'github:hercules-ci/flake-parts/8dc45382d5206bd292f9c2768b8058a8fd8311d9?narHash=sha256-/GJvTdTpuDjNn84j82cU6bXztE0MSkdnTWClUCRub78%3D' (2024-05-16)
• Updated input 'nixpkgs':
    'github:NixOS/nixpkgs/63c3a29ca82437c87573e4c6919b09a24ea61b0f?narHash=sha256-4cPymbty65RvF1DWQfc%2BBc8B233A1BWxJnNULJKQ1EY%3D' (2024-05-02)
  → 'github:NixOS/nixpkgs/4a6b83b05df1a8bd7d99095ec4b4d271f2956b64?narHash=sha256-%2BNpbZRCRisUHKQJZF3CT%2Bxn14ZZQO%2BKjxIIanH3Pvn4%3D' (2024-05-17)

Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>

Assets 21

24 May 00:41

github-actions

b2986

74f33ad

b2986

readme : remove trailing space (#7469)

Assets 21

21 May 18:03

github-actions

b2957

c3f8d58

b2957

tests : test-tokenizer-0.sh print more info (#7402)

Assets 21

20 May 21:37

github-actions

b2953

917dc8c

b2953

Tokenizer SPM fixes for phi-3 and llama-spm (#7375)

* Update brute force test: special tokens
* Fix added tokens
  - Try to read 'added_tokens.json'.
  - Try to read 'tokenizer_config.json'.
  - Try to read 'tokenizer.json'.
* Fix special tokens rtrim

Co-authored-by: Georgi Gerganov <[email protected]>
* server : fix test regexes

Assets 21

16 May 05:11

github-actions

fix-win-artifacts-b2897-e910cea

e910cea

fix-win-artifacts-b2897-e910cea

ci: fix bin/Release path for windows-arm64 builds

Switch to Ninja Multi-Config CMake generator to resurect bin/Release path
that broke artifact packaging in CI.

Assets 21

15 May 21:09

github-actions

b2894

e1b40ac

b2894

ggml : use dynamic thread scheduling for matrix multiplication (#6915)

* Just reordering some structs.

* Adding in the calls to mm_pause

* Passing around the state

* Renaming and moving a bunch of variables around.

* Extracting the logic to it's own function.

* Moving some variable definitions into the chunk function.

* Moving some variables around

* moving src1_cont inside

* Moving row_size

* adding the current_chunk

* Reorg the code.

* Formatting to match the orig patch

* starting to setup the chunking variables

* Starting the buildup of the loop

* The yield shouldn't be necessary.

* adding the looping structure based on the chunk configuration.

* Add in the re-chunking code.

* Making it much more likely to rechunk.

* disable resizing if numa is enabled.

* Updating comments with what we've learned.

* Fix formatting

* Couple more formatting fixes.

* More style fixes.

* Fix Warnings

* Going with unused because there's conditional logic that needs it.

* Update ggml.c

* Update ggml.c

---------

Assets 20

14 May 22:53

github-actions

b2886

9f77348

b2886

script : sync ggml-rpc

Assets 20

13 May 03:29

github-actions

b2866

b1f8af1

b2866

convert.py: Outfile default name change and additional metadata suppo…

Assets 19

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Releases: CodeLinaro/llama.cpp

b3014

Uh oh!

b2993

Uh oh!

b2990

Uh oh!

b2986

Uh oh!

b2957

Uh oh!

b2953

Uh oh!

fix-win-artifacts-b2897-e910cea

Uh oh!

b2894

Uh oh!

b2886

Uh oh!

b2866

Uh oh!