Overview

This repository implements an example tokenizer module based on SentencePiece for Llama-based LLMs. To build and run the samples here, initialize the submodules and build with clang:

./init_submodules.sh

# ./build.sh
cmake -Bbuild -GNinja -S . \
    -DCMAKE_C_COMPILER=clang \
    -DCMAKE_CXX_COMPILER=clang++

To use the tokenizer, you need to generate a parameter archive for it. The provided Python script can do this, given a build of the IREE runtime with Python bindings. (Requires a HuggingFace token.)

python gen_tokenizer_irpa.py
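How the script picks up the HuggingFace token is not described here. The sketch below assumes it authenticates through the huggingface_hub library, which reads the `HF_TOKEN` environment variable. `have_hf_token` and `gen_tokenizer` are hypothetical wrappers, not part of this repository.

```shell
#!/bin/sh
# Hypothetical helper (not part of this repository): succeed only when a
# HuggingFace token is present in the environment.
# Assumption: the script authenticates via huggingface_hub, which reads
# the HF_TOKEN environment variable.
have_hf_token() {
  [ -n "${HF_TOKEN:-}" ]
}

# Hypothetical wrapper: refuse to start the generator without a token,
# so the failure shows up before any download is attempted.
gen_tokenizer() {
  if ! have_hf_token; then
    echo "HF_TOKEN is not set; export a HuggingFace access token first." >&2
    return 1
  fi
  python gen_tokenizer_irpa.py
}
```

With these definitions loaded, you would export `HF_TOKEN` with your own access token and then call `gen_tokenizer` from the repository root.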

IREE releases and build instructions can be found here:

Releases page: https://github.com/openxla/iree/releases
Building from source: https://iree.dev/building-from-source/

To see an example of the tokenizer on its own, try:

./compile_and_run.sh

Note that this also requires a build of iree-compiler.

To try a full chatbot, separately download and compile, through SHARK-Turbine, a variant of Llama that is compatible with the tokenizer you generated. An example can be found here: https://github.com/nod-ai/SHARK-Turbine/tree/main/examples/llama2_inference

License

Note: This repository is being developed for inclusion in the IREE project in some form.

Licensed under the Apache License v2.0 with LLVM Exceptions.
See https://llvm.org/LICENSE.txt for license information.
SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception

Folders and files

Latest commit: qedawkins, "Fix license", a84584d, Jan 29, 2024 (12 commits)

Name	Last commit message	Last commit date
chatbot	Drop prints and switch to vulkan	Jan 6, 2024
src/tokenizer	Fix license	Jan 29, 2024
third_party	Add example based on meta-llama tokenizer	Jan 5, 2024
.clang-format	Finish module + clang format	Jan 4, 2024
.gitignore	Add example based on meta-llama tokenizer	Jan 5, 2024
.gitmodules	Fork dynamic sample and init repo	Dec 29, 2023
CMakeLists.txt	Fix license	Jan 29, 2024
README.md	Fix license	Jan 29, 2024
build.sh	Bump IREE and fix cmake	Jan 2, 2024
chat.sh	Cleanup iree_make_status error messaging now that it's fixed	Jan 8, 2024
compile_and_run.sh	Cleanup iree_make_status error messaging now that it's fixed	Jan 8, 2024
gen_tokenizer_irpa.py	Add README and License	Jan 29, 2024
init_submodules.sh	Bump IREE and fix cmake	Jan 2, 2024


qedawkins/iree-llm
