GPUtopia Workerbee

How to use the worker:

  • First, set up an account at gputopia.ai; this is the easiest way to ensure your funds are swept correctly.
  • For now, only Alby logins are supported. We know this isn't ideal, but it's simpler for the moment; in the future, any LN wallet should work to log in and claim control over a given lnurl.
  • Download or build a release and put it somewhere sensible (e.g. /usr/bin/gputopia-worker).
  • From the command line, try: gputopia-worker --test_model TheBloke/CodeLlama-7B-Instruct-GGUF:Q4_K_M. Feel free to paste the results into a Discord channel for fun and discussion.
  • If that works, run gputopia-worker --ln_address your-ln@address-goes-here. A consolidated example follows below.
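
Putting those steps together, a minimal first session might look like this (the lightning address is a placeholder; substitute your own):

# smoke-test a small model first
gputopia-worker --test_model TheBloke/CodeLlama-7B-Instruct-GGUF:Q4_K_M

# if that works, start taking jobs, paid to your lightning address
gputopia-worker --ln_address your-ln@address-goes-here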

Worker command line options:

usage: gputopia-worker [-h] [--auth_key AUTH_KEY] [--queen_url QUEEN_URL] [--ln_address LN_ADDRESS] [--loops LOOPS] [--debug]
                            [--test_model TEST_MODEL] [--test_max_tokens TEST_MAX_TOKENS] 
                            [--main_gpu MAIN_GPU] [--tensor_split TENSOR_SPLIT] [--force_layers FORCE_LAYERS]
                            [--layer_offset LAYER_OFFSET] [--version]

options:
  -h, --help                          show this help message and exit
  --version                           output version and exit
  --auth_key AUTH_KEY                 access_token for account login
  --queen_url QUEEN_URL               coordinator url (wss://queenbee.gputopia.ai/worker)
  --ln_address LN_ADDRESS             lightning address ([email protected])
  --loops LOOPS                       quit after getting this number of jobs
  --debug                             verbose debugging info
  --test_model TEST_MODEL             specify a HF_REPO/PATH[:FILTER?] to test
  --test_max_tokens TEST_MAX_TOKENS   more == longer test
  --main_gpu MAIN_GPU                 default "0"
  --tensor_split TENSOR_SPLIT         default "even split", specify comma-delimited list of numbers
  --force_layers FORCE_LAYERS         default, guess layers based on model size
  --layer_offset LAYER_OFFSET         default "2" (fudge guess down by 2, leaving more room for context)
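
As an illustration of the GPU flags above (the values here are hypothetical; pick ones that match your hardware), a two-GPU machine splitting tensors 60/40 could be launched as:

gputopia-worker --ln_address your-ln@address-goes-here --main_gpu 0 --tensor_split 60,40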

How to build the worker from source:

When building, please ensure you have CUDA installed (for NVIDIA GPUs) or OpenCL (for AMD chips). You can also do a Metal build for macOS.

CUDA/NVIDIA build:

CMAKE_ARGS="-DLLAMA_CUBLAS=1" FORCE_CMAKE=1 poetry install --with onnx

macOS/Metal build:

CMAKE_ARGS="-DLLAMA_METAL=1" FORCE_CMAKE=1 poetry install --with onnx

In both cases, the CMAKE_ARGS flags are needed if you want it to see the GPUs!

CLBLAST build:

Get (or build) the OpenCL SDK:

https://github.com/KhronosGroup/OpenCL-SDK/releases

Put it in C:/OpenCL-SDK or, on Linux, cmake --install it.
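
On Linux, the SDK build-and-install might look like the following sketch (the SDK pulls in its dependencies as git submodules, hence --recursive):

git clone --recursive https://github.com/KhronosGroup/OpenCL-SDK.git
cmake -S OpenCL-SDK -B OpenCL-SDK/build
cmake --build OpenCL-SDK/build
sudo cmake --install OpenCL-SDK/build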

git clone https://github.com/CNugteren/CLBlast.git
mkdir CLBlast/build
cd CLBlast/build
cmake .. -DOPENCL_ROOT=C:/OpenCL-SDK -G "Visual Studio 17 2022" -A x64
cmake --build . --config Release
cmake --install . --prefix C:/CLBlast

CMAKE_ARGS="-DLLAMA_CLBLAST=ON -DCMAKE_PREFIX_PATH=C:/CLBlast/lib/cmake/CLBlast" FORCE_CMAKE=1 poetry install --with onnx

Run a dev-mode worker

  • poetry run gputopia_worker
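
The dev-mode entry point accepts the same flags as the release binary, so a quick local iteration might combine the documented options above, for example:

# take one verbose job, then exit
poetry run gputopia_worker --debug --loops 1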

Run a re-quantization on a GGUF

  • poetry run quantize_gguf

Run tests to be sure it really works

PYTHONPATH=. pytest tests/
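
While iterating, pytest's standard selection flags can narrow the run (the -k pattern here is a placeholder, not a real test name):

PYTHONPATH=. pytest tests/ -k worker -v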

Build your own EXE

pyinstaller --onefile --name gputopia-worker --additional-hooks-dir=./hooks ai_worker/__main__.py
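
PyInstaller writes the one-file binary to dist/ by default, so a quick sanity check of the result is:

./dist/gputopia-worker --version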
