-
Notifications
You must be signed in to change notification settings - Fork 76
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
In Docker Python doesn't see quant_cuda library. #34
Comments
Do this to get it to work.
|
I have made a packaged version of that cuda module
But your machine does need to be able to compile cuda (just like this repo). |
And as of now i could not find a way to activate gpus during build time of a docker image. And it is troublesome to install cuda compiled python packages in docker. I am currently doing it by specifying the architecture. Dockerfile: FROM nvidia/cuda:12.1.1-cudnn8-devel-ubuntu20.04
ARG DEBIAN_FRONTEND=noninteractive
RUN apt-get update && apt-get install -y git build-essential python3 python3-pip
# there is no gpu access during docker build time :(
ENV TORCH_CUDA_ARCH_LIST Turing
RUN pip install git+https://github.com/osbm/quant_cuda And run:
Prints out:
But sadly this image is 17.6GB. I need to find a way to lower this, but here you go |
To make it work "site-packages/quant_cuda-0.0.0-py3.9-linux-x86_64.egg/llmtune/engine/quant" should be added to the PATH.
The text was updated successfully, but these errors were encountered: