Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[BUG] nexa convert fails: missing imports #372

Closed
iwr-redmond opened this issue Feb 5, 2025 · 3 comments
Closed

[BUG] nexa convert fails: missing imports #372

iwr-redmond opened this issue Feb 5, 2025 · 3 comments
Labels
🐞 bug Something isn't working

Comments

@iwr-redmond
Copy link

iwr-redmond commented Feb 5, 2025

Issue Description

Running nexa convert results in:

Error during conversion: name 'GGML_TYPE_BF16' is not defined

This appears to be resolvable by adding the missing types to llama_cpp.py#L106

GGML_TYPE_BF16 = 32
GGML_TYPE_Q4_0_4_4 = 33
GGML_TYPE_Q4_0_4_8 = 34
GGML_TYPE_Q4_0_8_8 = 35

I have obtained these values from line 382.

Steps to Reproduce

  1. Install nexaai[convert]==0.0.98 --extra-index-url https://github.nexa.ai/whl/cu124
  2. Run nexa convert [some-hf-repo] q4_1 /your/model.gguf
  3. Observe the error 😢
  4. Add the lines shown above
  5. Observe the successful conversion 😄

OS

Ubuntu 22.04

Python Version

3.10

Nexa SDK Version

0.0.98-cu124

GPU (if using one)

NVIDIA RTX 3060

@iwr-redmond iwr-redmond added the 🐞 bug Something isn't working label Feb 5, 2025
@xsxszab
Copy link
Collaborator

xsxszab commented Feb 5, 2025

Thanks for reporting this issue, A PR has been posted to address it

@keithem
Copy link

keithem commented Feb 5, 2025 via email

@iwr-redmond
Copy link
Author

I've tested the fix in ea46950 and confirmed it works. Thank you!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
🐞 bug Something isn't working
Projects
None yet
Development

No branches or pull requests

3 participants