foldl commented Dec 9, 2023

Add notes on ChatLLM.cpp, which supports the DeepSeek-LLM model.

add notes on ChatLLM.cpp.

DOGEwbx (Contributor) commented Dec 11, 2023

Hi @foldl, thanks for supporting our work. I took a brief look at your repo and found that you implement the same input preprocessing logic as llama.cpp, which we found differs from the Python implementation. We would appreciate it if you could refer to this PR and make the corresponding modifications to your work.

Benjamin-eecs changed the title from "Update README.md" to "docs(README): update README.md" on Dec 11, 2023

foldl (Author) commented Dec 11, 2023

Yes, some pre-processing rules are not implemented, which may cause subtle differences. I may add them later. At present, I am busy adding more models.

4 participants