Hello, glad to see you here. I think the following links should be a useful reference for what you need:
https://github.com/ggml-org/llama.cpp/blob/master/ggml/src/ggml-blas/ggml-blas.cpp
https://github.com/ggml-org/ggml/blob/master/examples/simple/simple-backend.cpp
https://github.com/ggml-org/ggml/tree/master/examples/simple
The ggml-qnn backend might be another useful reference for your zDNN backend. When I started on ggml-qnn I knew nothing about QNN and had only limited knowledge of ggml internals, so I'm sure you will succeed with your zDNN backend.
You can also use a command-line program to understand how a given backend works (offloading a specified ggml op to a hardware accelerator or NPU through the well-designed ggml backend subsystem) and to verify your zDNN backend accordingly: https://github.com/kantv-ai/llama.cpp/blob/kantvai-ggmlqnn-npurpc/tests/ggml-qnn-ut.cpp
One more important thing: slaren is the author of the ggml backend subsystem, so you may want to ask him any related technical questions.
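To make the offloading idea above concrete, here is a minimal, self-contained sketch of per-op dispatch. Everything in it (`Op`, `Node`, `Backend`, `CpuBackend`, `AccelBackend`, `run_graph`) is a hypothetical name invented for illustration and is not the ggml API; the real interfaces are the ones implemented in the ggml-blas.cpp file linked above. The "accelerator" here only claims matrix multiplication and computes it on the host as a stand-in for a device call.

```cpp
#include <cassert>
#include <cstddef>
#include <stdexcept>
#include <string>
#include <vector>

// Hypothetical op set for illustration; this is not ggml's op enum.
enum class Op { Add, MulMat };

// Toy graph node operating on flat row-major float buffers.
struct Node {
    Op op;
    std::vector<float> a, b, out;
    std::size_t m, k, n;  // MulMat only: a is m x k, b is k x n
};

// Reference matmul, used here as a stand-in for a real device call.
static void naive_mul_mat(Node & node) {
    assert(node.a.size() == node.m * node.k);
    assert(node.b.size() == node.k * node.n);
    node.out.assign(node.m * node.n, 0.0f);
    for (std::size_t i = 0; i < node.m; ++i)
        for (std::size_t j = 0; j < node.n; ++j)
            for (std::size_t p = 0; p < node.k; ++p)
                node.out[i * node.n + j] += node.a[i * node.k + p] * node.b[p * node.n + j];
}

// Hypothetical backend interface: report which ops are supported, then run them.
struct Backend {
    virtual ~Backend() = default;
    virtual std::string name() const = 0;
    virtual bool supports_op(const Node & node) const = 0;
    virtual void compute(Node & node) const = 0;
};

// Fallback backend: handles every op in this toy op set.
struct CpuBackend : Backend {
    std::string name() const override { return "CPU"; }
    bool supports_op(const Node &) const override { return true; }
    void compute(Node & node) const override {
        if (node.op == Op::MulMat) { naive_mul_mat(node); return; }
        assert(node.a.size() == node.b.size());
        node.out.resize(node.a.size());
        for (std::size_t i = 0; i < node.a.size(); ++i) node.out[i] = node.a[i] + node.b[i];
    }
};

// Pretend accelerator: only claims MulMat; everything else falls through.
struct AccelBackend : Backend {
    std::string name() const override { return "ACCEL"; }
    bool supports_op(const Node & node) const override { return node.op == Op::MulMat; }
    void compute(Node & node) const override { naive_mul_mat(node); }
};

// Run each node on the first backend that supports it; return the names used.
std::vector<std::string> run_graph(std::vector<Node> & graph,
                                   const std::vector<const Backend *> & backends) {
    std::vector<std::string> used;
    for (Node & node : graph) {
        const Backend * chosen = nullptr;
        for (const Backend * b : backends) {
            if (b->supports_op(node)) { chosen = b; break; }
        }
        if (chosen == nullptr) throw std::runtime_error("no backend supports this op");
        chosen->compute(node);
        used.push_back(chosen->name());
    }
    return used;
}
```

Calling `run_graph(graph, {&accel, &cpu})` with the accelerator listed first sends each matrix multiplication to `AccelBackend` and every other node to `CpuBackend`. Listing the fallback last is the design choice that lets a new backend start with a single supported op and grow from there.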
Hey @zhouwg, thanks for providing that useful list of references that I could use! It has been helpful referencing Is there a specific function that I should be calling/putting the tensor initialisation implementation such that The preparation steps include:
I've investigated the Any pointers would be very much appreciated! :)
Hi everyone,
I'm trying to implement a zDNN (z-Deep Neural Network) backend for the IBM NNPA accelerator in Telum processors, but I am unsure how to start. This accelerator is a synchronous compute device that requires its own ztensor data structure for the inputs and outputs of every computation.
I've tried using the CPU and CUDA implementations as references, but I am confused about which methods/functions must be implemented for an accelerator device.
It would be really helpful if someone could provide a "shopping cart" list of items to do, or point me to documentation that I may have missed. Thank you!