POC:Avoid PackTranspose TinyBLAS_PPC in MMA kernel #14

shalinib-ibm · 2025-07-15T11:57:54Z

POC: Avoid Packing A Matrix in TinyBLAS_PPC Class

This patch transpositions weights matrix offline during PyTorch to gguf conversion.
Works only for symmetric matrices ( m = n = k).
This is because we need to transpose offline only those matrices which are input to llamafile_sgemm.
We cannot transpose all weights matrices because this would break graph construction.
No performance gains with this change.
Make sure to read the contributing guidelines before submitting a PR

Signed-off-by: Shalini Salomi Bodapati <[email protected]>

POC:Avoid PackTranspose TinyBLAS_PPC in MMA kernel

b0c47dd

Signed-off-by: Shalini Salomi Bodapati <[email protected]>

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

POC:Avoid PackTranspose TinyBLAS_PPC in MMA kernel #14

POC:Avoid PackTranspose TinyBLAS_PPC in MMA kernel #14

Uh oh!

shalinib-ibm commented Jul 15, 2025

Uh oh!

Uh oh!

POC:Avoid PackTranspose TinyBLAS_PPC in MMA kernel #14

Are you sure you want to change the base?

POC:Avoid PackTranspose TinyBLAS_PPC in MMA kernel #14

Uh oh!

Conversation

shalinib-ibm commented Jul 15, 2025

Uh oh!

Uh oh!