> . The best speedups are both in the matrix ‘bibd_20_10’, Test bibd_20_10 in A100, performance are really bad.... DASP performance are slower than cusparse in many matrix. <img width="944" height="619" alt="Image" src="https://github.com/user-attachments/assets/214cc696-62a6-4bf8-8902-dffe40301854" />