Skip to content
Change the repository type filter

All

    Repositories list

    • KAMI

      Public
      KAMI: Communication-Avoiding General Matrix Multiplication within a Single GPU
      C++
      01200Updated Oct 31, 2025Oct 31, 2025
    • PanguLU

      Public
      PanguLU: A Scalable Regular Two-Dimensional Block-Cyclic Sparse Direct Solver on Distributed Heterogeneous Systems
      C
      94430Updated Aug 2, 2025Aug 2, 2025
    • Leda

      Public
      C++
      0100Updated Mar 27, 2025Mar 27, 2025
    • AmgT

      Public
      Source code of the SC '24 paper: "AmgT: Algebraic Multigrid Solver on Tensor Cores" by Yuechen Lu, Lijie Zeng, Tengcheng Wang, Xu Fu, Wenxuan Li, Helin Cheng, Dechuang Yang, Zhou Jin, Marc Casas and Weifeng Liu.
      C
      2830Updated Nov 11, 2024Nov 11, 2024
    • Cuper

      Public
      C++
      0000Updated Oct 24, 2024Oct 24, 2024
    • CSP

      Public
      Source code of the ICCAD'24 paper: "CSP: Comprehensively-Sparsified Preconditioner for Efficient Nonlinear Circuit Simulation" by Yuxuan Zhao, Xiaoyu Yang, Yinuo Bai, Lijie Zeng, Dan Niu, Weifeng Liu and Zhou Jin.
      C++
      0000Updated Oct 24, 2024Oct 24, 2024
    • Source code of our paper published at SC '24: Dechuang Yang, Yuxuan Zhao, Yiduo Niu, Weile Jia, En Shao, Weifeng Liu, Guangming Tan and Zhou Jin. Mille-feuille: A Tile-Grained Mixed Precision Single-Kernel Conjugate Gradient Solver on GPUs.
      Cuda
      1700Updated Oct 20, 2024Oct 20, 2024
    • Soda-PTA

      Public
      Source code of the ICCAD'24 paper: "Pseudo Adjoint Optimization: Harnessing the Solution Curve for SPICE Acceleration" by Jiatai Sun, Xiaru Zha, Chao Wang, Xiao Wu, Dan Niu, Wei W. Xing and Zhou Jin.
      Python
      0000Updated Oct 18, 2024Oct 18, 2024
    • Source code of the DATE '24 paper: "Efficient Spectral-Aware Power Supply Noise Analysis for Low-Power Design Verification"
      C++
      0100Updated Sep 19, 2024Sep 19, 2024
    • DASP

      Public
      Source code of the SC '23 paper: "DASP: Specific Dense Matrix Multiply-Accumulate Units Accelerated General Sparse Matrix-Vector Multiplication" by Yuechen Lu and Weifeng Liu.
      C++
      62740Updated Jun 18, 2024Jun 18, 2024
    • Source code of the PPoPP '22 paper: "TileSpGEMM: A Tiled Algorithm for Parallel Sparse General Matrix-Matrix Multiplication on GPUs" by Yuyao Niu, Zhengyang Lu, Haonan Ji, Shuhui Song, Zhou Jin, and Weifeng Liu.
      C
      104651Updated May 22, 2024May 22, 2024
    • HASpGEMM

      Public
      C
      1000Updated May 20, 2024May 20, 2024
    • HASpMV

      Public
      HASpMV: Heterogeneity-Aware Sparse Matrix-Vector Multiplication on Modern Asymmetric Multicore Processors
      C
      0200Updated May 20, 2024May 20, 2024
    • DALU

      Public
      Density-aware LU, based on SuperLU_DIST 8.0.0.
      C
      0200Updated Jan 16, 2024Jan 16, 2024
    • Source code of the ICPP '22 paper: "TileSpMSpV: A Tiled Algorithm for Sparse Matrix-Sparse Vector Multiplication on GPUs" by Haonan Ji, Huimin Song, Shibo Lu, Zhou Jin, Guangming Tan, and Weifeng Liu.
      C
      1500Updated Apr 18, 2023Apr 18, 2023
    • TileSpMV

      Public
      Source code of the IPDPS '21 paper: "TileSpMV: A Tiled Algorithm for Sparse Matrix-Vector Multiplication on GPUs" by Yuyao Niu, Zhengyang Lu, Meichen Dong, Zhou Jin, Weifeng Liu, and Guangming Tan.
      C
      41220Updated Aug 12, 2022Aug 12, 2022
    • Source code of the ICPP '20 paper: "Efficient Block Algorithms for Parallel Sparse Triangular Solve" by Zhengyang Lu, Yuyao Niu, and Weifeng Liu.
      C
      2600Updated Jan 24, 2022Jan 24, 2022