@IST-DASLab

IST Austria Distributed Algorithms and Systems Lab

Popular repositories

  1. gptq Public

    Code for the ICLR 2023 paper "GPTQ: Accurate Post-training Quantization of Generative Pretrained Transformers".

    Python 2.2k 182

  2. marlin Public

    FP16xINT4 LLM inference kernel that can achieve near-ideal ~4x speedups up to medium batch sizes of 16-32 tokens.

    Python 908 75

  3. sparsegpt Public

    Code for the ICML 2023 paper "SparseGPT: Massive Language Models Can Be Accurately Pruned in One-Shot".

    Python 841 112

  4. PanzaMail Public

    Python 294 19

  5. qmoe Public

    Code for the paper "QMoE: Practical Sub-1-Bit Compression of Trillion-Parameter Models".

    Python 277 23

  6. llmq Public

    Quantized LLM training in pure CUDA/C++.

    C++ 193 9

Repositories

Showing 10 of 65 repositories
  • llmq Public

    Quantized LLM training in pure CUDA/C++.

    C++ 193 9 0 0 Updated Oct 11, 2025
  • ISTA-DASLab-Optimizers
    Python 10 Apache-2.0 0 0 0 Updated Oct 8, 2025
  • FP-Quant Public
    Python 51 6 5 1 Updated Oct 6, 2025
  • qutlass Public

    QuTLASS: CUTLASS-Powered Quantized BLAS for Deep Learning

    C++ 114 Apache-2.0 9 0 1 Updated Oct 1, 2025
  • gptq-gguf-toolkit Public

    GPTQ and efficient search for GGUF

    Python 51 4 0 1 Updated Sep 17, 2025
  • Quartet Public
    Jupyter Notebook 101 MIT 10 5 0 Updated Aug 24, 2025
  • EvoPress Public
    Python 33 2 0 0 Updated Jul 30, 2025
  • QuEST Public

    Work in progress.

    Jupyter Notebook 74 MIT 6 2 0 Updated Jun 29, 2025
  • Yolov8-Pose-Detection-on-Browser Public Forked from akbartus/Yolov8-Pose-Detection-on-Browser

    Example of YOLOv8 pose detection (estimation) on browser. It shows implementations powered by ONNX and TFJS served through JavaScript without any frameworks. It demonstrates pose detection (estimation) on image as well as live web camera,

    HTML 0 MIT 3 0 0 Updated Jun 13, 2025
  • MoE-Quant Public

    Code for data-aware compression of DeepSeek models

    Python 56 9 2 0 Updated Jun 10, 2025
