Change the repository type filter
All
Repositories list
58 repositories
qserve
Publicefficientvit
PublicEfficient vision foundation models for high-resolution generation and perception.VisCompare
Publicnunchaku
Publicduo-attention
Publicllm-awq
Publicsparserefine
Publicdeepcompressor
Publicdistrifuser
Public[CVPR 2024 Highlight] DistriFusion: Distributed Parallel Inference for High-Resolution Diffusion Modelstinyengine
Public[NeurIPS 2020] MCUNet: Tiny Deep Learning on IoT Devices; [NeurIPS 2021] MCUNetV2: Memory-Efficient Patch-based Inference for Tiny Deep Learning; [NeurIPS 2022] MCUNetV3: On-Device Training Under 256KB MemoryQuest
Publictorchquantum
PublicA PyTorch-based framework for Quantum Classical Simulation, Quantum Machine Learning, Quantum Neural Networks, Parameterized Quantum Circuits with support for easy deployments on real quantum computers.torchsparse
Publictinychat-tutorial
Publichart
PublicBlock-Sparse-Attention
Publicdata-efficient-gans
Public[NeurIPS 2020] Differentiable Augmentation for Data-Efficient GAN Trainingproxylessnas
Public[ICLR 2019] ProxylessNAS: Direct Neural Architecture Search on Target Task and Hardwarespatten
Public[HPCA'21] SpAtten: Efficient Sparse Attention Architecture with Cascade Token and Head Pruningbevfusion
Public archive[ICRA'23] BEVFusion: Multi-Task Multi-Sensor Fusion with Unified Bird's-Eye View Representation- [ACL'20] HAT: Hardware-Aware Transformers for Efficient Natural Language Processing
smoothquant
Publicspvnas
Public archive[ECCV 2020] Searching Efficient 3D Architectures with Sparse Point-Voxel Convolutionlite-transformer
Public archivetemporal-shift-module
Public[ICCV 2019] TSM: Temporal Shift Module for Efficient Video UnderstandingTinyChatEngine
PublicTinyChatEngine: On-Device LLM Inference Library