Skip to content
View zlsh80826's full-sized avatar
🏠
Working from home
🏠
Working from home
  • NVIDIA
  • Taipei, Taiwan

Block or report zlsh80826

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

Large Context Attention

Python 681 53 Updated Jan 24, 2025

The Foundation for All Legate Libraries

203 62 Updated Feb 8, 2025

Convert .ninja_log files to chrome's about:tracing format.

Python 433 45 Updated Jun 5, 2024

My notes of Clean Code book

5,948 829 Updated Nov 26, 2023

C/C++ Performance Profiler

C++ 4,260 353 Updated Jan 31, 2025

A Fusion Code Generator for NVIDIA GPUs (commonly known as "nvFuser")

C++ 300 55 Updated Feb 13, 2025

JAX-Toolbox

Jupyter Notebook 280 56 Updated Feb 13, 2025

Useful shortcuts for bash/zsh

852 118 Updated Nov 15, 2020

Benchmarking unity builds on real c++ projects.

Shell 14 1 Updated Jul 5, 2020

A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit floating point (FP8) precision on Hopper and Ada GPUs, to provide better performance with lower memory utilizatio…

Python 2,158 360 Updated Feb 13, 2025

A Python framework for high performance GPU simulation and graphics

Python 4,544 260 Updated Feb 13, 2025

Flax is a neural network library for JAX that is designed for flexibility.

Jupyter Notebook 6,327 662 Updated Feb 13, 2025

Task-based datasets, preprocessing, and evaluation for sequence models.

Python 568 55 Updated Feb 13, 2025
Python 2,744 307 Updated Jan 30, 2025

Composable transformations of Python+NumPy programs: differentiate, vectorize, JIT to GPU/TPU, and more

Python 31,245 2,883 Updated Feb 13, 2025

A tool to classify and statistic GPU kernel information.

Python 8 Updated Jun 25, 2024

C++ Design Patterns

C++ 4,297 953 Updated May 12, 2024

📚 Modern C++ Tutorial: C++11/14/17/20 On the Fly | https://changkun.de/modern-cpp/

C++ 24,442 3,029 Updated Aug 17, 2024

An efficient GPU resource sharing system with fine-grained control for Linux platforms.

C++ 76 31 Updated Mar 25, 2024
C++ 3 Updated Aug 12, 2020

An Aspiring Drop-In Replacement for NumPy at Scale

Python 824 79 Updated Feb 7, 2025

NVIDIA® TensorRT™ is an SDK for high-performance deep learning inference on NVIDIA GPUs. This repository contains the open source components of TensorRT.

C++ 11,170 2,152 Updated Feb 1, 2025

AddressSanitizer, ThreadSanitizer, MemorySanitizer

C 11,752 1,051 Updated Jan 23, 2025
Go 2 Updated Feb 25, 2023

CUDA Python: Performance meets Productivity

Python 1,099 92 Updated Feb 12, 2025

Development repository for the Triton language and compiler

C++ 14,376 1,780 Updated Feb 13, 2025

⚡ Dynamically generated stats for your github readmes

JavaScript 71,417 23,812 Updated Feb 12, 2025

The Triton Inference Server provides an optimized cloud and edge inferencing solution.

Python 8,725 1,523 Updated Feb 13, 2025
Next
Showing results