-
NVIDIA
- Hangzhou, Zhejiang
- https://fanshiqing.github.io/
Pinned Loading
-
Megatron-LM
Megatron-LM PublicForked from NVIDIA/Megatron-LM
Ongoing research training transformer language models at scale, including: BERT & GPT-2
Python 1
-
DAPPLE
DAPPLE PublicForked from AlibabaPAI/DAPPLE
An Efficiency Pipelined Data Parallel Approach for Large Models Training
Python 3
-
grouped_gemm
grouped_gemm PublicForked from tgale96/grouped_gemm
PyTorch bindings for CUTLASS grouped GEMM.
-
TransformerEngine
TransformerEngine PublicForked from NVIDIA/TransformerEngine
A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit floating point (FP8) precision on Hopper and Ada GPUs, to provide better performance with lower memory utilizatio…
Python
If the problem persists, check the GitHub status page or contact support.