Popular repositories
-
TransformerEngine (public, forked from NVIDIA/TransformerEngine)
A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit floating point (FP8) precision on Hopper GPUs, to provide better performance with lower memory utilization in bot…
Python 1
-
apex (public, forked from NVIDIA/apex)
A PyTorch extension: tools for easy mixed-precision and distributed training in PyTorch
Python
-
FasterTransformer (public, forked from NVIDIA/FasterTransformer)
Transformer-related optimization, including BERT and GPT
C++
-
TensorRT-LLM (public, forked from NVIDIA/TensorRT-LLM)
TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and support state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorR…
C++