ntuhpc
Pinned Loading
Repositories
- inference_results_v5.1 Public Forked from mlcommons/inference_results_v5.1
This repository contains the results and code for the MLPerf™ Inference v5.1 benchmark.
ntuhpc/inference_results_v5.1’s past year of commit activity - mlperf-automations Public Forked from mlcommons/mlperf-automations
This repository contains automation scripts designed to run MLPerf Inference benchmarks. Originally developed for the Collective Mind (CM) automation framework, these scripts have been adapted to leverage the MLC automation framework, maintained by the MLCommons Benchmark Infrastructure Working Group.
ntuhpc/mlperf-automations’s past year of commit activity - TensorRT-LLM Public Forked from NVIDIA/TensorRT-LLM
TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and support state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that orchestrate the inference execution in performant way.
ntuhpc/TensorRT-LLM’s past year of commit activity
Top languages
Loading…