Skip to content
Change the repository type filter

All

    Repositories list

    • triton

      Public
      Development repository for the Triton language and compiler
      Python
      MIT License
      1.8k107851Updated Feb 17, 2025Feb 17, 2025
    • TensorFlow ROCm port
      C++
      Apache License 2.0
      75k6904476Updated Feb 17, 2025Feb 17, 2025
    • pytorch

      Public
      Tensors and Dynamic neural networks in Python with strong GPU acceleration
      Python
      Other
      23k2216047Updated Feb 17, 2025Feb 17, 2025
    • 🤗Transformers: State-of-the-art Natural Language Processing for Pytorch and TensorFlow 2.0.
      Python
      Apache License 2.0
      28k405Updated Feb 17, 2025Feb 17, 2025
    • vllm

      Public
      A high-throughput and memory-efficient inference and serving engine for LLMs
      Python
      Apache License 2.0
      5.7k65627Updated Feb 17, 2025Feb 17, 2025
    • xla

      Public
      A machine learning compiler for GPUs, CPUs, and ML accelerators
      C++
      Apache License 2.0
      5033019Updated Feb 17, 2025Feb 17, 2025
    • Advanced Profiling and Analytics for AMD Hardware
      Python
      MIT License
      511405212Updated Feb 17, 2025Feb 17, 2025
    • rocMLIR

      Public
      MLIR
      Other
      39137121Updated Feb 17, 2025Feb 17, 2025
    • This is the AMD-maintained fork of the LLVM git repository. This repository accepts pull requests and issues related to AMD fork-specific topics (amd/*). For all other issues/PRs, please submit upstream at https://github.com/llvm/llvm-project.
      LLVM
      Other
      13k133188Updated Feb 17, 2025Feb 17, 2025
    • ROCm Documentation Python package for ReadTheDocs build standardization
      CSS
      Other
      161484Updated Feb 17, 2025Feb 17, 2025
    • Ongoing research training transformer models at scale
      Python
      Other
      2.6k15011Updated Feb 17, 2025Feb 17, 2025
    • jax

      Public
      Composable transformations of Python+NumPy programs: differentiate, vectorize, JIT to GPU/TPU, and more
      Python
      Apache License 2.0
      2.9k19015Updated Feb 17, 2025Feb 17, 2025
    • aomp

      Public
      AOMP is an open source Clang/LLVM based compiler with added support for the OpenMP® API on Radeon™ GPUs. Use this repository for releases, issues, documentation, packaging, and examples.
      Fortran
      Apache License 2.0
      48211344Updated Feb 17, 2025Feb 17, 2025
    • AMD's graph optimization engine.
      C++
      MIT License
      9520835450Updated Feb 17, 2025Feb 17, 2025
    • rocSPARSE

      Public
      Next generation SPARSE implementation for ROCm platform
      C++
      MIT License
      5611910Updated Feb 17, 2025Feb 17, 2025
    • aiter

      Public
      AI Tensor Engine for ROCm
      Cuda
      MIT License
      62355Updated Feb 17, 2025Feb 17, 2025
    • Composable Kernel: Performance Portable Programming Model for Machine Learning Tensor Operators
      C++
      Other
      1473462556Updated Feb 17, 2025Feb 17, 2025
    • HIP

      Public
      HIP: C++ Heterogeneous-Compute Interface for Portability
      C++
      MIT License
      5473.9k2041Updated Feb 17, 2025Feb 17, 2025
    • ROCm

      Public
      AMD ROCm™ Software - GitHub Home
      Shell
      MIT License
      4055k11315Updated Feb 17, 2025Feb 17, 2025
    • apex

      Public
      A PyTorch Extension: Tools for easy mixed precision and distributed training in Pytorch
      Python
      BSD 3-Clause "New" or "Revised" License
      1.4k19136Updated Feb 17, 2025Feb 17, 2025
    • hipBLASLt

      Public
      hipBLASLt is a library that provides general matrix-matrix operations with a flexible API and extends functionalities beyond a traditional BLAS library
      Assembly
      MIT License
      10579974Updated Feb 17, 2025Feb 17, 2025
    • MIOpen

      Public
      AMD's Machine Intelligence Library
      Assembly
      Other
      2421.1k24864Updated Feb 17, 2025Feb 17, 2025
    • aotriton

      Public
      Ahead of Time (AOT) Triton Math Library
      Python
      MIT License
      1852121Updated Feb 17, 2025Feb 17, 2025
    • clr

      Public
      C++
      MIT License
      531171621Updated Feb 17, 2025Feb 17, 2025
    • ROCgdb

      Public
      This is ROCgdb, the ROCm source-level debugger for Linux, based on GDB, the GNU source-level debugger.
      C
      GNU General Public License v2.0
      105351Updated Feb 17, 2025Feb 17, 2025
    • xformers

      Public
      Hackable and optimized Transformers building blocks, supporting a composable construction.
      Python
      Other
      6482492Updated Feb 17, 2025Feb 17, 2025
    • Fast and memory-efficient exact attention
      Python
      BSD 3-Clause "New" or "Revised" License
      1.5k154206Updated Feb 17, 2025Feb 17, 2025
    • A collection of examples for the ROCm software stack
      C++
      MIT License
      4818521Updated Feb 17, 2025Feb 17, 2025
    • A system validation and diagnostics tool for monitoring, stress testing, detecting, and troubleshooting issues impacting AMD GPUs in high-performance computing environments
      C++
      MIT License
      4067010Updated Feb 17, 2025Feb 17, 2025
    • rccl

      Public
      ROCm Communication Collectives Library (RCCL)
      C++
      Other
      1362971415Updated Feb 17, 2025Feb 17, 2025