Skip to content
Change the repository type filter

All

    Repositories list

    • Composable Kernel: Performance Portable Programming Model for Machine Learning Tensor Operators
      C++
      Other
      1553522451Updated Feb 26, 2025Feb 26, 2025
    • MIOpen

      Public
      AMD's Machine Intelligence Library
      Assembly
      Other
      2421.1k24982Updated Feb 26, 2025Feb 26, 2025
    • This is the AMD-maintained fork of the LLVM git repository. This repository accepts pull requests and issues related to AMD fork-specific topics (amd/*). For all other issues/PRs, please submit upstream at https://github.com/llvm/llvm-project.
      LLVM
      Other
      13k135188Updated Feb 26, 2025Feb 26, 2025
    • aiter

      Public
      AI Tensor Engine for ROCm
      Cuda
      MIT License
      112657Updated Feb 26, 2025Feb 26, 2025
    • aotriton

      Public
      Ahead of Time (AOT) Triton Math Library
      Python
      MIT License
      1853120Updated Feb 26, 2025Feb 26, 2025
    • hipBLASLt

      Public
      hipBLASLt is a library that provides general matrix-matrix operations with a flexible API and extends functionalities beyond a traditional BLAS library
      Assembly
      MIT License
      10579974Updated Feb 26, 2025Feb 26, 2025
    • Python
      Other
      111889Updated Feb 26, 2025Feb 26, 2025
    • rccl

      Public
      ROCm Communication Collectives Library (RCCL)
      C++
      Other
      1383011225Updated Feb 26, 2025Feb 26, 2025
    • AMD's graph optimization engine.
      C++
      MIT License
      9420835747Updated Feb 26, 2025Feb 26, 2025
    • rpp

      Public
      AMD ROCm Performance Primitives (RPP) library is a comprehensive high-performance computer vision library for AMD processors with HIP/OpenCL/CPU back-ends.
      C++
      MIT License
      425803Updated Feb 26, 2025Feb 26, 2025
    • Fast and memory-efficient exact attention
      Python
      BSD 3-Clause "New" or "Revised" License
      1.5k157174Updated Feb 26, 2025Feb 26, 2025
    • MIVisionX

      Public
      MIVisionX toolkit is a set of comprehensive computer vision and machine intelligence libraries, utilities, and applications bundled into a single toolkit. AMD MIVisionX also delivers a highly optimized open-source implementation of the Khronos OpenVX™ and OpenVX™ Extensions.
      C++
      MIT License
      75190110Updated Feb 26, 2025Feb 26, 2025
    • Kubernetes (k8s) device plugin to enable registration of AMD GPU to a container cluster
      Go
      Apache License 2.0
      55306132Updated Feb 26, 2025Feb 26, 2025
    • aomp

      Public
      AOMP is an open source Clang/LLVM based compiler with added support for the OpenMP® API on Radeon™ GPUs. Use this repository for releases, issues, documentation, packaging, and examples.
      Fortran
      Apache License 2.0
      48211245Updated Feb 26, 2025Feb 26, 2025
    • rocDecode

      Public
      rocDecode is a high performance video decode SDK for AMD hardware
      C++
      Other
      182123Updated Feb 26, 2025Feb 26, 2025
    • ROCm Systems Profiler
      C++
      MIT License
      815411Updated Feb 26, 2025Feb 26, 2025
    • hipSOLVER

      Public
      ROCm SOLVER marshalling library
      C++
      MIT License
      2925010Updated Feb 26, 2025Feb 26, 2025
    • Jupyter Notebook
      114901Updated Feb 26, 2025Feb 26, 2025
    • vllm

      Public
      A high-throughput and memory-efficient inference and serving engine for LLMs
      Python
      Apache License 2.0
      5.9k66624Updated Feb 26, 2025Feb 26, 2025
    • hipTensor

      Public
      AMD’s C++ library for accelerating tensor primitives
      C++
      MIT License
      213803Updated Feb 26, 2025Feb 26, 2025
    • rocAL

      Public
      The AMD rocAL is designed to efficiently decode and process images and videos from a variety of storage formats and modify them through a processing graph programmable by the user.
      C++
      MIT License
      151373Updated Feb 26, 2025Feb 26, 2025
    • rocMLIR

      Public
      MLIR
      Other
      39137122Updated Feb 26, 2025Feb 26, 2025
    • ROCm Documentation Python package for ReadTheDocs build standardization
      CSS
      Other
      161483Updated Feb 26, 2025Feb 26, 2025
    • triton

      Public
      Development repository for the Triton language and compiler
      Python
      MIT License
      1.8k108750Updated Feb 25, 2025Feb 25, 2025
    • A system validation and diagnostics tool for monitoring, stress testing, detecting, and troubleshooting issues impacting AMD GPUs in high-performance computing environments
      C++
      MIT License
      4067010Updated Feb 25, 2025Feb 25, 2025
    • rocPyDecode is a set of Python bindings to rocDecode C++ library which provides full HW acceleration for video decoding on AMD GPUs.
      C++
      MIT License
      8313Updated Feb 25, 2025Feb 25, 2025
    • rocJPEG

      Public
      rocJPEG is a high-performance jpeg decode SDK for decoding jpeg images using a hardware-accelerated jpeg decoder on AMD’s GPUs.
      C++
      MIT License
      9310Updated Feb 25, 2025Feb 25, 2025
    • pytorch

      Public
      Tensors and Dynamic neural networks in Python with strong GPU acceleration
      Python
      Other
      23k2225739Updated Feb 25, 2025Feb 25, 2025
    • hipBLAS

      Public
      ROCm BLAS marshalling library
      C++
      Other
      8113216Updated Feb 25, 2025Feb 25, 2025
    • rocBLAS

      Public
      Next generation BLAS implementation for ROCm platform
      C++
      Other
      17436033Updated Feb 25, 2025Feb 25, 2025