Skip to content
Change the repository type filter

All

    Repositories list

    • torchada

      Public
      Adapter package for torch_musa to act exactly like PyTorch CUDA
      Python
      11701Updated Jan 29, 2026Jan 29, 2026
    • LiteGS

      Public
      A refactored codebase for Gaussian Splatting. Training 3DGS in 50 seconds!
      Cuda
      2933130Updated Jan 27, 2026Jan 27, 2026
    • C++
      2500Updated Jan 26, 2026Jan 26, 2026
    • TypeScript
      1101Updated Jan 23, 2026Jan 23, 2026
    • Provides a Python interface to GPU management and monitoring functions. This is a wrapper around the MTML library.
      C
      2410Updated Jan 22, 2026Jan 22, 2026
    • PyTorch media decoding and encoding
      Python
      88100Updated Jan 22, 2026Jan 22, 2026
    • torch_musa is an open source repository based on PyTorch, which can make full use of the super computing power of MooreThreads graphics cards.
      Python
      33471230Updated Jan 19, 2026Jan 19, 2026
    • A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit floating point (FP8) precision on Hopper and Ada GPUs, to provide better per…
      Python
      617901Updated Jan 14, 2026Jan 14, 2026
    • Python
      11000Updated Jan 14, 2026Jan 14, 2026
    • Shell
      64361Updated Jan 13, 2026Jan 13, 2026
    • muAlg

      Public
      Cooperative primitives for CUDA C++. See https://github.com/NVIDIA/cccl
      Cuda
      463600Updated Jan 12, 2026Jan 12, 2026
    • Domain-specific language designed to streamline the development of high-performance GPU/CPU/Accelerators kernels
      Python
      421000Updated Jan 12, 2026Jan 12, 2026
    • tvm_musa

      Public
      Open Machine Learning Compiler Framework
      Python
      3.8k000Updated Jan 9, 2026Jan 9, 2026
    • SimuMax

      Public
      a static analytical model for LLM distributed training
      Python
      1411420Updated Jan 8, 2026Jan 8, 2026
    • A nvImageCodec library of GPU- and CPU- accelerated codecs featuring a unified interface
      Jupyter Notebook
      12000Updated Jan 7, 2026Jan 7, 2026
    • Pretrain, finetune ANY AI model of ANY size on 1 or 10,000+ GPUs with zero code changes.
      Python
      3.7k100Updated Jan 7, 2026Jan 7, 2026
    • StableGS

      Public
      Cuda
      0910Updated Jan 5, 2026Jan 5, 2026
    • FFmpeg

      Public
      Mirror of https://git.ffmpeg.org/ffmpeg.git
      C
      13k100Updated Dec 30, 2025Dec 30, 2025
    • vision

      Public
      Datasets, Transforms and Models specific to Computer Vision
      Python
      7.2k000Updated Dec 29, 2025Dec 29, 2025
    • mutlass

      Public
      MUSA Templates for Linear Algebra Subroutines
      C++
      1.6k4110Updated Dec 19, 2025Dec 19, 2025
    • mate

      Public
      MUSA AI Tensor Engine
      C++
      0500Updated Dec 19, 2025Dec 19, 2025
    • kineto

      Public
      HTML
      3100Updated Nov 20, 2025Nov 20, 2025
    • AI_Agent

      Public
      Jupyter Notebook
      0000Updated Nov 17, 2025Nov 17, 2025
    • URPO

      Public
      0000Updated Nov 14, 2025Nov 14, 2025
    • PyTorch Extension Library of Optimized Autograd Sparse Matrix Operations
      Python
      159000Updated Oct 17, 2025Oct 17, 2025
    • PyTorch Extension Library of Optimized Scatter Operations
      Python
      204000Updated Oct 17, 2025Oct 17, 2025
    • Get up and running with OpenAI gpt-oss, DeepSeek-R1, Gemma 3 and other models on MTGPU.
      Go
      14k3420Updated Oct 13, 2025Oct 13, 2025
    • PaddlePaddle custom device implementaion. (『飞桨』自定义硬件接入实现)
      C++
      212000Updated Sep 24, 2025Sep 24, 2025
    • audio

      Public
      Data manipulation and transformation for audio signal processing, powered by PyTorch
      Python
      759000Updated Sep 14, 2025Sep 14, 2025
    • 0000Updated Sep 10, 2025Sep 10, 2025