-
algorithmic-efficiency Public
Forked from mlcommons/algorithmic-efficiencyMLCommons Algorithmic Efficiency is a benchmark and competition measuring neural network training speedups due to algorithmic improvements in both training algorithms and models.
Python Apache License 2.0 UpdatedMar 9, 2025 -
-
-
-
Muon Public
Forked from KellerJordan/MuonMuon optimizer: +>30% sample efficiency with <3% wallclock overhead
Python MIT License UpdatedFeb 24, 2025 -
-
-
-
-
-
-
-
SWANOptimizer Public
Unofficial implementation of https://arxiv.org/abs/2412.13148
-
diffusers Public
Forked from huggingface/diffusers🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch and FLAX.
Python Apache License 2.0 UpdatedDec 12, 2024 -
HeavyBall Public
Forked from HomebrewML/HeavyBallEfficient optimizers
Python BSD 2-Clause "Simplified" License UpdatedDec 9, 2024 -
fsdp_optimizers Public
supporting pytorch FSDP for optimizers
-
neurallambda Public
Forked from neurallambda/neurallambdaReasoning Computers. Lambda Calculus, Fully Differentiable. Also Neural Stacks, Queues, Arrays, Lists, Trees, and Latches.
Python Other UpdatedNov 26, 2024 -
-
-
-
-
-
-
-
-
-
-
-
-
Previous Next