Pinned Loading
Repositories
Showing 10 of 152 repositories
- vllm Public Forked from vllm-project/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
recogni/vllm’s past year of commit activity - microxcaling_traceable Public Forked from microsoft/microxcaling
PyTorch emulation library for Microscaling (MX)-compatible data formats.
recogni/microxcaling_traceable’s past year of commit activity - foundation-model-stack Public Forked from foundation-model-stack/foundation-model-stack
🚀 Collection of components for development, training, tuning, and inference of foundation models leveraging PyTorch native components.
recogni/foundation-model-stack’s past year of commit activity - licenseheaders Public Forked from torsten-pf/licenseheaders
Simple python script to add/replace license headers in a directory tree of source files
recogni/licenseheaders’s past year of commit activity - Atom_communication Public Forked from efeslab/Atom
[MLSys'24] Atom: Low-bit Quantization for Efficient and Accurate LLM Serving
recogni/Atom_communication’s past year of commit activity