Skip to content

SuDIS-ZJU/llm-inference-all-in-one

Folders and files

NameName
Last commit message
Last commit date

Latest commit

ย 

History

11 Commits
ย 
ย 

Repository files navigation

๐Ÿš€ LLM Inference All-in-One ๐ŸŒŸ

Your ultimate guide to resources, papers, and blogs on Large Language Model (LLM) inference techniques! ๐Ÿ“šโœจ


๐Ÿ† Awesome Lists

Overview

  • ๐Ÿ”— Awesome-LLM-Inference
    A curated collection of papers and codes on LLM inference, including topics like FlashAttention, PagedAttention, and Parallelism.

  • ๐Ÿ”— Awesome LLM Systems Papers
    A curated list of Large Language Model systems related academic papers, articles, tutorials, slides and projects.


๐ŸŒ€ Speculative Decoding


๐Ÿ“ Long-Context Modeling

๐Ÿ”— Large Language Model Based Long Context Modeling Papers and Blogs
Dive deep into papers and blogs on extending LLM context length, efficient transformers, and retrieval-augmented generation (RAG). ๐Ÿง โœจ


๐Ÿงฉ Mixture of Experts (MoE)

๐Ÿ”— Awesome MoE LLM Inference System and Algorithm
A comprehensive list of resources for optimizing MoE-based LLM inference. Perfect for tackling sparse expert models! ๐ŸŒŸ


๐Ÿ—‚๏ธ KV Cache Management

Efficient management of KV Caches for LLM acceleration! โšก


๐Ÿ“ Resources

Explore insightful blogs and courses on cutting-edge LLM inference techniques! ๐ŸŒ

Courses

๐Ÿ”— ๅ…ฅ้—จๅฟ…ๅค‡ - Andrej Karpathy๏ผšไปŽ้›ถๅผ€ๅง‹ๆž„ๅปบ GPT ็ณปๅˆ—

๐Ÿ”— MIT 6.5940 TinyML ๅ’Œ้ซ˜ๆ•ˆ็š„ๆทฑๅบฆๅญฆไน ่ฎก็ฎ—

๐Ÿ”— UCSD CSE 234: Data Systems for Machine Learning

๐Ÿ”— CMU Large Language Model System Course

Blogs

๐Ÿ”— Learning notes for ML System

๐Ÿ”— A batch of noteworthy MLSys bloggers

Stay tuned for more updates! ๐ŸŽ‰

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Contributors 3

  •  
  •  
  •