Skip to content

Explore DeepSeek R1🚀: reproduction guides, papers, insightful tweets&blogs to explore and learn. 🌟

Notifications You must be signed in to change notification settings

remember00000/Awesome-DeepSeek-R1-Resources

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

2 Commits
 
 
 
 
 
 

Repository files navigation

Awesome-deepseek-r1-resources

Reproduction

  • [🔗 link] — huggingface/open-r1: Fully open reproduction of DeepSeek-R1
  • [🔗 link] — Jiayi-Pan/TinyZero
  • [🔗 link] — hkust-nlp/simpleRL-reason: This is a replicate of DeepSeek-R1-Zero and DeepSeek-R1 training on small models with limited data

Related Resources

  • R1+Sonnet Shatters Benchmark at 14X Lower CostDeepSeek R1 paired with Sonnet achieved 64% on the aider polyglot benchmark, outperforming o1 while costing 14X less. Users highlighted its MIT license and adoption at top universities.
  • R1 Re-Distillation Boosts Qwen-1.5B: Mobius Labs’ redistilled R1 variant surpassed the original, with plans to expand to other architectures.
  • R1’s Arena Rankings Spark GPU Allocation Theories: R1 hit #3 in LMArena, matching o1’s coding performance at 20x cheaper, fueled by rumors of spare NVIDIA H100 usage and Chinese government backing.
  • [🔗 link] — atfortes/Awesome-LLM-Reasoning: Reasoning in LLMs: Papers and Resources, including Chain-of-Thought, OpenAI o1, and DeepSeek-R1 🍓
  • [🔗 link] — Nathan Lambert: DeepSeek R1's recipe to replicate o1 and the future of reasoning LMs

Findings

Launch

When we say "R1", it's ambiguous. DeepSeek actually dropped 8 R1 models - 2 "full" models, and 6 distillations on open models:

  • from Qwen 2.5: finetuned with 800k samples curated with DeepSeek-R1, in 1.5B, 7B, 14B, and 32B
  • from Llama 3.1 8B Base: DeepSeek-R1-Distill-Llama-8B
  • from Llama3.3-70B-Instruct: DeepSeek-R1-Distill-Llama-70B
  • and DeepSeek-R1 and DeepSeek-R1-Zero, the full-size, 671B MoE models similar to DeepSeek V3. Surprisingly, MIT licensed rather than custom licenses, including explicit OK for finetuning and distillation

Other notables from the launch:

Surprises from the paper:

image.png

image.png

Jiayi-Pan(TinyZero):

About

Explore DeepSeek R1🚀: reproduction guides, papers, insightful tweets&blogs to explore and learn. 🌟

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published