bruno686/Awesome-RL-based-LLM-Reasoning

Awesome-RL-based-LLM-Reasoning

PRs Welcome | License: MIT | Awesome

We have witnessed the powerful capabilities of pure RL-based LLM reasoning. This repository collects the newest papers, slides, and other interesting materials on enhancing LLM reasoning with reinforcement learning, to help everyone learn quickly!
Starring this repository keeps you at the forefront of RL-based LLM reasoning.
In the teeth of the storm.

Papers

Outcome-based Reward Model

Process-based Reward Model

Reinforcement learning

Search algorithms (Monte Carlo Tree Search or Beam Search)

Other Recent Interesting Papers on LLM Reasoning

Slides and Discussion

Video

Open-Source Project

Introduction to Reinforcement Learning

Cloud GPU

  • Compshare (new accounts receive a 50 yuan credit after registration, enough to run R1 with Unsloth)

Other Interesting RL-based Reasoning Repositories

Contributing

  • Feel free to contribute more papers or any other resources!