Skip to content
@PRIME-RL

PRIME-RL

Researching scalable (RL) methods on language models.

Pinned Loading

  1. SimpleVLA-RL SimpleVLA-RL Public

    SimpleVLA-RL: Scaling VLA Training via Reinforcement Learning

    Python 747 31

  2. Entropy-Mechanism-of-RL Entropy-Mechanism-of-RL Public

    The Entropy Mechanism of Reinforcement Learning for Large Language Model Reasoning.

    Python 336 9

  3. TTRL TTRL Public

    [NeurIPS 2025] TTRL: Test-Time Reinforcement Learning

    Python 816 61

  4. PRIME PRIME Public

    Scalable RL solution for advanced reasoning of language models

    Python 1.7k 100

  5. ImplicitPRM ImplicitPRM Public

    Repo of paper "Free Process Rewards without Process Labels"

    Python 164 11

Repositories

Showing 6 of 6 repositories