Pinned Loading
-
PRIME-RL/PRIME
PRIME-RL/PRIME PublicScalable RL solution for advanced reasoning of language models
-
PRIME-RL/TTRL
PRIME-RL/TTRL Public[NeurIPS 2025] TTRL: Test-Time Reinforcement Learning
-
thunlp/UltraChat
thunlp/UltraChat PublicLarge-scale, Informative, and Diverse Multi-round Chat Data (and Models)
-
OpenBMB/UltraFeedback
OpenBMB/UltraFeedback PublicA large-scale, fine-grained, diverse preference dataset (and models).
-
thunlp/OpenPrompt
thunlp/OpenPrompt PublicAn Open-Source Framework for Prompt-Learning.
-
OpenBMB/MiniCPM
OpenBMB/MiniCPM PublicMiniCPM4 & MiniCPM4.1: Ultra-Efficient LLMs on End Devices, achieving 3+ generation speedup on reasoning tasks
Something went wrong, please refresh the page to try again.
If the problem persists, check the GitHub status page or contact support.
If the problem persists, check the GitHub status page or contact support.