PhD at UChicago
RL for language models & agents, especially GUI
Highlights
- Pro
Pinned Loading
-
Gen-Verse/dLLM-RL
Gen-Verse/dLLM-RL Public[ICLR 2026] TraceRL & TraDo-8B: Revolutionizing Reinforcement Learning Framework for Diffusion Large Language Models
-
Gen-Verse/Open-AgentRL
Gen-Verse/Open-AgentRL PublicAn open-source reinforcement learning framework for training LLM-based agents — supporting GRPO, PPO, RLHF, multi-turn reasoning, tool use, and distributed training.
-
Gen-Verse/CURE
Gen-Verse/CURE Public[NeurIPS 2025 Spotlight] Co-Evolving LLM Coder and Unit Tester via Reinforcement Learning
Something went wrong, please refresh the page to try again.
If the problem persists, check the GitHub status page or contact support.
If the problem persists, check the GitHub status page or contact support.
