tsaoyu

🚀 Working

Tony Yu Cao tsaoyu

LLM, Reinforcement Learning, Robotics

90 followers · 39 following

https://www.tsaoyu.com

Achievements

Highlights

Pro

Organizations

Pinned

ct_example Public

Advanced control (iLQR, MPC, GNMS) examples with control toolbox in ROS

C++ · 24 stars · 3 forks

openai/baselines Public

OpenAI Baselines: high-quality implementations of reinforcement learning algorithms

Python · 16.4k stars · 4.9k forks

OpenRLHF/OpenRLHF Public

An Easy-to-use, Scalable and High-performance RLHF Framework based on Ray (PPO & GRPO & REINFORCE++ & vLLM & Ray & Dynamic Sampling & Async Agentic RL)

Python · 8k stars · 773 forks

volcengine/verl Public

verl: Volcano Engine Reinforcement Learning for LLMs

Python · 13.5k stars · 2.4k forks