Michael Hu michaelnny

Working on RL

44 followers · 8 following

Shanghai

Pinned

rl4llm Public

RL4LLM: A Research-Friendly RL Framework for LLM Post-Tuning

Python

alpha_zero Public

A PyTorch implementation of DeepMind's AlphaZero agent to play Go and Gomoku board games

Python · 161 stars · 36 forks

deep_rl_zoo Public

A collection of Deep Reinforcement Learning algorithms implemented with PyTorch to solve Atari games and classic control tasks like CartPole, LunarLander, and MountainCar.

Python · 116 stars · 12 forks

muzero Public

A PyTorch implementation of DeepMind's MuZero agent

Python · 36 stars · 6 forks

InstructLLaMA Public

Implements pre-training, supervised fine-tuning (SFT), and reinforcement learning from human feedback (RLHF), to train and fine-tune the LLaMA2 model to follow human instructions, similar to Instru…

Jupyter Notebook · 54 stars · 13 forks

634 contributions in the last year

Contribution activity

November 2025

4 contributions in private repositories Nov 1