michaelnny
  • Shanghai

Pinned

  1. rl4llm Public

    RL4LLM: A Research-Friendly RL Framework for LLM Post-Tuning

    Python

  2. alpha_zero Public

    A PyTorch implementation of DeepMind's AlphaZero agent to play Go and Gomoku board games

    Python 146 34

  3. deep_rl_zoo Public

    A collection of Deep Reinforcement Learning algorithms implemented with PyTorch to solve Atari games and classic control tasks like CartPole, LunarLander, and MountainCar.

    Python 115 11

  4. muzero Public

    A PyTorch implementation of DeepMind's MuZero agent

    Python 35 6

  5. InstructLLaMA Public

    Implements pre-training, supervised fine-tuning (SFT), and reinforcement learning from human feedback (RLHF), to train and fine-tune the LLaMA2 model to follow human instructions, similar to Instru…

    Jupyter Notebook 51 11

369 contributions in the last year


Contribution activity

August 1, 2025

michaelnny has no activity yet for this period.