michaelnny
  • Shanghai

Pinned

  1. rl4llm Public

    RL4LLM: A Research-Friendly RL Framework for LLM Post-Tuning

    Python

  2. alpha_zero Public

    A PyTorch implementation of DeepMind's AlphaZero agent to play Go and Gomoku board games

    Python 161 36

  3. deep_rl_zoo Public

    A collection of Deep Reinforcement Learning algorithms implemented with PyTorch to solve Atari games and classic control tasks like CartPole, LunarLander, and MountainCar.

    Python 116 12

  4. muzero Public

    A PyTorch implementation of DeepMind's MuZero agent

    Python 36 6

  5. InstructLLaMA Public

    Implements pre-training, supervised fine-tuning (SFT), and reinforcement learning from human feedback (RLHF), to train and fine-tune the LLaMA2 model to follow human instructions, similar to Instru…

    Jupyter Notebook 54 13

634 contributions in the last year


Contribution activity

November 2025

4 contributions in private repositories Nov 1