Skip to content
View michaelnny's full-sized avatar

Block or report michaelnny

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Pinned Loading

  1. Llama3-FunctionCalling Llama3-FunctionCalling Public

    Fine-tune Llama3 model to support function calling

    Jupyter Notebook 24 1

  2. InstructLLaMA InstructLLaMA Public

    Implements pre-training, supervised fine-tuning (SFT), and reinforcement learning from human feedback (RLHF), to train and fine-tune the LLaMA2 model to follow human instructions, similar to Instru…

    Jupyter Notebook 44 9

  3. RAG-LLaMA RAG-LLaMA Public archive

    A clean and simple implementation of Retrieval Augmented Generation (RAG) to enhanced LLaMA chat model to answer questions from a private knowledge base. We use Tesla user manuals to build the know…

    Jupyter Notebook 3 1

  4. alpha_zero alpha_zero Public

    A PyTorch implementation of DeepMind's AlphaZero agent to play Go and Gomoku board games

    Python 79 18

  5. muzero muzero Public archive

    A PyTorch implementation of DeepMind's MuZero agent

    Python 27 3

  6. deep_rl_zoo deep_rl_zoo Public archive

    A collection of Deep Reinforcement Learning algorithms implemented with PyTorch to solve Atari games and classic control tasks like CartPole, LunarLander, and MountainCar.

    Python 104 10