lupuandr

Follow

Andrei Lupu lupuandr

Follow

Ph.D. at Oxford and Meta AI studying (multi-agent) reinforcement learning. Formerly at Mila / McGill University

11 followers · 3 following

FLAIR, University of Oxford / FAIR team at Meta AI
Oxford, UK

Achievements

Achievements

Highlights

Pro

Organizations

Pinned Loading

Target-UCB Target-UCB Public

Simple implementation of the Target-UCB algorithm.

Python 2
luchris429/purejaxrl luchris429/purejaxrl Public

Really Fast End-to-End Jax RL Implementations

Python 765 63
FLAIROx/JaxMARL FLAIROx/JaxMARL Public

Multi-Agent Reinforcement Learning with JAX

Python 468 89
montrealrobotics/DeepRLInTheWorld montrealrobotics/DeepRLInTheWorld Public

From search engines, to science, to robotics, this reposity is meant to showcase the use of reinforcement learning in the world..

221 29
facebookresearch/off-belief-learning facebookresearch/off-belief-learning Public archive

Implementation of the Off Belief Learning algorithm.

Python 46 8
FLAIROx/behaviour-distillation FLAIROx/behaviour-distillation Public

Code for Behaviour Distillation (ICML 2024)

Python 3