Skip to content

Latest commit

 

History

History
13 lines (8 loc) · 856 Bytes

E2.md

File metadata and controls

13 lines (8 loc) · 856 Bytes

Some Considerations on Learning to Explore via Meta-Reinforcement Learning

Bradly C. Stadie, Ge Yang, Rein Houthooft, Xi Chen, Yan Duan, Yuhuai Wu, Pieter Abbeel, Ilya Sutskever

Abstract

We interpret meta-reinforcement learning as the problem of learning how to quickly find a good sampling distribution in a new environment.

This interpretation leads to the development of two new meta-reinforcement learning algorithms: E-MAML and E-RL2.

Results are presented on a new environment we call ‘Krazy World’: a difficult high-dimensional gridworld which is designed to highlight the importance of correctly differentiating through sampling distributions in meta-reinforcement learning.

Further results are presented on a set of maze environments.

We show E-MAML and E-RL2 deliver better performance than baseline algorithms on both tasks.