Reinforcement Learning: An Introduction

Notes and exercise solutions to the second edition of Sutton & Barto's book.

Some of the solutions have mistakes, so be mindful! If you spot a mistake, please raise an issue and (at some point...) I'll fix it.


Notes:

  • The code has been refactored as I've gone along, so some of the earlier exercises might break or rely on code that is now duplicated elsewhere.
  • I used the online draft, so the numbering of sections, equations and exercises might not be consistent with the published version of the book. This is unfortunate, but I don't have time to correct it.
  • If you find a typo or mistake somewhere, let me know.