Skip to content

Latest commit

 

History

History
15 lines (7 loc) · 241 Bytes

README.md

File metadata and controls

15 lines (7 loc) · 241 Bytes

The implementation of tabular solution methods in Reinformcement Learning, Sutton's book: Part I

On-policy first-visit Monte Carlo

Off-policy Monte Carlo

Double Q-learning

2-Step Tree Back-up

SARSA

2-Step Expected SARSA