Reading:
- Chapters 2, 4, 5, 6, and 8 of Nielsen and Chuang, Quantum Computation and Quantum Information (QCQI); Chapter 9 partially done.
- Chapters 1 and 2 of Sutton and Barto; Chapter 3 in progress.
Report writing:
- Write-up of Chapter 8 of Nielsen and Chuang (N&C) partially done.
Coding:
- Wrote a basic simulation of an $\epsilon$-greedy strategy for a multi-armed bandit.
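
For reference, a simulation of this kind might look roughly like the sketch below. It is not the code written for this log. The function name `run_epsilon_greedy`, the Gaussian rewards with unit variance, and the default parameter values are all assumptions made for illustration. With probability $\epsilon$ the agent pulls a random arm; otherwise it pulls the arm with the highest current value estimate, and estimates are updated as incremental sample averages.

```python
import random


def run_epsilon_greedy(true_means, epsilon=0.1, steps=1000, seed=0):
    """Simulate an epsilon-greedy agent on a Gaussian multi-armed bandit.

    true_means: mean reward of each arm (hypothetical test values).
    Rewards are assumed to be drawn from N(mean, 1) for illustration.
    Returns (value estimates, pull counts, total reward).
    """
    rng = random.Random(seed)
    n_arms = len(true_means)
    estimates = [0.0] * n_arms  # running estimate of each arm's value
    counts = [0] * n_arms       # number of pulls per arm
    total_reward = 0.0

    for _ in range(steps):
        if rng.random() < epsilon:
            # Explore: pick an arm uniformly at random.
            arm = rng.randrange(n_arms)
        else:
            # Exploit: pick the arm with the highest estimate
            # (ties resolve to the lowest index).
            arm = max(range(n_arms), key=lambda a: estimates[a])

        reward = rng.gauss(true_means[arm], 1.0)
        counts[arm] += 1
        # Incremental sample-average update: Q <- Q + (R - Q) / N.
        estimates[arm] += (reward - estimates[arm]) / counts[arm]
        total_reward += reward

    return estimates, counts, total_reward


if __name__ == "__main__":
    est, cnt, total = run_epsilon_greedy([0.0, 1.0, 5.0], epsilon=0.1, steps=2000)
    print("estimates:", [round(q, 2) for q in est])
    print("pull counts:", cnt)
    print("average reward:", round(total / sum(cnt), 2))
```

Setting `epsilon=0` gives a purely greedy agent and `epsilon=1` a purely random one, which makes it easy to compare average reward across exploration rates.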