# Notational/formatting stuff - [x] all the V and Qs wrapped inside \mathrm{} - [x] all Rewards R and transition T wrapped inside \mathrm{} - [x] horizon _h appears on the subscript, policy \pi or star on superscript - [x] horizon $h$ is lowercase - [x] abbreviation "MDP" should always be uppercase # Content stuff (thanks Mardavij for comments too) - [x] openning paragraph - [x] 10.1.2 infinite-horizon - [x] 10.8 side note - [x] demote the DP notebox onto side note - [x] differentiate between fix-policy Q_{\pi} and optimal Q^* - [x] (in fact, typically small) remove - [x] decide if to introduce sink/terminal state
Notational/formatting stuff
Content stuff
(thanks Mardavij for comments too)