Skip to content

mdp chap clean up #33

@shensquared

Description

@shensquared

Notational/formatting stuff

  • all the V and Qs wrapped inside \mathrm{}
  • all Rewards R and transition T wrapped inside \mathrm{}
  • horizon _h appears on the subscript, policy \pi or star on superscript
  • horizon $h$ is lowercase
  • abbreviation "MDP" should always be uppercase

Content stuff

(thanks Mardavij for comments too)

  • openning paragraph
  • 10.1.2 infinite-horizon
  • 10.8 side note
  • demote the DP notebox onto side note
  • differentiate between fix-policy Q_{\pi} and optimal Q^*
  • (in fact, typically small) remove
  • decide if to introduce sink/terminal state

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type
    No fields configured for issues without a type.

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions