This class covers the main techniques of reinforcement learning:

  • Markov decision processes
  • Dynamic programming
  • Online evaluation
  • Online control
  • Gradient methods
  • Bandit algorithms