This class covers the main techniques of reinforcement learning:

  • Markov decision processes
  • Dynamic programming
  • Online prediction
  • Online control
  • Bandit algorithms
  • Monte-Carlo tree search