This course presents the main concepts and algorithms of reinforcement learning:
- Markov decision processes
- Dynamic programming (policy iteration, value iteration)
- Online control (Q-learning, Monte-Carlo Tree Search)
- Bandit algorithms
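
As an illustration of the dynamic programming topic listed above, here is a minimal sketch of value iteration on a toy Markov decision process. The two-state MDP, the function name `value_iteration`, and the parameters `gamma`, `tol` and `max_iter` are assumptions made for this example; they are not taken from the course material.

```python
def value_iteration(P, R, gamma=0.9, tol=1e-10, max_iter=100_000):
    """Value iteration for a finite MDP (illustrative sketch).

    P[s][a][t] is the probability of moving from state s to state t
    under action a; R[s][a] is the expected immediate reward.
    """
    n_states = len(R)
    n_actions = len(R[0])

    def q_value(V, s, a):
        # One-step lookahead: immediate reward plus discounted next value.
        return R[s][a] + gamma * sum(P[s][a][t] * V[t] for t in range(n_states))

    V = [0.0] * n_states
    for _ in range(max_iter):
        # Bellman optimality update applied to every state.
        V_new = [max(q_value(V, s, a) for a in range(n_actions))
                 for s in range(n_states)]
        delta = max(abs(x - y) for x, y in zip(V_new, V))
        V = V_new
        if delta < tol:
            break

    # Greedy policy with respect to the final value estimate.
    policy = [max(range(n_actions), key=lambda a: q_value(V, s, a))
              for s in range(n_states)]
    return V, policy


# Hypothetical toy MDP: action 0 stays in place, action 1 switches state.
# Only staying in state 1 yields a reward (of 1).
P = [
    [[1.0, 0.0], [0.0, 1.0]],  # from state 0: stay, switch
    [[0.0, 1.0], [1.0, 0.0]],  # from state 1: stay, switch
]
R = [
    [0.0, 0.0],
    [1.0, 0.0],
]

V, policy = value_iteration(P, R)
print(V, policy)
```

With a discount factor of 0.9, the values converge to approximately 9 for state 0 and 10 for state 1, and the greedy policy is to switch in state 0 and stay in state 1.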
Teacher: Thomas Bonald