Markov Decision Processes (MDP)

Pre-work

The pre-work for this class includes reading the pages from 47 until 56 and from 58 until 67 of Sutton, R. S., & Barto, A. G. (2018). Reinforcement learning: An introduction. MIT press.

In this class, we will introduce Markov Decision Processes (MDPs), which are a fundamental framework for modeling decision-making problems in reinforcement learning. We will cover the following topics:

Definition of MDPs
Examples of MDPs
Value functions and Optimal policies
Bellman equations