MDP - igheyas/Reinforcement-Learning GitHub Wiki

7. Why MDPs Matter

Provide a rigorous foundation for sequential decision problems.

Underlie most RL algorithms (e.g. value iteration, Q-learning, policy gradients).

Once you can cast your problem as an MDP, you can apply standard solution methods to learn or compute an optimal policy.