MDP - igheyas/Reinforcement-Learning GitHub Wiki
7. Why MDPs Matter
Provide a rigorous foundation for sequential decision problems.
Underlie most RL algorithms (e.g. value iteration, Q-learning, policy gradients).
Once you can cast your problem as an MDP, you can apply standard solution methods to learn or compute an optimal policy.