Epsilon soft policy - HiIAmTzeKean/SC3000-Artificial-Intelligence GitHub Wiki
tags:
- 🌱
- AI
- ComputerScience date: 20--Feb--2023
Related to Soft policy
Different from the greedy approach where each action has at least
\begin{cases}
p=1-\epsilon+\frac{\epsilon}{|A|}, & \alpha=\alpha^* \
p=\frac{\epsilon}{|A|}, & \alpha\ne\alpha^*
\end{cases}$$
Links: