Lukas' Notes

reinforcement-learning

Stochastic Policy

Given a state space and an action space , a stochastic policy is a map

where is the set of probability distributions over . Equivalently,

is the probability of choosing action in state .