Definition
On-Policy Value Function
The on-policy value function gives the expected return if you start in state and always act according to policy :
On-Policy Value Function
The on-policy value function gives the expected return if you start in state and always act according to policy :