Definition
Advantage Estimate
An advantage estimate is an empirical approximation of the advantage function , computed from sampled trajectories.
Common estimators:
The estimate enters the policy gradient as the weight multiplying .
Advantage Estimate
An advantage estimate is an empirical approximation of the advantage function , computed from sampled trajectories.
Common estimators:
The estimate enters the policy gradient as the weight multiplying .