Lukas' Notes

reinforcement-learning

Definition

Discount Factor

The discount factor $\gamma \in [0, 1]$ weights future rewards in the return:

  $G_t = \sum_{k=0}^{\infty} \gamma^k R_{t+k+1}$

  • $\gamma = 0$: the agent is myopic, maximising only the immediate reward.
  • $\gamma \to 1$: the agent is farsighted, valuing distant rewards almost as much as immediate ones.
  • $\gamma < 1$ also ensures the return $G_t$ remains finite for unbounded or continuing tasks (given bounded rewards), acting as a soft horizon.
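
A minimal sketch of the three cases above, assuming a finite list of rewards stands in for a long continuing task. The function name `discounted_return` and the constant reward of 1 per step are illustrative choices, not part of these notes.

```python
def discounted_return(rewards, gamma):
    """Return sum_k gamma**k * rewards[k] for a finite reward sequence."""
    g = 0.0
    # Accumulate backwards using the recursion G_t = R_{t+1} + gamma * G_{t+1}.
    for r in reversed(rewards):
        g = r + gamma * g
    return g


# Hypothetical task: a reward of 1 at every step for 1000 steps.
rewards = [1.0] * 1000

# gamma = 0: only the immediate reward counts.
myopic = discounted_return(rewards, 0.0)

# gamma close to 1: distant rewards still contribute noticeably.
farsighted = discounted_return(rewards, 0.99)

# With rewards bounded by r_max, the return is bounded by r_max / (1 - gamma).
bound = 1.0 / (1.0 - 0.99)

print(myopic, farsighted, bound)
```

The bound $R_{\max} / (1 - \gamma)$ is the geometric-series limit, and $1 / (1 - \gamma)$ is often read as a rough effective horizon: with $\gamma = 0.99$, rewards more than a few hundred steps away contribute very little.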