Reinforcement Learning

Definition

Reinforcement Learning

Reinforcement learning is a paradigm of machine learning where an autonomous agent learns to make decisions by interacting with an environment to maximise a cumulative reward signal. Formally, the problem is modelled as a Markov Decision Process defined by $(S, A, P, R, γ)$ .

Formalism

The learning environment is defined by the state space $S$ and action space $A$ , with transition probabilities $P (s^{'} ∣ s, a)$ and an immediate reward function $R (s, a, s^{'})$ . The learner’s objective is to identify an optimal policy $π^{*} : S \to A$ that maximises the expected discounted return $G_{t} = \sum_{k = 0}^{\infty} γ^{k} r_{t + k + 1}$ , where $γ \in [0, 1]$ is the discount factor.

Lukas' Notes

Reinforcement Learning

Table of Contents

Definition

Formalism

Backlinks