machine-learning statistics learning-theory

Definition

PAC Learning

Probably Approximately Correct (PAC) learning is a mathematical framework for quantifying the learnability of a hypothesis class $\mathcal{H}$. A class $\mathcal{H}$ is PAC-learnable if there exists a learning algorithm $A$ and a sample complexity function $m_{\mathcal{H}}(\epsilon, \delta)$ such that for any distribution $\mathcal{D}$ and any parameters $\epsilon, \delta \in (0, 1)$:

$$\Pr_{S \sim \mathcal{D}^m}\big[ L_{\mathcal{D}}(A(S)) \le \epsilon \big] \ge 1 - \delta$$

provided that the sample size $m \ge m_{\mathcal{H}}(\epsilon, \delta)$. The parameters define the rigor of the guarantee, as the simulation sketch after this list illustrates:

  • Approximately Correct ($\epsilon$): The returned hypothesis achieves a true error rate no greater than $\epsilon$.
  • Probably ($\delta$): This accuracy is achieved with probability at least $1 - \delta$ over the random draw of the training sample.
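A minimal simulation sketch of this guarantee, assuming a finite class of 101 threshold classifiers $h_t(x) = \mathbb{1}[x \ge t]$ on a grid, a uniform distribution on $[0, 1]$, and a realisable target; the class, the constants, and names such as `erm_threshold` are illustrative, not from the text. It estimates how often a consistent learner's true error exceeds $\epsilon$, which PAC says should happen with probability at most $\delta$:

```python
import numpy as np

rng = np.random.default_rng(0)

grid = np.linspace(0.0, 1.0, 101)   # finite hypothesis class: 101 thresholds (assumed)
true_t = grid[37]                   # target concept lies in the class (realisability)
epsilon, delta, m = 0.1, 0.05, 80   # accuracy, confidence, sample size (illustrative)

def erm_threshold(x, y):
    """Return a grid threshold with minimal empirical risk on the sample
    (with consistent data, this picks a zero-error hypothesis)."""
    errs = [(np.mean((x >= t).astype(int) != y), t) for t in grid]
    return min(errs)[1]

failures, trials = 0, 2000
for _ in range(trials):
    x = rng.uniform(0.0, 1.0, size=m)
    y = (x >= true_t).astype(int)
    t_hat = erm_threshold(x, y)
    # Under Uniform[0,1], the true error is the length of the interval
    # on which the two thresholds disagree.
    if abs(t_hat - true_t) > epsilon:
        failures += 1

print(f"empirical P(error > {epsilon}) = {failures / trials:.4f}  (PAC requires <= {delta})")
```

The observed failure rate should land well below $\delta = 0.05$, since $m = 80$ exceeds the finite-class sample complexity derived in the next section.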

Learnability of Finite Classes

If the hypothesis class $\mathcal{H}$ is finite and the realisability assumption holds, then $\mathcal{H}$ is PAC-learnable by any algorithm that returns a hypothesis consistent with the training set (i.e., one with zero empirical risk). The required sample complexity is bounded by:

$$m_{\mathcal{H}}(\epsilon, \delta) \le \left\lceil \frac{\ln(|\mathcal{H}|/\delta)}{\epsilon} \right\rceil$$

Derivation Intuition: Consider a “bad” hypothesis $h \in \mathcal{H}$ with $L_{\mathcal{D}}(h) > \epsilon$. The probability that $h$ is consistent with a single sample is at most $1 - \epsilon$. For $m$ independent samples, the probability that a bad hypothesis remains consistent is at most $(1 - \epsilon)^m$. By a union bound over the at most $|\mathcal{H}|$ bad hypotheses, ensuring that the probability of any bad hypothesis in $\mathcal{H}$ being consistent is less than $\delta$ requires $|\mathcal{H}|(1 - \epsilon)^m < \delta$. Using the inequality $1 - \epsilon \le e^{-\epsilon}$, this yields $|\mathcal{H}| e^{-\epsilon m} < \delta$, which solves to $m > \frac{\ln(|\mathcal{H}|/\delta)}{\epsilon}$, the bound above.
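Plugging in the illustrative numbers from the sketch above, a quick computation of this bound (the function name is my own):

```python
import math

def finite_class_sample_complexity(h_size: int, epsilon: float, delta: float) -> int:
    """Smallest m satisfying m >= ln(|H| / delta) / epsilon."""
    return math.ceil(math.log(h_size / delta) / epsilon)

print(finite_class_sample_complexity(101, 0.1, 0.05))   # -> 77
print(finite_class_sample_complexity(101, 0.01, 0.05))  # -> 762
```

Note that the dependence on $|\mathcal{H}|$ is only logarithmic, while tightening $\epsilon$ scales the sample size linearly in $1/\epsilon$.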

Learnability of Infinite Classes

For classes with infinite cardinality, such as linear separators in $\mathbb{R}^d$, PAC-learnability is determined by the VC dimension. A class is PAC-learnable if and only if its VC dimension is finite.
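As a concrete illustration of finite VC dimension, the following brute-force shattering check (a sketch; the class and helper names are assumptions, not from the text) shows that 1-D thresholds $h_t(x) = \mathbb{1}[x \ge t]$ shatter any single point but no pair of points, so their VC dimension is 1:

```python
def labelings_by_thresholds(points):
    """All labelings of `points` achievable by some threshold t."""
    srt = sorted(points)
    # One threshold below all points, one between each consecutive pair,
    # and one above all points covers every distinct behaviour of 1[x >= t].
    thresholds = [srt[0] - 1.0]
    thresholds += [(a + b) / 2 for a, b in zip(srt, srt[1:])]
    thresholds += [srt[-1] + 1.0]
    return {tuple(int(x >= t) for x in points) for t in thresholds}

def is_shattered(points):
    """True if thresholds realise all 2^|points| labelings of the points."""
    return len(labelings_by_thresholds(points)) == 2 ** len(points)

print(is_shattered([0.3]))        # True  -> VC dimension is at least 1
print(is_shattered([0.3, 0.7]))   # False -> the labeling (1, 0) is unachievable
```

A class like the thresholds above is therefore PAC-learnable despite containing uncountably many hypotheses, whereas a class that shatters arbitrarily large point sets is not.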