Lukas' Notes

machine-learning learning-theory

Definition

Approximation Error

The approximation error is the gap between the risk of the best model a hypothesis class can express and the lowest risk attainable by any predictor. Writing

for the best-in-class model and for the Bayes risk, it is

It measures a limitation of itself, independent of any data: even with infinitely many samples, no model in can beat . A class too restrictive for the target leaves a large approximation error, the learning-theory counterpart of bias and the source of underfitting. It is reduced only by enlarging or changing , never by collecting more data.

The excess-risk decomposition

The risk of a learned model — the empirical-risk minimiser — above the Bayes optimum splits into a part the class cannot avoid and a part the finite sample adds:

The first term is the estimation error, driven by the sample and the capacity of ; the second is the approximation error above, fixed by the choice of class. Enriching lowers the approximation error but typically raises the estimation error, which is the bias-variance tradeoff stated at the level of risk.