Bayes Optimal Classifier

machine-learning statistics learning-theory

Definition

Bayes Optimal Classifier

The Bayes optimal classifier is the theoretically best possible classification model for a given data distribution $P (X, Y)$ . Formally, for an input $x \in X$ , the Bayes optimal prediction $f^{*} (x)$ is obtained by selecting the label $y \in Y$ that maximises the posterior probability:

$f^{*} (x) = ar g max_{y \in Y} P (y ∣ x)$

This classifier achieves the minimum possible true risk, known as the Bayes risk. For any other hypothesis $g : X \to Y$ , the risk $R (f^{*}) \leq R (g)$ holds.

Optimality and Risk

Bayes Risk: The expected error rate of the Bayes optimal classifier. It represents the irreducible error caused by the overlap of class distributions in the feature space (stochasticity).

Relation to Empirical Methods: While the true distribution $P (y ∣ x)$ is typically unknown, machine learning algorithms seek to approximate the Bayes optimal classifier by either directly estimating the conditional distribution (Discriminative Learning) or modelling the joint distribution (Generative Learning).

Lukas' Notes

Bayes Optimal Classifier

Definition

Optimality and Risk

Graph View

Table of Contents

Backlinks