Lukas' Notes

machine-learning statistics

Definition

Divergence Function

A divergence function is a function that measures how different two objects are, without necessarily satisfying all axioms of a metric.

For a set , a divergence function is usually a map

such that

The value measures the discrepancy between and . Unlike a metric, a divergence function may be asymmetric, may fail the triangle inequality, and may treat its two arguments differently.

Interpretation

A divergence is often used when one object is treated as the reference and the other as an approximation. In that case, means: how costly is it to use in place of ?

This is why the order may matter. A Kullback–Leibler divergence measures the cost of approximating by , and this is generally different from approximating by .

Comparison with Metrics

Every non-negative metric can be used as a divergence, but not every divergence is a metric.

PropertyMetricDivergence function
Non-negativeyesusually yes
Zero on identical objectsyesusually yes
Symmetricrequirednot required
Triangle inequalityrequirednot required

Thus a divergence is a broader notion of discrepancy. It keeps the idea of separation, but drops some geometric constraints when they are not appropriate for the problem.

Examples