Lukas' Notes

analysis optimisation convex-analysis

Definition

Subgradient

Let be a convex function on a convex set , and let . A subgradient of at is a vector such that

The affine function is a global lower supporting plane for at . The set of all such vectors is the subdifferential .

Derivation

For a differentiable convex function, the tangent line at lies below the whole function:

The gradient is therefore the slope of a global affine lower bound. At a corner, there is no single tangent slope, but there may be many supporting slopes. A subgradient keeps exactly the part of the gradient that survives the corner: it is any slope whose affine line still stays below the function everywhere.

This is why the formula is forced. Given convexity, the world is shaped like a bowl; asking for a first-order linear approximation that never cuts through the bowl gives the inequality

The vector is not merely a local hint. It defines a global supporting hyperplane to the epigraph of the function.

Relation to the Gradient

If is a differentiable convex function at , then the subdifferential contains exactly one vector:

So, for differentiable convex functions, the subgradient is the gradient.

The converse intuition needs care. A differentiable non-convex function has a gradient, but that gradient need not satisfy the global subgradient inequality. In that setting, the gradient is a local slope, not necessarily a subgradient in the convex-analysis sense.

Optimisation Meaning

For convex minimisation, a subgradient points toward increase in the linear lower model. Therefore is used as a descent direction in subgradient methods.

The optimality condition is especially clean:

At a smooth minimum, this says . At a corner minimum, it says that the zero vector lies among the supporting slopes.

Example

For on ,

At the corner , every slope between and defines a line that touches the graph at the origin and stays below everywhere.