Lukas' Notes

optimisation calculus machine-learning

Definition

First-order Optimisation Method

A first-order optimisation method is an optimisation method that chooses its updates using first-derivative information of an objective, usually the gradient, but not second-derivative curvature information such as the Hessian matrix. For a differentiable objective , a typical first-order step has the form

where is computed from , and is a learning rate. Gradient descent is the canonical example, with . The method reads the local slope of the objective, but it does not directly rescale directions by local curvature as a second-order optimisation method does.