Gradient of a function
The gradient of a differentiable function contains the first derivatives of the function with respect to each variable. The gradient is useful to find the linear approximation of the function near a point.
Definition
The gradient of at , denoted , is the vector in given by
Examples:
- Distance function: The distance function from a point to another point is defined as
The function is differentiable, provided , which we assume. Then
- Log-sum-exp function: Consider the ‘‘log-sum-exp’’ function , with values
The gradient of at is
where . More generally, the gradient of the function with values
is given by
where , and .
Composition rule with an affine function
If is a matrix, and is a vector, the function with values
is called the composition of the affine map with with . Its gradient is given by
Geometric interpretation
Geometrically, the gradient can be read on the plot of the level set of the function. Specifically, at any point , the gradient is perpendicular to the level set and points outwards from the sub-level set (that is, it points towards higher values of the function).