Definition
Fixed direction differential fn
Aka directional derivative.
Fixing the direction
Alternate notation:
Affine approximation view
This definition of the directional derivative is equivalent to the defining
R to R case
In this special case, there is just one direction:
Directional differentiability
If, at
The differential of
So,
But, this is unsatisfactory as directional differentiability does not imply continuity. \why
Continuous differentiability
If at
Aka Frechet derivative, total derivative.
Connection to directional differentiability
In non pathological cases, both notions of differentiability are equivalent: This comes from applying the polynomial approximation theorem for
In the case of continuous differentiability, this follows from definition. In the case of directional differential functions, this can be seen using the polynomial approximation theorem for
Matrix functionals
Similar definition for differential functions for functionals over the vector space of matrices. Eg: See
Linearity
The differential operator
Note that this is separate from directional linearity.
Connection to partial derivatives
We suppose that linearity is established (simple in case of Frechet derivatives).
From linearity,
Notation
Note about representation
Note that, as explained there, ‘gradients’ are defined wrt to vectors - without differentiating between their representation as row or column vectors. Such representations are secondary to the correctness of their values, and can be altered as necessary for convenience of expression.
D(f) as a Vector field
Hence, the derivative operator
C1 smoothness
Differentiability vs smoothness
Gradient’s existence does not guarantee differentiability; derivative must exist in all directions - in an open ball around
In contour graph
Perpendicular to contours
Sublevel sets and gradient direction
Consider level-sets
In the plot
Take the plot
Subgradients at convex points
Extension of the gradient to non-differentiable functional f(x). See convex functional section.
Differential operator
Its general properties, including linearity, product rule and the chain rule, are considered under vector functions.
Derivatives of important functionals
For simplicity in remembering the rules it is easier to think in terms of the Differential operator, rather than the gradient (which is just
Linear functionals
Quadratic functionals
Proof
expanding
If
Higher order differential functions
Definition
Linear map from V
Take the differential functional
Similarly, kth order differential function
Differential operators, of which
Directional higher order differential fn
With
Multi-Linear map from \htext{ {V-k}}
Note that, as defined here,
So, using an isomorphism, it is convenient to view
Hence,
Similarly, kth order differential functions can be defined in general.
Properties
Symmetry
Wrt basis vectors
The notation
Tensor representation
Proof
By the distributive property of multilinear functions. This can also be proved by applying the chain rule, the directional linearity of the differential function and the linearity of the differential operator.
Similarly
2nd order case
In the 2nd order case, this is aka Hessian matrix.
This matrix is important in tests for convexity at a critical point.
Polynomial approximation
See the 1-D case in complex analysis ref.
Restrict
Polynomial approximation series
Aka Taylor series. Similarly, in the limit get:
Multi-index notation
Take
Connection with extreme values
See optimization ref.
Derivative matrix
Motivation using directional derivatives
For every functional
Arrangement as rows
So, due to the definition of the differential function of vector valued functions,
This is aka Jacobian matrix. Notation:
Note about dimensions
As explained in the case of derivatives of functionals, representations are secondary to the correctness of their values, and can be altered as necessary for convenience of expression. One must however pay attention to them to be consistent with other entities in the same algebraic expression.
Differential operator
Linearity follows from linearity of functional derivatives.
Row-valued functions
Sometimes, one encounters a function whose component functionals are arranged as a row vector
Product of functions
From scalar functional derivative product rule:
Composition of functions: chain rule
Directional differential functions
Take
Proof
We want
In matrix representation
In terms of derivative matrices, this is a matrix product:
(Observe how the dimensions match perfectly: for functional (function) compositions!)
Linear and constant functions
Non-triviality of inversion
COnsider
If J is square and M is invertible: