Definition
Rate of change
The differential function, aka derivative, is \
Linear approximation view
Hence,
Other views and generalizations
See the derivatives of general functionals and functions in the vector spaces survey.
Existence: Differentiability
The the above limit exists at a certain point
Relationship with continuity
If
Also,
Smoothness
If for all
Differential operator
Definition
Consider the operator
Notation
So,
Below, represent the scalar functions
Higher order differentials
These are defined by
Other notations include:
Inverse
Directly from the definition,
Linearity
Hence, the differential operator
Other properties
If
Composition (chain rule)
Suppose
Parametrically defined functions
Suppose
Differentiation
Differentiation is the procedure of evaluating the differential operator for a certain function.
Differentiation tricks
Differentials of important functions
Ab initio differentiation of sin x, cos x.
Differentiation of powers and exponents
Other properties
Geometry
Geometry is described for the case of general functionals in the vector spaces survey.
Connection with extrema
If
Effect of uniform convergence
Take
If
Polynomial approximation of f
f(x): Mean change and the gradient
Interior extremum existance
(Rolle) If
Easy to make a visual argument.
Proof
There exists atleast one maximum and one minimum in [a, b]; if it happens to be in the interior set
Mean value theorem
If
(Thence, linear approximation to
Relative to another function
If
Proof
Suppose that
Definite integral view and the mean
As integration can be viewed as an extension of summation,
Polynomial approximation
Aka Taylor theorem.
Can then bound error term by bounding
Proof
We want to find
From mean value theorem wrt function
Associated series
In Pf, note that, in general,
The polynomial approximation series, aka Taylor series, is the polynomial approximation
McLaurin series: Taylor series about 0.
Importance
Polynomial approximation of functions described above is very important in analyzing the solutions to many problems. This is because one can use the nth degree approximation and upper bounds on
For example, this is used to prove that solutions to certain optimization problems which arise in doing maximum likelihood estimation have desirable properties. Also, several optimization algorithms work by minimizing the polynomial (quadratic in the case of Newton’s method) approximation to
Extensions
Can’t easily extend to general metric spaces by using a distance function
Extension to functionals
See vector spaces ref.