Part 1: The Total Derivative
To this point, we have considered only partial derivatives of a
function f( x,y) . In this section, we introduce the concept of
a total derivative of a transformation.
Recall that a function f(x) of one variable is said to be differentiable
at p if
the following limit exists:
|
f' (p) = |
lim
h ® 0
|
|
f(p+h) - f( p)
h
|
|
| (1) |
The problem
with extending (1) to higher dimensions is that it requires h to
become a vector, and yet we cannot "divide" by a vector. However, we can
rewrite (1) in the form
|
|
lim
h ® 0
|
 |
f(p+h) - f( p)
h
|
- f' (p) |
 |
= 0 |
|
|
and thus, we obtain also that
|
|
lim
h ® 0
|
 |
f(p+h) - f( p)
h
|
- |
f' (p) h
h |
 |
= 0 |
|
|
Consequently, the definition of the derivative (1) can be re-interpreted to mean that
if f(x) is differentiable at p if there is a number f'(p) such that
|
|
lim
h ® 0
|
| f(p+h) - f( p)
- f' (p)
h |
|h|
|
|
|
= 0 |
|
|
In this formulation, division is by
|h|, which naturally generalizes to the norm of a vector h =
á h, k
ñ. Indeed, if we denote f( x,y) by f(x) , where x =
á x,y
ñ, and let p = ( p,q) be a point, then we have
the following:
Definition 5.1: A function f( x,y) is differentiable at a point ( p,q) if there exists a vector Ñf( p) for which
|
|
lim
h® 0
|
|
| f(p+h) - f(p) - Ñ
f( p) · h|
||
h||
|
= 0 |
| (2) |
When it exists, Ñf( p) is the total
derivative of f( x) at p.
The vector Ñf( p) is also called the
gradient of f( x) at p.
If we write Ñf( p) =
á a,b
ñ , then we can determine the values of a and b by evaluating the limit
in (2) in two different directions. Along the
x-axis, h =
á h,0
ñ and thus we have
|
|
lim
h® 0
|
|
| f(p+h,q) - f(p,q) - á a,b
ñ ·
á h,0
ñ|
||
á h,0
ñ ||
|
= 0 |
| |
|
|
lim
h ® 0
|
 |
f(p+h,q) - f( p,q)
- ah
h
|
 |
= 0 |
|
|
|
|
lim
h ® 0
|
 |
f(p+h,q) - f( p,q)
h
|
- |
a |
 |
= 0 |
|
|
This implies that
|
a = |
lim
h ® 0
|
|
f(p+h) - f( p)
h
|
= fx(p,q) |
| (1) |
Similarly, (see the exercises), it can be shown that b = fy(p,q),
so that the gradient is given by
|
Ñf( p,q) =
á fx( p,q) ,fy(p,q)
ñ |
|
This yields the following:
Theorem 5.2: If f( x,y) is differentiable at a point
( p,q) , then its total derivative is given by the
gradient of f at ( p,q) , which is
|
Ñf( p,q) =
á fx( p,q), fy(p,q)
ñ |
|
The total derivative is also known as the Jacobian of the
mapping f( x,y) for reasons that will become apparent in the
next chapter.
EXAMPLE 1 Find the total derivative (i.e., gradient) of
Solution: Since fx = 2x+3y and fy = 3x, the total
derivative is
Definition 5.1 can be applied to a function f of any number of
variables, in which case Theorem 5.2 says that Ñf is the vector of
first partial derivatives. For example, the gradient of a function of 3
variables U( x,y,z) is given by
and ÑU is the total derivative of U( x,y,z) .
Check your Reading: Why was the product rule not used in evaluating
fx(x,y) in example 1?