Advanced Calculus of Several Variables (1973)
Part VI. The Calculus of Variations
The calculus of variations deals with a certain class of maximum–minimum problems, which have in common the fact that each of them is associated with a particular sort of integral expression. The simplest example of such a problem is the following. Let f:
3 →
be a given
function, and denote by
the set of all real-valued
functions defined on the interval [a, b]. Given
, define
by
![]()
Then F is a real-valued function on
. The problem is then to find that element
, if any, at which the function
attains its maximum (or minimum) value, subject to the condition that φ has given preassigned values φ(a) = α and φ(b) = β at the endpoints of [a, b].
For example, if we want to find the
function φ : [a, b] →
whose graph x = φ(t) (in the tx-plane, Fig. 6.1) joins the points (a, α) and (b, β) and has minimal length, then we want to minimize the function
defined by
![]()
Here the function f :
3 →
is defined by
.
Note that in the above problem the “unknown” is a function
, rather than a point in
n (as in the maximum–minimum problems which we considered in Chapter II). The “function space”
, is, like
n, a vector space, albeit an infinite-dimensional one. The purpose of this chapter is to appropriately generalize the finite-dimensional methods of Chapter II so as to treat some of the standard problems of the calculus of variations.

Figure 6.1
In Section 3 we will show that, if
maximizes (or minimizes) the function
defined by (*), subject to the conditions φ(a) = α and φ(b) = β, then φ satisfies the Euler–Lagrange equation
![]()
The proof of this will involve a generalized version of the familiar technique of “setting the derivative equal to zero, and then solving for the unknown.”
In Section 4 we discuss the so-called “isoperimetric problem,” in which it is desired to maximize or minimize the function
![]()
subject to the conditions ψ(a) = α, ψ(b) = β and the “constraint”
![]()
Here f and g are given
functions on
3. We will see that, if
is a solution for this problem, then there exists a number
such that φ satisfies the Euler–Lagrange equation
![]()
for the function h :
3 →
defined by
![]()
This assertion is reminiscent of the constrained maximum–minimum problems of Section II.5, and its proof will involve a generalization of the Lagrange multiplier method that was employed there.
As preparation for these general maximum–minimum methods, we need to develop the rudiments of “calculus in normed vector spaces.” This will be done in Sections 1 and 2.
Chapter 1. NORMED VECTOR SPACES AND UNIFORM CONVERGENCE
The introductory remarks above indicate that certain typical calculus of variations problems lead to a consideration of “function spaces” such as the vector space
of all continuously differentiable functions defined on the interval [a, b]. It will be important for our purpose to define a norm on the vector space
, thereby making it into a normed vector space. As we will see in Section 2, this will make it possible to study real-valued functions on
by the methods of differential calculus.
Recall (from Section I.3) that a norm on the vector space V is a real-valued function x →
x
on V satisfying the following conditions:

for all
and
. The norm
x
of the vector
may be thought of as its length or “size.” Also, given
, the norm
x − y
of x − y may be thought of as the “distance” from x to y.
Example 1 We have seen in Section I.3 that each of the following definitions gives a norm on
n:

where
. It was (and is) immediate that
0 and
1 are norms on
n, while the verification that
2 satisfies the triangle inequality (Condition N3) required the Cauchy–Schwarz inequality (Theorem I.3.1).
Any two norms on
n are equivalent in the following sense. The two norms
1 and
2 on the vector space V are said to be equivalent if there exist positive numbers a and b such that
![]()
for all
. We leave it as an easy exercise for the reader to check that this relation between norms on V is an equivalence relation (in particular, if the norms
1 and
2 on V are equivalent, and also
2 and
3 are equivalent, then it follows that
1 and
3 are equivalent norms on V).
We will not include here the full proof that any two norms on
n are equivalent. However let us show that any continuous norm
on
n (that is,
x
is a continuous function of x) is equivalent to the Euclidean norm
2. Since
is continuous on the unit sphere
![]()
there exist positive numbers m and M such that
![]()
for all
. Given
, choose a > 0 such that x = ay with
. Then the above inequality gives
![]()
so
![]()
as desired. In particular, the sup norm and the 1-norm of Example 1 are both equivalent to the Euclidean norm, since both are obviously continuous. For the proof that every norm on
n is continuous, see Exercise 1.7.
Equivalent norms on the vector space V give rise to equivalent notions of sequential convergence in V. The sequence
of points of V is said to converge to
with respect to the norm
if and only if
![]()
Lemma 1.1 Let
1 and
2 be equivalent norms on the vector space V. Then the sequence
converges to
with respect to the norm
1 if and only if it converges to x with respect to the norm
2.
This follows almost immediately from the definitions; the easy proof is left to the reader.
Example 2 Let V be the vector space of all continuous real-valued functions on the closed interval [a, b], with the vector space operations defined by
![]()
where
and
. We can define various norms on V, analogous to the norms on
n in Example 1, by

Again it is an elementary matter to show that
0 and
1 are indeed norms on V, while the verification that
2 satisfies the triangle inequality requires the Cauchy–Schwarz inequality (see the remarks following the proof of Theorem 3.1 in Chapter I). We will denote by
the normed vector space V with the sup norm
0.
No two of the three norms in Example 2 are equivalent. For example, to show that
0 and
1 are not equivalent, consider the sequence
of elements of V defined by

and let φ0(x) = 0 for
(here the closed interval [a, b] is the unit interval [0, 1]). Then it is clear that
converges to φ0 with respect to the 1-norm
1 because
![]()
However
does not converge to φ0 with respect to the sup norm
0 because
![]()
for all n. It therefore follows from Lemma 1.1 that the norms
0 and
1 are not equivalent.
Next we want to prove that
, the vector space of Example 2 with the sup norm, is complete. The normed vector space V is called complete if every Cauchy sequence of elements of V converges to some element of V. Just as in
n, the sequence
is called a Cauchy sequence if, given
> 0, there exists a positive integer N such that
![]()
These definitions are closely related to the following properties of sequences of functions. Let
be a sequence of real-valued functions, each defined on the set S. Then
is called a uniformly Cauchy sequence (of functions on S) if, given
> 0, there exists N such that
![]()
for every
. We say that
converges uniformly to the function f: S →
if, given
> 0, there exists N such that
![]()
for every
.
Example 3 Consider the sequence
, where fn : [0, 1] →
is the function whose graph is shown in Fig. 6.2. Then limn→∞ fn(x) = 0 for every
. However the sequence
does not converge uniformly to its pointwise limit f(x) ≡ 0, because
fn
= 1 for every n (sup norm).

Figure 6.2
Example 4 Let fn(x) = xn on [0, 1]. Then
![]()
so the pointwise limit function is not continuous. It follows from Theorem 1.2 below that the sequence {fn} does not converge uniformly to f.
Comparing the above definitions, note that a Cauchy sequence in
is simply a uniformly Cauchy sequence of continuous functions on [a, b], and that the sequence
converges with respect to the sup norm to
if and only if it converges uniformly to φ. Therefore, in order to prove that
is complete, we need to show that every uniformly Cauchy sequence of continuous functions on [a, b] converges uniformly to some continuous function on [a, b]. The first step is the proof that, if a sequence of continuous functions converges uniformly, then the limit function is continuous.
Theorem 1.2 Let S be a subset of
k. Let
be a sequence of continuous real-valued functions on S. If
converges uniformly to the function f: S →
, then f is continuous.
PROOF Given
> 0, first choose N sufficiently large that
fN(x) − f(x)
<
/3 for all
(by the uniform convergence). Then, given
, choose δ > 0 such that
![]()
(by the continuity of fN at x0). If
and
x − x0
< δ, then it follows that

so f is continuous at x0.
![]()
Corollary 1.3
is complete.
PROOF Let
be a Cauchy sequence of elements of
, that is, a uniformly Cauchy sequence of continuous real-valued functions on [a, b]. Given
> 0, choose N such that
![]()
Then, in particular,
φm(x) − φn(x)
<
/2 for each
. Therefore
is a Cauchy sequence of real numbers, and hence converges to some real number φ(x). It remains to show that the sequence of functions {φn} converges uniformly to φ; if so, Theorem 1.2 will then imply that
.
We assert that
(same N as above, n fixed) implies that
![]()
To see this, choose
sufficiently large (depending upon x) that
φ(x) − φm(x)
<
/2. Then it follows that

Since
was arbitrary, it follows that
φn − φ
<
as desired.
![]()
Example 5 Let
denote the vector space of all continuously differentiable real-valued functions on [a, b], with the vector space operations of
, but with the “
-norm” defined by
![]()
for
. We leave it as an exercise for the reader to verify that this does define a norm on
.
In Section 4 we will need to know that the normed vector space
is complete. In order to prove this, we need a result on the termwise-differentiation of a sequence of continuously differentiable functions on [a, b]. Suppose that the sequence
converges (pointwise) to f, and that the sequence of derivatives
converges to g. We would like to know whether f′ = g. Note that this is the question as to whether
![]()
an “interchange of limits” problem of the sort considered at the end of Section IV.3. The following theorem asserts that this interchange of limits is valid, provided that the convergence of the sequence of derivatives is uniform.
Theorem 1.4 Let
be a sequence of continuously differentiable real-valued functions on [a, b], converging (pointwise) to f. Suppose that the
converges uniformly to a function g. Then
converges uniformly to f, and f is differentiable, with f′ = g.
PROOF By the fundamental theorem of calculus, we have
![]()
for each n and each
. From this and Exercise IV.3.4 (on the termwise-integration of a uniformly convergent sequence of continuous functions) we obtain

Another application of the fundamental theorem yields f′ = g as desired.
To see that the convergence of
to f is uniform, note that

The uniform convergence of the sequence
therefore follows from that of the sequence
.
![]()
Corollary 1.5
is complete.
PROOF Let
be a Cauchy sequence of elements of
. Since
![]()
we see that
is a uniformly Cauchy sequence of continuous functions. It follows from Corollary 1.3 that
converges uniformly to a continuous function φ : [a, b] →
. Similarly
![]()
so it follows, in the same way from Corollary 1.3, that
converges uniformly to a continuous function ψ : [a, b] →
. Now Theorem 1.4 implies that φ is differentiable with φ′ = ψ, so
. Since

the uniform convergence of the sequences
and
implies that the sequence
converges to φ with respect to the
-norm of
. Thus every Cauchy sequence in
converges.
![]()
Example 6 Let
denote the vector space of all
paths in
n (mappings φ : [a, b] →
n), with the norm
![]()
where
0 denotes the sup norm in
n. Then it follows, from a coordinatewise application of Corollary 1.5, that the normed vector space
is complete.
Exercises
1.1Verify that
0,
1, and
2, as defined in Example 2, are indeed norms on the vector space of all continuous functions defined on [a, b].
1.2Show that the norms
1 and
2 of Example 2 are not equivalent. Hint: Truncate the function
near 0.
1.3Let E and F be normed vector spaces with norms
E and
F, respectively. Then the the product set E × F is a vector space, with the vector space operations defined coordinatewise. Prove that the following are equivalent norms on E × F:

1.4If the normed vector spaces E and F are complete, prove that the normed vector space E × F, of the previous exercise, is also complete.
1.5Show that a closed subspace of a complete normed vector space is complete.
1.6Denote by
n[a, b] the subspace of
that consists of all polynomials of degree at most n. Show that
n[a, b] is a closed subspace of
. Hint: Associate the polynomial
with the point
, and compare
φ
0 with
a
.
1.7If
is a norm on
n, prove that the function x →
x
is continuous on
n. Hint: If
![]()
then
![]()
The Cauchy-Schwarz inequality then gives
![]()
where
and
is the Euclidean norm on
n.