NORMED VECTOR SPACES AND UNIFORM CONVERGENCE - The Calculus of Variations - Advanced Calculus of Several Variables

Advanced Calculus of Several Variables (1973)

Part VI. The Calculus of Variations

The calculus of variations deals with a certain class of maximum–minimum problems, which have in common the fact that each of them is associated with a particular sort of integral expression. The simplest example of such a problem is the following. Let f: Image3Image be a given Image function, and denote by Image the set of all real-valued Image functions defined on the interval [a, b]. Given Image, define Image by

Image

Then F is a real-valued function on Image. The problem is then to find that element Image, if any, at which the function Image attains its maximum (or minimum) value, subject to the condition that φ has given preassigned values φ(a) = α and φ(b) = β at the endpoints of [a, b].

For example, if we want to find the Image function φ : [a, b] → Image whose graph x = φ(t) (in the tx-plane, Fig. 6.1) joins the points (a, α) and (b, β) and has minimal length, then we want to minimize the function Imagedefined by

Image

Here the function f : Image3Image is defined by Image.

Note that in the above problem the “unknown” is a function Image, rather than a point in Imagen (as in the maximum–minimum problems which we considered in Chapter II). The “function space” Image, is, like Imagen, a vector space, albeit an infinite-dimensional one. The purpose of this chapter is to appropriately generalize the finite-dimensional methods of Chapter II so as to treat some of the standard problems of the calculus of variations.

Image

Figure 6.1

In Section 3 we will show that, if Image maximizes (or minimizes) the function Image defined by (*), subject to the conditions φ(a) = α and φ(b) = β, then φ satisfies the Euler–Lagrange equation

Image

The proof of this will involve a generalized version of the familiar technique of “setting the derivative equal to zero, and then solving for the unknown.”

In Section 4 we discuss the so-called “isoperimetric problem,” in which it is desired to maximize or minimize the function

Image

subject to the conditions ψ(a) = α, ψ(b) = β and the “constraint”

Image

Here f and g are given Image functions on Image3. We will see that, if Image is a solution for this problem, then there exists a number Image such that φ satisfies the Euler–Lagrange equation

Image

for the function h : Image3Image defined by

Image

This assertion is reminiscent of the constrained maximum–minimum problems of Section II.5, and its proof will involve a generalization of the Lagrange multiplier method that was employed there.

As preparation for these general maximum–minimum methods, we need to develop the rudiments of “calculus in normed vector spaces.” This will be done in Sections 1 and 2.

Chapter 1. NORMED VECTOR SPACES AND UNIFORM CONVERGENCE

The introductory remarks above indicate that certain typical calculus of variations problems lead to a consideration of “function spaces” such as the vector space Image of all continuously differentiable functions defined on the interval [a, b]. It will be important for our purpose to define a norm on the vector space Image, thereby making it into a normed vector space. As we will see in Section 2, this will make it possible to study real-valued functions on Image by the methods of differential calculus.

Recall (from Section I.3) that a norm on the vector space V is a real-valued function xImage x Image on V satisfying the following conditions:

Image

for all Image and Image. The norm ImagexImage of the vector Image may be thought of as its length or “size.” Also, given Image, the norm Imagex − yImage of x − y may be thought of as the “distance” from x to y.

Example 1 We have seen in Section I.3 that each of the following definitions gives a norm on Imagen:

Image

where Image. It was (and is) immediate that Image Image0 and Image Image1 are norms on Imagen, while the verification that Image Image2 satisfies the triangle inequality (Condition N3) required the Cauchy–Schwarz inequality (Theorem I.3.1).

Any two norms on Imagen are equivalent in the following sense. The two norms Image Image1 and Image Image2 on the vector space V are said to be equivalent if there exist positive numbers a and b such that

Image

for all Image. We leave it as an easy exercise for the reader to check that this relation between norms on V is an equivalence relation (in particular, if the norms Image Image1 and Image Image2 on V are equivalent, and also Image Image2 and Image Image3 are equivalent, then it follows that Image Image1 and Image Image3 are equivalent norms on V).

We will not include here the full proof that any two norms on Imagen are equivalent. However let us show that any continuous norm Image Image on Imagen (that is, ImagexImage is a continuous function of x) is equivalent to the Euclidean norm Image Image2. Since Image Image is continuous on the unit sphere

Image

there exist positive numbers m and M such that

Image

for all Image. Given Image, choose a > 0 such that x = ay with Image. Then the above inequality gives

Image

so

Image

as desired. In particular, the sup norm and the 1-norm of Example 1 are both equivalent to the Euclidean norm, since both are obviously continuous. For the proof that every norm on Imagen is continuous, see Exercise 1.7.

Equivalent norms on the vector space V give rise to equivalent notions of sequential convergence in V. The sequence Image of points of V is said to converge to Image with respect to the norm Image Image if and only if

Image

Lemma 1.1 Let Image Image1 and Image Image2 be equivalent norms on the vector space V. Then the sequence Image converges to Image with respect to the norm Image Image1 if and only if it converges to x with respect to the norm Image Image2.

This follows almost immediately from the definitions; the easy proof is left to the reader.

Example 2 Let V be the vector space of all continuous real-valued functions on the closed interval [a, b], with the vector space operations defined by

Image

where Image and Image. We can define various norms on V, analogous to the norms on Imagen in Example 1, by

Image

Again it is an elementary matter to show that Image Image0 and Image Image1 are indeed norms on V, while the verification that Image Image2 satisfies the triangle inequality requires the Cauchy–Schwarz inequality (see the remarks following the proof of Theorem 3.1 in Chapter I). We will denote by Image the normed vector space V with the sup norm Image Image0.

No two of the three norms in Example 2 are equivalent. For example, to show that Image Image0 and Image Image1 are not equivalent, consider the sequence Image of elements of V defined by

Image

and let φ0(x) = 0 for Image (here the closed interval [a, b] is the unit interval [0, 1]). Then it is clear that Image converges to φ0 with respect to the 1-norm Image Image1 because

Image

However Image does not converge to φ0 with respect to the sup norm Image Image0 because

Image

for all n. It therefore follows from Lemma 1.1 that the norms Image Image0 and Image Image1 are not equivalent.

Next we want to prove that Image, the vector space of Example 2 with the sup norm, is complete. The normed vector space V is called complete if every Cauchy sequence of elements of V converges to some element of V. Just as in Imagen, the sequence Image is called a Cauchy sequence if, given Image > 0, there exists a positive integer N such that

Image

These definitions are closely related to the following properties of sequences of functions. Let Image be a sequence of real-valued functions, each defined on the set S. Then Image is called a uniformly Cauchy sequence (of functions on S) if, given Image > 0, there exists N such that

Image

for every Image. We say that Image converges uniformly to the function f: SImage if, given Image > 0, there exists N such that

Image

for every Image.

Example 3 Consider the sequence Image, where fn : [0, 1] → Image is the function whose graph is shown in Fig. 6.2. Then limn→∞ fn(x) = 0 for every Image. However the sequence Image does not converge uniformly to its pointwise limit f(x) ≡ 0, because ImagefnImage = 1 for every n (sup norm).

Image

Figure 6.2

Example 4 Let fn(x) = xn on [0, 1]. Then

Image

so the pointwise limit function is not continuous. It follows from Theorem 1.2 below that the sequence {fn} does not converge uniformly to f.

Comparing the above definitions, note that a Cauchy sequence in Image is simply a uniformly Cauchy sequence of continuous functions on [a, b], and that the sequence Image converges with respect to the sup norm to Image if and only if it converges uniformly to φ. Therefore, in order to prove that Image is complete, we need to show that every uniformly Cauchy sequence of continuous functions on [a, b] converges uniformly to some continuous function on [a, b]. The first step is the proof that, if a sequence of continuous functions converges uniformly, then the limit function is continuous.

Theorem 1.2 Let S be a subset of Imagek. Let Image be a sequence of continuous real-valued functions on S. If Image converges uniformly to the function f: SImage, then f is continuous.

PROOF Given Image > 0, first choose N sufficiently large that ImagefN(x) − f(x)Image < Image/3 for all Image (by the uniform convergence). Then, given Image, choose δ > 0 such that

Image

(by the continuity of fN at x0). If Image and Imagex − x0Image < δ, then it follows that

Image

so f is continuous at x0.

Image

Corollary 1.3 Image is complete.

PROOF Let Image be a Cauchy sequence of elements of Image, that is, a uniformly Cauchy sequence of continuous real-valued functions on [a, b]. Given Image > 0, choose N such that

Image

Then, in particular, Imageφm(x) − φn(x)Image < Image/2 for each Image. Therefore Image is a Cauchy sequence of real numbers, and hence converges to some real number φ(x). It remains to show that the sequence of functions {φn} converges uniformly to φ; if so, Theorem 1.2 will then imply that Image.

We assert that Image (same N as above, n fixed) implies that

Image

To see this, choose Image sufficiently large (depending upon x) that Imageφ(x) − φm(x)Image < Image/2. Then it follows that

Image

Since Image was arbitrary, it follows that ImageφnφImage < Image as desired.

Image

Example 5 Let Image denote the vector space of all continuously differentiable real-valued functions on [a, b], with the vector space operations of Image, but with the “Image-norm” defined by

Image

for Image. We leave it as an exercise for the reader to verify that this does define a norm on Image.

In Section 4 we will need to know that the normed vector space Image is complete. In order to prove this, we need a result on the termwise-differentiation of a sequence of continuously differentiable functions on [a, b]. Suppose that the sequence Image converges (pointwise) to f, and that the sequence of derivatives Image converges to g. We would like to know whether f′ = g. Note that this is the question as to whether

Image

an “interchange of limits” problem of the sort considered at the end of Section IV.3. The following theorem asserts that this interchange of limits is valid, provided that the convergence of the sequence of derivatives is uniform.

Theorem 1.4 Let Image be a sequence of continuously differentiable real-valued functions on [a, b], converging (pointwise) to f. Suppose that the Image converges uniformly to a function g. Then Image converges uniformly to f, and f is differentiable, with f′ = g.

PROOF By the fundamental theorem of calculus, we have

Image

for each n and each Image. From this and Exercise IV.3.4 (on the termwise-integration of a uniformly convergent sequence of continuous functions) we obtain

Image

Another application of the fundamental theorem yields f′ = g as desired.

To see that the convergence of Image to f is uniform, note that

Image

The uniform convergence of the sequence Image therefore follows from that of the sequence Image.

Image

Corollary 1.5 Image is complete.

PROOF Let Image be a Cauchy sequence of elements of Image. Since

Image

we see that Image is a uniformly Cauchy sequence of continuous functions. It follows from Corollary 1.3 that Image converges uniformly to a continuous function φ : [a, b] → Image. Similarly

Image

so it follows, in the same way from Corollary 1.3, that Image converges uniformly to a continuous function ψ : [a, b] → Image. Now Theorem 1.4 implies that φ is differentiable with φ′ = ψ, so Image. Since

Image

the uniform convergence of the sequences Image and Image implies that the sequence Image converges to φ with respect to the Image-norm of Image. Thus every Cauchy sequence in Image converges.

Image

Example 6 Let Image denote the vector space of all Image paths in Imagen (mappings φ : [a, b] → Imagen), with the norm

Image

where Image Image0 denotes the sup norm in Imagen. Then it follows, from a coordinatewise application of Corollary 1.5, that the normed vector space Image is complete.

Exercises

1.1Verify that Image Image0, Image Image1, and Image Image2, as defined in Example 2, are indeed norms on the vector space of all continuous functions defined on [a, b].

1.2Show that the norms Image Image1 and Image Image2 of Example 2 are not equivalent. Hint: Truncate the function Image near 0.

1.3Let E and F be normed vector spaces with norms Image ImageE and Image ImageF, respectively. Then the the product set E × F is a vector space, with the vector space operations defined coordinatewise. Prove that the following are equivalent norms on E × F:

Image

1.4If the normed vector spaces E and F are complete, prove that the normed vector space E × F, of the previous exercise, is also complete.

1.5Show that a closed subspace of a complete normed vector space is complete.

1.6Denote by Imagen[a, b] the subspace of Image that consists of all polynomials of degree at most n. Show that Imagen[a, b] is a closed subspace of Image. Hint: Associate the polynomial Image with the point Image, and compare ImageφImage0 with ImageaImage.

1.7If Image Image is a norm on Imagen, prove that the function xImagexImage is continuous on Imagen. Hint: If

Image

then

Image

The Cauchy-Schwarz inequality then gives

Image

where Image and Image Image is the Euclidean norm on Imagen.