Advanced Calculus of Several Variables (1973)
Part VI. The Calculus of Variations
Chapter 4. THE ISOPERIMETRIC PROBLEM
In this section we treat the so-called isoperimetric problem that was mentioned in the introduction to this chapter. Given functions f, g : 3 → , we wish to maximize or minimize the function
subject to the endpoint conditions ψ(a) = α, ψ(b) = β and the constraint
If, as in Section 3, we denote by M the hyperplane in consisting of all those functions ψ : [a, b] → such that ψ(a) = α and (b) = β, then our problem is to locate the local extrema of the function F on the set .
The similarity between this problem and the constrained maximum–minimum problems of Section II.5 should be obvious—the only difference is that here the functions F and G are defined on the infinite-dimensional normed vector space , rather than on a finite-dimensional Euclidean space. So our method of attack will be to appropriately generalize the method of Lagrange multipliers so that it will apply in this context.
First let us recall Theorem II.5.5 in the following form. Let F, G : n → be functions such that G(0) = 0 and ∇G(0) ≠ 0. If F has a local maximum or local minimum at 0 subject to the constraint G(x) = 0, then there exists a number λ such that
Since the differentials dF0, dG0 : n → are given by
Eq. (3) can be rewritten
where Λ : → is the linear function defined by Λ(t) = λt.
Equation (4) presents the Lagrange multiplier method in a form which is suitable for generalization to normed vector spaces (where differentials are available, but gradient vectors are not). For the proof we will need the following elementary algebraic lemma.
Lemma 4.1 Let α and β be real-valued linear functions on the vector space E such that
Then there exists a linear function Λ : → such that α = Λ β. That is, the following diagram of linear functions “commutes.”
PROOF Given , pick such that β(x) = t, and define
In order to show that Λ is well defined, we must see that, if y is another element of E with β(y) = t, then α(x) = α(y). But if β(x) = β(y) = t, then so α(x − y) = 0, which immediately implies that α(x) = α(y).
If β(x) = s and β(y) = t, then
so Λ is linear.
The following theorem states the Lagrange multiplier method in the desired generality.
Theorem 4.2 Let F and G be real-valued functions on the complete normed vector space E, with G(0) = 0 and dG0 ≠ 0 (so Im dG0 = ). If F : E → has a local extremum at 0 subject to the constraint G(x) = 0, then there exists a linear function λ : → such that
Of course the statement, that “F has a local extremum at 0 subject to G(x) = 0,” means that the restriction FG−1(0) has a local extremum at .
PROOF This will follow from Lemma 4.1, with α = dF0 and β = dG0, if we can prove that Ker dF0 contains Ker dG0. In order to do this, let us first assume the fact (to be established afterward) that, given , there exists a differentiable path γ : (−, ) → E whose image lies in G−1(0), such that γ(0) = 0 and γ′(0) = v (see Fig. 6.6).
Then the composition h = F γ : (−, ) → has a local extremum at 0, so h′(0) = 0. The chain rule therefore gives
as desired.
We will use the implicit function theorem to verify the existence of the differentiable path γ used above. If X = Ker dG0 then, since dG0 : E → is continuous, X is a closed subspace of E, and is therefore complete (by Exercise 1.5). Choose such that dG0(w) = 1, and denote by Y the closed subspace of E consisting of all scalar multiples of w; then Y is a “copy” of .
It is clear that . Also, if and , then
so . Therefore
with and . Thus E is the algebraic direct sum of the subspaces X and Y. Moreover, it is true (although we omit the proof) that the norm on E is equivalent to the product norm on X × Y, so we may write E = X × Y.
In order to apply the implicit function theorem, we need to know that
Figure 6.6
is an isomorphism. Since Y ≈ , we must merely show that dy G0 ≠ 0. But, given , we have
by Exercise 2.6, so the assumption that dy G0 = 0 would imply that dG0 = 0, contrary to hypothesis.
Consequently the implicit function theorem provides a function φ : X → Y whose graph y = φ(x) in X × Y = E coincides with G−1(0), inside some neighborhood of 0. If H(x) = G(x, φ(x)), then H(x) = 0 for x near 0, so
for all . It therefore follows that dφ0 = 0, because dy G0 is an isomorphism.
Finally, given , define γ : → E by γ(t) = (tu, φ(tu)). Then γ(0) = 0 and for t sufficiently small, and
as desired.
We are now prepared to deal with the isoperimetric problem. Let f and g be real-valued functions on 3, and define the real-valued functions F and G on by
and
where . Assume that φ is a element of at which F has a local extremum on , where M is the usual hyperplane in that is determined by the endpoint conditions (a) = α and (b) = β.
We have seen (in Section 3) that M is the translate (by any fixed element of M) of the subspace of consisting of these elements such that (a) = (b) = 0. Let be the translation defined by
and note that T(0) = φ, while
is the identity mapping.
Now consider the real-valued functions F T and G T on . The fact that F has a local extremum on at φ implies that F T has a local extremum at 0 subject to the condition G T() = 0.
Let us assume that φ is not an extremal for G on M, that is, that
so d(G T)0 ≠ 0. Then Theorem 4.2 applies to give a linear function Λ : → such that
Since dT0 is the identity mapping on C01[a, b], the chain rule gives
on . Writing Λ(t) = λt and applying the computation of Corollary 3.2 for the differentials dFφ and dGφ, we conclude that
for all .
If h : 3 → is defined by
it follows that
for all . An application of Lemma 3.3 finally completes the proof of the following theorem.
Theorem 4.3 Let F and G be the real-valued functions on defined by (6) and (7), where f and g are functions on 3. Let be a function which is not an extremal for G. If F has a local extremum at φ subject to the conditions
then there exists a real number λ such that φ satisfies the Euler–Lagrange equation for the function h = f − λg, that is,
for all .
The following application of this theorem is the one which gave such constraint problems in the calculus of variations their customary name—isoperimetric problems.
Example Suppose φ : [a, b] → is that nonnegative function (if any) with φ(a) = φ(b) = 0 whose graph x = φ(t) has length L, such that the area under its graph is maximal. We want to prove that the graph x = φ(t) must be an arc of
Figure 6.7
a circle (Fig. 6.7). If f(x, y, t) = x and , then φ maximizes the integral
subject to the conditions
Since , the Euler-Lagrange equation (8) is
or
This last equation just says that the curvature of the curve t → (t, φ(t)) is the constant 1/λ. Its image must therefore be part of a circle.
The above discussion of the isoperimetric problem generalizes in a straightforward manner to the case in which there is more than one constraint. Given functions f, g1, . . . , gk: 3 → , we wish to minimize or maximize the function
subject to the endpoint conditions (a) = α, (b) = β and the constraints
Our problem then is to locate the local extrema of F on , where M is the usual hyperplane in and
The result, analogous to Theorem 4.3, is as follows.
Let be a function which is not an extremal for any linear combination of the functions G1, . . . , Gk. If F has a local extremum at φ subject to the conditions
then there exist numbers λ1, . . . , λk such that φ satisfies the Euler–Lagrange equation for the function
Inclusion of the complete details of the proof would be repetitious, so we simply outline the necessary alterations in the proof of Theorem 4.3.
First Lemma 4.1 and Theorem 4.2 are slightly generalized as follows. In Lemma 4.1 we take β to be a linear mapping from E to k with Im β = k, and in Theorem 4.2 we take G to be a mapping from E to k such that G(0) = 0 and Im dG0 = k. The only other change is that, in the conclusion of each, Λ becomes a real-valued linear function on k. The proofs remain essentially the same.
We then apply the generalized Theorem 4.2 to the mappings and , defined by (9) and (10), in the same way that the original (Theorem 4.2) was applied (in the proof of Theorem 4.3) to the functions defined by (6) and (7). The only additional observation needed is that, if φ is not an extremal for any linear combination of the component functions G1, . . . , Gk, then it follows easily that dGφ maps TMφ onto k. We then conclude as before that
for some linear function Λ : k → . Writing , we conclude that
for all . An application of Lemma 3.3 then implies that φ satisfies the Euler–Lagrange equation for .
Exercises
4.1Consulting the discussion at the end of Section 3, generalize the isoperimetric problem to the vector-valued case as follows: Let f, g : n × n × → be given functions, and suppose φ : [a, b] → n is an extremal for
subject to the conditions (a) = α, (b) = β and
Then show under appropriate conditions that, for some number λ, the path φ satisfies the Euler–Lagrange equations
for the function h = f − λg.
4.2Let φ : [a, b] → 2 be a closed curve in the plane, φ(a) = φ(b), and write φ(t) = (x(t), y(t)). Apply the result of the previous problem to show that, if φ maximizes the area integral
subject to
then the image of φ is a circle.
4.3With the notation and terminology of the previous problem, establish the following reciprocity relationship. The closed path φ is an extremal for the area integral, subject to the arclength integral being constant, if and only if φ is an extremal for the arclength integral subject to the area integral being constant. Conclude that, if φ has minimal length amongst curves enclosing a given area, then the image of φ is a circle.
4.4Formulate (along the lines of Exercise 3.9) a necessary condition that φ : [a, b] → minimize
subject to
This is the isoperimetric problem with second derivatives.
4.5Suppose that describes (in polar coordinates) a closed curve of length L that encloses maximal area. Show that it is a circle by maximizing
subject to the condition
4.6A uniform flexible cable of fixed length hangs between two fixed points. If it hangs in such a way as to minimize the height of its center of gravity, show that its shape is that of a catenary (see Example 2 of Section 3). Hint: Note that Exercise 3.2 applies.
4.7If a hanging flexible cable of fixed length supports a horizontally uniform load, show that its shape is that of a parabola.