THE INFINITUDE OF THE NUMBER SYSTEM. MATHEMATICAL INDUCTION - THE NATURAL NUMBERS - What Is Mathematics? An Elementary Approach to Ideas and Methods, 2nd Edition (1996)

What Is Mathematics? An Elementary Approach to Ideas and Methods, 2nd Edition (1996)

CHAPTER I. THE NATURAL NUMBERS

*§2. THE INFINITUDE OF THE NUMBER SYSTEM. MATHEMATICAL INDUCTION

1. The Principle of Mathematical Induction

There is no end to the sequence of integers 1, 2, 3, 4, · · ·; for after any integer n has been reached we may write the next integer, n + 1. We express this property of the sequence of integers by saying that there are infinitely many integers. The sequence of integers represents the simplest and most natural example of the mathematical infinite, which plays a dominant rôle in modern mathematics. Everywhere in this book we shall have to deal with collections or “sets” containing infinitely many mathematical objects, like the set of all points on a line or the set of all triangles in a plane. The infinite sequence of integers is the simplest example of an infinite set.

The step by step procedure of passing from n to n + 1 which generates the infinite sequence of integers also forms the basis of one of the most fundamental patterns of mathematical reasoning, the principle of mathematical induction. “Empirical induction” in the natural sciences proceeds from a particular series of observations of a certain phenomenon to the statement of a general law governing all occurrences of this phenomenon. The degree of certainty with which the law is thereby established depends on the number of single observations and confirmations. This sort of inductive reasoning is often entirely convincing; the prediction that the sun will rise tomorrow in the east is as certain as anything can be, but the character of this statement is not the same as that of a theorem proved by strict logical or mathematical reasoning.

In quite a different way mathematical induction is used to establish the truth of a mathematical theorem for an infinite sequence of cases, the first, the second, the third, and so on without exception. Let us denote by A a statement that involves an arbitrary integer n. For example, A may be the statement, “The sum of the angles in a convex polygon of n + 2 sides is n times 180 degrees.” Or A′ may be the assertion, “By drawing n lines in a plane we cannot divide the plane into more than 2n parts.” To prove such a theorem for every integer n it does not suffice to prove it separately for the first 10 or 100 or even 1000 values of n. This indeed would correspond to the attitude of empirical induction. Instead, we must use a method of strictly mathematical and non-empirical reasoning whose character will be indicated by the following proofs for the special examples A and A′. In the case A, we know that for n = 1 the polygon is a triangle, and from elementary geometry the sum of the angles is known to be 1 · 180°. For a quadrilateral, n = 2, we draw a diagonal which divides the quadrilateral into two triangles. This shows immediately that the sum of the angles of the quadrilateral is equal to the sum of the angles in the two triangles, which yields 180° + 180° = 2 · 180°. Proceeding to the case of a pentagon with 5 edges, n = 3, we decompose it into a triangle plus a quadrilateral. Since the latter has the angle sum 2 · 180°, as we have just proved, and since the triangle has the angle sum 180°, we obtain 3 · 180 degrees for the 5-gon. Now it is clear that we can proceed indefinitely in the same way, proving the theorem for n = 4, then for n = 5, and so on. Each statement follows in the same way from the preceding one, so that the general theorem A can be established for all n.

Similarly we can prove the theorem A′. For n = 1 it is obviously true, since a single line divides the plane into 2 parts. Now add a second line. Each of the previous parts will be divided into two new parts, unless the new line is parallel to the first. In either case, for n = 2 we have not more than 4 = 22 parts. Now we add a third line. Each of the previous domains will either be cut into two parts or be left untouched. Thus the sum of parts is not greater than 22.2 = 23. Knowing this to be true, we can prove the next case in the same way, and so on indefinitely.

The essential idea in the preceding arguments is to establish a general theorem A for all values of n by successively proving a sequence of special cases, A1, A2, · · ·. The possibility of doing this depends on two things: a) There is a general method for showing that if any statement Ar, is true then the next statement, Ar+1, will also be true. b) The first statement A1 is known to be true. That these two conditions are sufficient to establish the truth of all the statements A1, A2, A3, · · · is a logical principle which is as fundamental to mathematics as are the classical rules of Aristotelian logic. We formulate it as follows:

Let us suppose that we wish to establish a whole infinite sequence of mathematical propositions

A1, A2, A3, · · ·

which together constitute the general proposition A. Suppose that a) by some mathematical argument it is shown that if r is any integer and if the assertion Ar is known to be true then the truth of the assertion Ar+1 will follow, and that b) the first proposition A1 is known to be true. Then all the propositions of the sequence must be true, and A is proved.

We shall not hesitate to accept this, just as we accept the simple rules of ordinary logic, as a basic principle of mathematical reasoning. For we can establish the truth of every statement An, starting from the given assertion b) that A1 is true, and proceeding by repeated use of the assertion a) to establish successively the truth of A2, A3, A4, and so on until we reach the statement An. The principle of mathematical induction thus rests on the fact that after any integer r there is a next, r + 1, and that any desired integer n may be reached by a finite number of such steps, starting from the integer 1.

Often the principle of mathematical induction is applied without explicit mention, or is simply indicated by a casual “etc.” or “and so on.” This is especially frequent in elementary instruction. But the explicit use of an inductive argument is indispensable in more subtle proofs. We shall give a few illustrations of a simple but not quite trivial character.

2. The Arithmetical Progression

For every value of n, the sum 1 + 2 + 3 + · · · + n of the first n integers is equal to image. In order to prove this theorem by mathematical induction we must show that for every n the assertion An:

(1)   image

is true. a) We observe that if r is an integer and if the statement Ar, is known to be true, i.e. if it is known that

image

then by adding the number (r + 1) to both sides of this equation we obtain the equation

image

which is precisely the statement Ar+1. b) The statement A1 is obviously true, image. Hence, by the principle of mathematical induction, the statement An is true for every n, as was to be proved.

Ordinarily this is shown by writing the sum 1 + 2 + 3 + · · · + n in two forms:

Sn = 1 + 2 + · · · + (n –1) + n

and

Sn = n + (n –1) + · · · + 2 + 1.

On adding, we see that each pair of numbers in the same column yields the sum n + 1, and, since there are n columns in all, it follows that

2Sn = n(n + 1),

which proves the desired result.

From (1) we may immediately derive the formula for the sum of the first (n + 1) terms of any arithmetical progression,

(2) image

For

image

For the case a = 0, d = 1, this is equivalent to (1).

3. The Geometrical Progression

One may treat the general geometrical progression in a similar way. We shall prove that for every value of n

(3) image

(We suppose that q ≠ 1, since otherwise the right side of (3) has no meaning.)

Certainly this assertion is true for n = 1, for then it states that

image

And if we assume that

image

then we find as a consequence that

image

But this is precisely the assertion (3) for the case n = r + 1. This completes the proof.

In elementary textbooks the usual proof proceeds as follows. Set

Gn = a + aq + · · · + aqn,

and multiply both sides of this equation by q, obtaining

qGn = aq + aq2 + · · · + aqn+1.

Now subtract corresponding sides of this equation from the preceding equation, obtaining

image

4. The Sum of the First n Squares

A further interesting application of the principle of mathematical induction refers to the sum of the first n squares. By direct trial one finds that, at least for small values of n,

(4) image

and one might guess that this remarkable formula is valid for all integers n. To prove this, we shall again use the principle of mathematical induction. We begin by observing that if the assertion An, which in this case is the equation (4), is true for the case n = r, so that

image

then on adding (r + 1)2 to both sides of this equation we obtain

image

which is precisely the assertion Ar+1 in this case, since it is obtained by substituting r + 1 for n in (4). To complete the proof we need only remark that the assertion A1, in this case the equation

image

is obviously true. Hence the equation (4) is true for every n.

Formulas of a similar sort may be found for higher powers of the integers, 1k + 2k + 3k + · · · + nk, where k is any positive integer As an exercise, the reader may prove by mathematical induction that

(5)  image

It should be remarked that although the principle of mathematical induction suffices to prove the formula (5) once this formula has been written down, the proof gives no indication of how this formula was arrived at in the first place; why precisely the expression [n(n + 1)/2]2 should be guessed as an expression for the sum of the first n cubes, rather than [n(n + 1)/3]2 or (19n2 – 41n + 24)/2 or any of the infinitely many expressions of a similar type that could have been considered. The fact that the proof of a theorem consists in the application of certain simple rules of logic does not dispose of the creative element in mathematics, which lies in the choice of the possibilities to be examined. The question of the origin of the hypothesis (5) belongs to a domain in which no very general rules can be given; experiment, analogy, and constructive intuition play their part here. But once the correct hypothesis is formulated, the principle of mathematical induction is often sufficient to provide the proof. Inasmuch as such a proof does not give a clue to the act of discovery, it might more fittingly be called a verification.

*5. An Important Inequality

In a subsequent chapter we shall find use for the inequality

(6) (1 + p)n ≥ 1 + np,

which holds for every number p > –1 and positive integer n. (For the sake of generality we are anticipating here the use of negative and non-integral numbers by allowing p to be any number greater than –1. The proof for the general case is exactly the same as in the case where p is a positive integer.) Again we use mathematical induction.

a) If it is true that (1 + p)r ≥ 1 + rp, then on multiplying both sides of this inequality by the positive number 1 + p, we obtain

(1 + p)r+1 ≥ 1 + rp + p + rp2

Dropping the positive term rp2 only strengthens this inequality, so that

(1 + p)r+1 ≥ 1 + (r + 1)p,

which shows that the inequality (6) will also hold for the next integer, r + 1. b) It is obviously true that (1 + p)1 ≥ 1 + p. This completes the proof that (6) is true for every n. The restriction to numbers p > –1 is essential. If p < –1, then 1 + p is negative and the argument in a) breaks down, since if both members of an inequality are multiplied by a negative quantity, the sense of the inequality is reversed. (For example, if we multiply both sides of the inequality 3 > 2 by –1 we obtain –3 > –2, which is false.)

*6. The Binomial Theorem

Frequently it is important to have an explicit expression for the nth power of a binomial, (a + b)n. We find by explicit calculation that

for n = 1, (a + b)1 = a + b,

for n = 2, (a + b)2 = (a + b)(a + b) = a(a + b) + b(a + b) = a2 + 2ab + b2,

for n = 3, (a + b)3 = (a + b)(a + b)2 = a(a2 + 2ab+ b2) + b(a2 + 2ab + b2) = a3 + 3a2b + 3ab2 + b3,

and so on. What general law of formation lies behind the words “and so on”? Let us examine the process by which (a + b)2 was computed. Since (a + b)2 = (a+ b)(a + b), we obtained the expression for (a + b)2 by multiplying each term in the expression a + b by a, then by b, and adding. The same procedure was used to calculate (a + b)3 = (a + b)(a + b)2. We may continue in the same way to calculate (a + b)4, (a + b)5, and so on indefinitely. The expression for (a + b)n will be obtained by multiplying each term of the previously obtained expression for (a + b)n-1 by a, then by b, and adding. This leads to the following diagram:

image

which gives at once the general rule for forming the coefficients in the expansion of (a + b)n. We construct a triangular array of numbers, starting with the coefficients 1, 1 of a + b, and such that each number of the triangle is the sum of the two numbers on each side of it in the preceding row. This array is known as Pascal’s Triangle.

image

The nth row of this array gives the coefficients in the expansion of (a + b)n in descending powers of a and ascending powers of b; thus

(a + b)7 = a7 + 7a6b + 21a5b2+ 35a4b3+ 35a3b4+ 21a2b5+ 7ab6 + b7.

Using a concise subscript and superscript notation we may denote the numbers in the nth row of Pascal’s Triangle by

image

Then the general formula for. (a + b)n may be written

(7) image

According to the law of formation of Pascal’s Triangle, we have

(8)    image

As an exercise, the experienced reader may use this relation, together with the fact that image, to show by mathematical induction that

(9) image

(For any positive integer n, the symbol n! (read, “n factorial”) denotes the product of the first n integers: n! = 1·2·3 · · · n. It is convenient also to define 0! = 1, so that 9) is valid for i = 0 and i = n.) This explicit formula for the coefficients in the binomial expansion is sometimes called thebinomial theorem. (See also p. 475.)

Exercises: Prove by mathematical induction:

1) image

2) image

*3) image

*4) image

Find the sum of the following geometrical progressions:

5) image

6) image

7) image

Using formulas (4) and (5) prove:

*8) image

*9) 13 + 33 + ··· + (2n + 1)3 = (n + 1)2(2n2 + 4n + 1)

10) Prove the same results directly by mathematical induction.

*7. Further Remarks on Mathematical Induction

The principle of mathematical induction may be generalized slightly to read: “If a sequence of statements As, As+1, As+2, · · · is given, where s is some positive integer, and if

a) For every value of r ≥ s, the truth of Ar+1 will follow from the truth of Ar, and

b) As, is known to be true,

then all the statements As, As+1, As+2· · · are true; that is to say, An is true for all ns.” Precisely the same reasoning used to establish the truth of the ordinary principle of mathematical induction applies here, with the sequence 1, 2, 3, · · · replaced by the similar sequence s, s + 1, s + 2, s + 3 · · ·. By using the principle in this form we can strengthen somewhat the inequality on page 15 by eliminating the possibility of the “=” sign. We state: For every p ≠ 0 and > –1 and every integer n ≥ 2,

(10)    (1 + p)n > 1 + np.

The proof will be left to the reader.

Closely related to the principle of mathematical induction is the “principle of the smallest integer” which states that every non-empty set C of positive integers has a smallest member. A set is empty if it has no members, e.g., the set of straight circles or the set of integers n such that n > n. For obvious reasons we exclude such sets in the statement of the principle. The set C may be finite, like the set 1, 2, 3, 4, 5, or infinite, like the set of all even numbers 2, 4, 6, 8, 10, · · ·. Any non-empty set C must contain at least one integer, say n, and the smallest of the integers 1, 2, 3, · · ·,n that belongs to C will be the smallest integer in C.

The only way to realize the significance of this principle is to observe that it does not apply to every set C of numbers that are not integers; for example, the set of positive fractions image does not contain a smallest member.

From the point of view of logic it is interesting to observe that the principle of the smallest integer may be used to prove the principle of mathematical induction as a theorem. To this end, let us consider any sequence of statements A1, A2, A3, · · · such that

a) For any positive integer r the truth of Ar+1 will follow from that of Ar.

b) A1 is known to be true.

We shall show the hypothesis that any one of the A′s is false to be untenable. For if even one of the A′s were false, the set C of all positive integers n for which An is false would be non-empty. By the principle of the smallest integer, C would contain a smallest integer, p, which must be > 1 because of b). Hence Ap would be false, but Ap-1 true. This contradicts a)..

Once more we emphasize that the principle of mathematical induction is quite distinct from empirical induction in the natural sciences. The confirmation of a general law in any finite number of cases, no matter how large, cannot provide a proof for the law in the rigorous mathematical sense of the word, even if no exception is known at the time. Such a law would remain only a very reasonable hypothesis, subject to modification by the results of future experience. In mathematics, a law or a theorem is proved only if it can be shown to be a necessary logical consequence of certain assumptions which are accepted as valid. There are many examples of mathematical statements which have been verified in every particular case considered thus far, but which have not yet been proved to hold in general (for an example see p. 30). One may suspect that a theorem is true in all generality by observing its truth in a number of examples; one may then attempt to prove it by mathematical induction. If the attempt succeeds the theorem is proved to be true; if the attempt fails, the theorem may be true or false and may some day be proved or disproved by other methods.

In using the principle of mathematical induction one must always be sure that the conditions a) and b) are really satisfied. Neglect of this precaution may lead to absurdities like the following, in which the reader is invited to discover the fallacy. We shall “prove” that any two positive integers are equal; for example, that 5 = 10.

First a definition: If a and b are two unequal positive integers, we define max (a, b) to be a or b, whichever is greater; if a = b we set max (a, b) = a = b. Thus max (3, 5) = max (5, 3) = 5, while max (4, 4) = 4. Now let An be the statement, “If a and b are any two positive integers such that max (a, b) = n, then a = b.”

a) Suppose Ar to be true. Let a and b be any two positive integers such that max (a, b) = r + 1. Consider the two integers

α = a – 1

β = b – 1;

then max (α, β) = r. Hence α = β, for we are assuming Ar to be true. It follows that a = b; hence Ar+1 is true.

b) A1 is obviously true, for if max (a, b) = 1, then since a and b are by hypothesis positive integers they must both be equal to 1. Therefore, by mathematical induction, An is true for every n.

Now if a and b are any two positive integers whatsoever, denote max (a, b) by r. Since An has been shown to be true for every n, in particular Ar is true. Hence a = b.