FUNCTIONS - A Book of Abstract Algebra

A Book of Abstract Algebra, Second Edition (1982)

Chapter 6. FUNCTIONS

The concept of a function is one of the most basic mathematical ideas and enters into almost every mathematical discussion. A function is generally defined as follows: If A and B are sets, then a function from A to B is a rule which to every element x in A assigns a unique element y in B. To indicate this connection between x and y we usually write y = f(x), and we call y the image of x under the function f.

There is nothing inherently mathematical about this notion of function. For example, imagine A to be a set of married men and B to be the set of their wives. Let f be the rule which to each man assigns his wife. Then f is a perfectly good function from A to B; under this function, each wife is the image of her husband. (No pun is intended.)

Take care, however, to note that if there were a bachelor in A then f would not qualify as a function from A to B; for a function from A to B must assign a value in B to every element of A, without exception. Now, suppose the members of A and B are Ashanti, among whom polygamy is common; in the land of the Ashanti, f does not necessarily qualify as a function, for it may assign to a given member of A several wives. If/is a function from A to B, it must assign exactly one image to each element of A.

If f is a function from A to B it is customary to describe it by writing

f : A → B

The set A is called the domain of f. The range of f is the subset of B which consists of all the images of elements of A. In the case of the function illustrated here, {a, b, c} is the domain of f, and {x, y} is the

range of f(z is not in the range of f). Incidentally, this function f may be represented in the simplified notation

This notation is useful whenever A is a finite set: the elements of A are listed in the top row, and beneath each element of A is its image.

It may perfectly well happen, if f is a function from A to B, that two or more elements of A have the same image. For example, if we look at the function immediately above, we observe that a and b both have the same image x. If f is a function for which this kind of situation does not occur, then f is called an injective function. Thus,

Definition 1 A function f : A → B is called injective if each element of B is the image of no more than one element of A.

The intended meaning, of course, is that each element y in B is the image of no two distinct elements of A. So if

that is, x_l and x₂ have the same image y, we must require that x_l be equal to x₂. Thus, a convenient definition of “injective” is this: a function f : A →B is injective if and only if

f(x₁) = f(x₂) impliesx₁ = x₂

If f is a function from A to B, there may be elements in B which are not images of elements of A. If this does not happen, that is, if every element of B is the image of some element of A, we say that f is surjective.

Definition 2 A function f : A → B is called surjective if each element of B is the image of at least one element of A.

This is the same as saying that B is the range of f.

Now, suppose that f is both injective and surjective. By Definitions 1 and 2, each element of B is the image of at least one element of A, and no more than one element of A. So each element of B is the image of exactly oneelement of A. In this case, f is called a bijective function, or a one-to-one correspondence.

Definition 3 A function f : A →B is called bijective if it is both injective and surjective.

It is obvious that under a bijective function, each element of A has exactly one “partner” in B and each element of B has exactly one partner in A.

The most natural way of combining two functions is to form their “composite.” The idea is this: suppose f is a function from A to B, and g is a function from B to C. We apply f to an element x in A and get an element y in B; then we apply g to y and get an element z in C. Thus, z is obtained from x by applying f and g in succession. The function which

consists of applying f and g in succession is a function from A to C, and is called the composite of f and g. More precisely,

Let f : A → B and g : B → C be functions. The composite function denoted by g ∘ f is a function from A to C defined as follows:

[g ∘ f](x) = g(f(x)) for every x ∊ A

For example, consider once again a set A of married men and the set B of their wives. Let C be the set of all the mothers of members of B. Let f : A →B be the rule which to each member of A assigns his wife, and g : B → C the rule which to each woman in B assigns her mother. Then g ∘ f is the “mother-in-law function,” which assigns to each member of A his wife’s mother:

For another, more conventional, example, let f and g be the following functions from to : f(x) = 2x; g(x) = x + 1. (In other words, f is the rule “multiply by 2” and g is the rule “add 1.”) Their composites are the functions g ∘ fand f ∘ g given by

[f ∘ g](x) = f(g(x)) = 2(x + 1)

and

[g ∘ f](x) = g(f(x)) = 2x + 1

f ∘ g and g ∘ f are different: f ∘ g is the rule “add 1, then multiply by 2,” whereas g ∘ fis the rule “multiply by 2 and then add 1.”

It is an important fact that the composite of two injective functions is injective, the composite of two surjective functions is surjective, and the composite of two bijective functions is bijective. In other words, if f : A → B and g : B → C are functions, then the following are true:

If f and g are injective, then g ∘ f is injective.

If f and g are surjective, then g ∘ f is surjective.

If f and g are bijective, then g ∘ f is bijective.

Let us tackle each of these claims in turn. We will suppose that f and g are injective, and prove that g ∘ fis injective. (That is, we will prove that if [g ∘ f](x) = [g ∘ f](y), then x = y.)

Suppose [g ∘ f](x) = [g ∘ f](y), that is,

g(f(x)) = g(f(y))

Because g is injective, we get

f(x) = f(y)

and because f is injective,

x = y

Next, let us suppose that f and g are surjective, and prove that g ∘ f is surjective. What we need to show here is that every element of C is g ∘ f of some element of A. Well, if z ∈ C, then (because g is surjective) x = g(y) for some y ∈ B; but f is surjective, so y = f(x) for some x ∈ A. Thus,

z = g(y) = g(f(x)) = [g ∘ f](x)

Finally, if f and g are bijective, they are both injective and surjective. By what we have already proved, g ∘ f is injective and surjective, hence bijective.

A function f from A to B may have an inverse, but it does not have to. The inverse of f, if it exists, is a function f⁻¹ (“ f inverse”) from B to A such that

x = f⁻¹(y) if and only if y = f(x)

Roughly speaking, if f carries x to y then f⁻^l carries y to x. For instance, returning (for the last time) to the example of a set A of husbands and the set B of their wives, if f : A → B is the rule which to each husband assigns his wife, then f⁻^l : B → A is the rule which to each wife assigns her husband:

If we think of functions as rules, then f⁻¹ is the rule which undoes what ever f does. For instance, if f is the real-valued function f(x) = 2x, then f⁻¹ is the function f⁻¹(x) = x/2 [or, if preferred, f⁻¹(y) = y/2]. Indeed, the rule “divide by 2” undoes what the rule “multiply by 2” does.

Which functions have inverses, and which others do not? If f, a function from A to B, is not injective, it cannot have an inverse; for “not injective” means there are at least two distinct elements x₁ and x₂ with the same image y;

Clearly, x₁ = f⁻¹(y) and x₂ = f⁻¹(y) so f⁻^l(y) is ambiguous (it has two different values), and this is not allowed for a function.

If f, a function from A to B is not surjective, there is an element y in B which is not an image of any element of A; thus f⁻^l(y) does not exist. So f⁻¹ cannot be a function from B (that is, with domain B) to A.

It is therefore obvious that if f⁻^l exists, f must be injective and surjective, that is, bijective. On the other hand, if f is a bijective function from A to B, its inverse clearly exists and is determined by the rule that if y = f(x) then f⁻¹(y) = x.

Furthermore, it is easy to see that the inverse of f is also a bijective function. To sum up:

A function f : A → B has an inverse if and only if it is bijective.

In that case, the inverse f⁻^l is a bijective function from B to A.

EXERCISES

A. Examples of Injective and Surjective Functions

Each of the following is a function f : → . Determine

(a) whether or not f is injective, and

(b) whether or not f is surjective.

Prove your answer in either case.

Example 1 f(x) = 2x

f is injective.

PROOF Suppose f(a) = f(b), that is,

2a = 2b

Then

a = b

Therefore f is injective. ■

f is surjective.

PROOF Take any element y ∈. Then y = 2(y/2) = f(y/2).

Thus, every y ∈ is equal to f(x) for x = y/2.

Therefore f is surjective. ■

Example 2 f(x) = x²

f is not injective.

PROOF By exhibiting a counterexample: f(2) = 4 f(−2), although 2 ≠ −2.■

f is not surjective.

PROOF By exhibiting a counterexample: −1 is not equal to f(x) for any x ∈.■

1f(x) = 3x + 4

2f(x) = x³ + 1

3f(x) = |x|

# 4f(x) = x³ − 3x

# 6

7Determine the range of each of the functions in parts 1 to 6.

B. Functions on and

Determine whether each of the functions listed in parts 1−4 is or is not (a) injective and (b) surjective. Proceed as in Exercise A.

1f : →(0, ∞), defined by f(x) = e^x.

2f : (0,1)→, defined by f(x) = tan x.

3f : →, defined by f(x) = the least integer greater than or equal to x.

4f : →, defined by

5Find a bijective function f from the set of the integers to the set E of the even integers.

C. Functions on Arbitrary Sets and Groups

Determine whether each of the following functions is or is not (a) injective and (b) surjective. Proceed as in Exercise A.

In parts 1 to 3, A and B are sets, and A × B denotes the set of all the ordered pairs (x,y) as x ranges over A and y over B.

1f : A × B → A,defined by f(x,y) = x.

2f : A × B → B × A, defined by f(x, y) = (y, x).

3f : A × B,defined by f(x) = (x, b), where b is a fixed element of B.

4G is a group, a ∈ G, and f : G → G is defined by f(x) = ax.

5G is a group and f : G → G is defined by f(x) = x⁻¹.

6G is a group and f : G → G is defined by f(x) = x².

D. Composite Functions

In parts 1−3 find the composite function, as indicated.

1f : → is defined by f(x) = sin x.

g : → is defined by g(x) = e^x.

Find f ∘ g and g ∘ f.

2A and B are sets; f: A × B → B × A is given by f(x, y) = (y, x).

g : B × A → B is given by g(y, x) = y.

Find g ∘ f.

3f : (0,1)→ is defined by f(x) = 1 /x.

g : → is defined by g(x) = In x.

Find g ∘ f. Explain why f ∘ g is undefined.

4In school, Jack and Sam exchanged notes in a code f which consists of spelling every word backwards and interchanging every letter s with t. Alternatively, they use a code g which interchanges the letters a with o, i with u, e with y, and s with t. Describe the codes f ∘ g and g ∘ f. Are they the same?

5A = {a, b, c, d}; f and g are functions from A to A; in the tabular form described on page 57, they are given by

Give f ∘ g and g ∘ f in the same tabular form.

6G is a group, and a and b are elements of G.

f : G → G is defined by f(x) = ax.

g : G → G is defined by g(x) = bx.

Find f ∘ g and g ∘ f.

7Indicate the domain and range of each of the composite functions you found in parts 1 to 6.

E. Inverses of Functions

Each of the following functions f is bijective. Describe its inverse.

1 f : (0, ∞)→ (0, ∞), defined by f(x) = 1 /x.

2 f : → (0, ∞), defined by f(x) = e^x.

3 f : →, defined by f(x) = x³ + 1.

4 f : →, defined by

5 A = {a, b, c, d}, B = {1, 2, 3, 4} and f : A → B is given by

6 G is a group, a ∈ G, and f: G → G is defined by f(x) = ax.

F. Functions on Finite Sets

1 The members of the U.N. Peace Committee must choose, from among themselves, a presiding officer of their committee. For each member x, let f(x) designate that member’s choice for officer. If no two members vote alike, what is the range of f?

2 Let A be a finite set. Explain why any injective function f : A → A is necessarily surjective. (Look at part 1.)

3 If A is a finite set, explain why any surjective function f : A → A is necessarily injective.

4 Are the statements in parts 2 and 3 true when A is an infinite set? If not, give a counterexample.

# 5 If A has n elements, how many functions are there from A to A? How many bijective functions are there from A to A?

G. Some General Properties of Functions

In parts 1 to 3, let A, B, and C be sets, and let f : A → B and g : B → C be functions.

1 Prove that if g ∘ f is injective, then f is injective.

2 Prove that if g ∘ f is surjective, then g is surjective.

3 Parts 1 and 2, together, tell us that if g ∘ f is bijective, then f is injective and g is surjective. Is the converse of this statement true: If xy0 is injective and g surjective, is g ∘ f bijective? (If “yes,” prove it; if “no,” give a counterexample.)

4 Let f : A → B and g : B → A be functions. Suppose that y = f(x) iff x = g(y). Prove that f is bijective, and g = f⁻¹

H. Theory of Automata

Digital computers and other electronic systems are made up of certain basic control circuits. Underlying such circuits is a fundamental mathematical notion, the notion of finite automata, also known as finite-state machines.

A finite automaton receives information which consists of sequences of symbols from some alphabet A. A typical input sequence is a word x = x₁x₂ … x_n, where x_l x₂,. . . are symbols in the alphabet A. The machine has a set of internal components whose combined state is called the internal state of the machine. At each time interval the machine “reads” one symbol of the incoming input sequence and responds by going into a new internal state: the response depends both on the symbol being read and on the machine’s present internal state. Let S denote the set of internal states; we may describe a particular machine by specifying a function α : S × A → S. If s_i is an internal state and a_j is the symbol currently being read, then α(s_i a_j) = s_k is the machine’s next state. (That is, the machine, while in state s_i responds to the symbol a_j by going into the new state s_k.) The function α is called the next-state function of the machine.

Example Let M₁ be the machine whose alphabet is A = {0,1}, whose set of internal states is S = {s₀, s₁}, and whose next-state function is given by the table

(The table asserts: When in state s₀ and reading 0, remain in s₀. When in s₀ and reading 1, go to state s₁. When in s₁ and reading 0, remain in s₁ When in s₁ and reading 1, go to state s₀.)

A possible use of M₁ is as a parity-check machine, which may be used in decoding information arriving on a communication channel. Assume the incoming information consists of sequences of five symbols, 0s and Is, such as 10111. The machine starts off in state s₀. It reads the first digit, 1, and as dictated by the table above, goes into state s₁. Then it reads the second digit, 0, and remains in s₁. When it reads the third digit, 1, it goes into state s₀, and when it reads the fourth digit, 1, it goes into s₁. Finally, it reaches the last digit, which is the parity-check digit: if the sum of the first four digits is even, the parity-check digit is 0; otherwise it is 1. When the parity-check digit, 1, is read, the machine goes into state s₀. Ending in state s₀ indicates that the parity-check digit is correct. If the machine ends in s₁, this indicates that the parity-check digit is incorrect and there has been an error of transmission.

A machine can also be described with the aid of a state diagram, which consists of circles interconnected by arrows: the notation

means that if the machine is in state s_i when x is read, it goes into state s_j. The diagram of the machine M₁ of the previous example is

In parts 1−4 describe the machines which are able to carry out the indicated functions. For each machine, give the alphabet A, the set of states S, and the table of the next-state function. Then draw the state diagram of the machine.

# 1 The input alphabet consists of four letters, a, b, c, and d. For each incoming sequence, the machine determines whether that sequence contains exactly three a’s.

2 The same conditions pertain as in part 1, but the machine determines whether the sequence contains at least three a’s.

3 The input alphabet consists of the digits 0, 1, 2, 3, 4. The machine adds the digits of an input sequence; the addition is modulo 5 (see page 27). The sum is to be given by the machine’s state after the last digit is read.

4 The machine tells whether or not an incoming sequence of 0_s and Is ends with 111.

5 If M is a machine whose next-state function is α, define ᾱ as follows: If x is an input sequence and the machine (in state s_i.) begins reading x, then ᾱ(s_i, x) is the state of the machine after the last symbol of x is read. For instance, if M₁ is the machine of the example given above, then ᾱ(s₀,11010) = s₁.(The machine is in s₀ before the first symbol is read; each 1 alters the state, but the 0s do not. Thus, after the last 0 is read, the machine is in state s₁.)

(a) For the machine M₁, give ᾱ(s₀,x) for all three-digit sequences x.

(b) For the machine of part 1, give ᾱ(s_i, x) for each state s_i and every two-letter sequence x.

6 With each input sequence x we associate a function T_x: S→ S called a state transition function, defined as follows:

T_x(s_i) = ᾱ(s_i,x)

For the machine M₁ of the example, if x = 11010, T_x is given by

T^x(s₀) = s₁ and T_x(s₁)= s₀

(a) Describe the transition function T_x for the machine M₁ and the following sequences: x = 01001, x =10011, x = 01010.

(b) Explain why M₁ has only two distinct transition functions. [Note: Two functions f and g are equal if f(x) = g(x) for every x; otherwise they are distinct.]

(c) For the machine of part 1, describe the transition function T₁ for the following x: x = abbca, x = babac, x = ccbaa.

(d) How many distinct transition functions are there for the machine of part 3?

I. Automata, Semigroups, and Groups

By a semigroup we mean a set A with an associative operation. (There does not need to be an identity element, nor do elements necessarily have inverses.) Every group is a semigroup, though the converse is clearly false. With every semigroup A we associate an automaton M = M (A) called the automaton of the semigroup A. The alphabet of M is A, the set of states also is A, and the next-state function is α(s, a) = sa [or α(s, a) = s + a if the operation of the semigroup is denoted additively].

1 Describe M(₄). That is, give the table of its next-state function, as well as its state diagram.

2 Describe M(S₃).

If M is a machine and S is the set of states of M, the state transition functions of M (defined in Exercise H6 of this chapter) are functions from S to S. In the next exercise you will be asked to show that T_y ∘ T_x = T_xy; that is, the composite of two transition functions is a transition function. Since the composition of functions is associative [f ∘ (g ∘ h) = (f ∘ g) ∘ h], it follows that the set of all transition functions, with the operation °, is a semigroup. It is denoted by (M) and called the semigroup of the machine M.

3 Prove that T_xy = T_y ∘ T_x.

4 Let M₁ be the machine of the example in Exercise H above. Give the table of the semigroup (M₁. Does (M₁) have an identity element? Is (M₁) a group?

5 Let M₂ be the machine of Exercise H3. How many distinct functions are there in (M₃)? Give the table of (M₃). Is (M₃) a group? (Why?)

6 Find the table of (M) if M is the machine whose state diagram is