
Functions, Linear Transformations, and Matrices

Appendix B: Functions

function

If $A$ and $B$ are sets, then a function $f$ from $A$ to $B$, written $f: A \to B$, is a rule that associates to each element $x$ in $A$ a unique element denoted $f(x)$ in $B$. The element $f(x)$ is called the image of $x$ (under $f$), and $x$ is called a preimage of $f(x)$ (under $f$).

If $f: A \to B$, then $A$ is called the domain of $f$, $B$ is called the codomain of $f$, and the set $\{f(x) : x \in A\}$ is called the range of $f$. Note that the range of $f$ is a subset of $B$. If $S \subseteq A$, we denote by $f(S)$ the set $\{f(x) : x \in S\}$ of all images of elements of $S$. Likewise, if $T \subseteq B$, we denote by $f^{-1}(T)$ the set $\{x \in A : f(x) \in T\}$ of all preimages of elements in $T$.

Finally, two functions $f: A \to B$ and $g: A \to B$ are equal, written $f = g$, if $f(x) = g(x)$ for all $x \in A$.

Example B.1

Suppose that $A = [-10, 10]$. Let $f: A \to \mathbb{R}$ be the function that assigns to each element $x$ in $A$ the element $x^2 + 1$ in $\mathbb{R}$; that is, $f$ is defined by $f(x) = x^2 + 1$.

Then $A$ is the domain of $f$, $\mathbb{R}$ is the codomain of $f$, and $[1, 101]$ is the range of $f$. Since $f(2) = 5$, the image of $2$ is $5$, and $2$ is a preimage of $5$. Notice that $-2$ is another preimage of $5$. Moreover, if $S = [1, 2]$ and $T = [82, 101]$, then $f(S) = [2, 5]$ and $f^{-1}(T) = [-10, -9] \cup [9, 10]$.

one-to-one, onto

As Example B.1 shows, the preimage of an element in the range need not be unique. Functions s.t. each element of the range has a unique preimage are called one-to-one (injective); that is, $f: A \to B$ is one-to-one if $f(x) = f(y)$ implies $x = y$ or, equivalently, if $x \neq y$ implies $f(x) \neq f(y)$.

If $f: A \to B$ is a function with range $B$, that is, if $f(A) = B$, then $f$ is called onto (surjective). So $f$ is onto iff the range of $f$ equals the codomain of $f$. [core]

one-to-one + onto = bijective

Let $A, B, C$ be sets and $f: A \to B$ and $g: B \to C$ be functions. By following $f$ with $g$, we obtain a function $g \circ f: A \to C$ called the composite of $g$ and $f$. Thus $(g \circ f)(x) = g(f(x))$ for all $x \in A$.

  • For example, let $A = B = C = \mathbb{R}$, $f(x) = \sin x$, and $g(x) = x^2 + 3$. Then $(g \circ f)(x) = g(f(x)) = \sin^2 x + 3$, whereas $(f \circ g)(x) = f(g(x)) = \sin(x^2 + 3)$. Hence $g \circ f \neq f \circ g$. (See the sketch after this list.)
  • Function composition is associative, i.e. if $h: C \to D$ is another function, then $h \circ (g \circ f) = (h \circ g) \circ f$.
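As a quick sanity check, the two composites above really do disagree. A minimal sketch in plain Python (my own illustration, not from the text):

```python
import math

f = math.sin                   # f(x) = sin x
g = lambda x: x**2 + 3         # g(x) = x^2 + 3

g_of_f = lambda x: g(f(x))     # (g∘f)(x) = sin^2(x) + 3
f_of_g = lambda x: f(g(x))     # (f∘g)(x) = sin(x^2 + 3)

print(g_of_f(1.0))             # ≈ 3.708
print(f_of_g(1.0))             # ≈ -0.757, so g∘f ≠ f∘g
```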

A function $f: A \to B$ is said to be invertible if there exists a function $g: B \to A$ s.t. $(f \circ g)(y) = y$ for all $y \in B$ and $(g \circ f)(x) = x$ for all $x \in A$. If such a function $g$ exists, then it is unique and is called the inverse of $f$. We denote the inverse of $f$ (when it exists) by $f^{-1}$. It can be shown that $f$ is invertible iff $f$ is both one-to-one and onto.

The following facts about invertible functions are easily proved:

  1. If $f: A \to B$ is invertible, then $f^{-1}$ is invertible, and $(f^{-1})^{-1} = f$.
  2. If $f: A \to B$ and $g: B \to C$ are invertible, then $g \circ f$ is invertible, and $(g \circ f)^{-1} = f^{-1} \circ g^{-1}$.

2.1 Linear Transformations, Null Spaces, and Ranges

Linear Transformation

Now it's natural to consider those functions defined on vector spaces that in some sense "preserve" the structure.

Definition: linear transformation

Let $V, W$ be vector spaces (over $F$). We call a function $T: V \to W$ a linear transformation from $V$ to $W$ if, for all $x, y \in V$ and $c \in F$, we have

  • $T(x + y) = T(x) + T(y)$ (additivity)
  • $T(cx) = cT(x)$ (scaling)

We often simply call $T$ linear. The following are properties of a function $T: V \to W$:

Properties

  1. If $T$ is linear, then $T(0) = 0$;
  2. $T$ is linear iff $T(cx + y) = cT(x) + T(y)$ for all $x, y \in V$ and $c \in F$; (generally used to prove that a given transformation is linear) [core]
  3. If $T$ is linear, then $T(x - y) = T(x) - T(y)$ for all $x, y \in V$;
  4. $T$ is linear iff for $x_1, x_2, \ldots, x_n \in V$ and $a_1, a_2, \ldots, a_n \in F$, we have
$$T\left(\sum_{i=1}^n a_i x_i\right) = \sum_{i=1}^n a_i T(x_i).$$

Example 1

Define

$T: \mathbb{R}^2 \to \mathbb{R}^2$ by $T(a_1, a_2) = (2a_1 + a_2,\ a_1)$.

To show that $T$ is linear, let $c \in \mathbb{R}$ and $x, y \in \mathbb{R}^2$, where $x = (b_1, b_2)$ and $y = (d_1, d_2)$. Since

$$cx + y = (cb_1 + d_1,\ cb_2 + d_2)$$

we have

$$T(cx + y) = (2(cb_1 + d_1) + cb_2 + d_2,\ cb_1 + d_1).$$

Also

$$cT(x) + T(y) = c(2b_1 + b_2,\ b_1) + (2d_1 + d_2,\ d_1) = (2cb_1 + cb_2 + 2d_1 + d_2,\ cb_1 + d_1) = (2(cb_1 + d_1) + cb_2 + d_2,\ cb_1 + d_1).$$

So $T$ is linear.

Example 2: rotation

For any angle $\theta$, define $T_\theta: \mathbb{R}^2 \to \mathbb{R}^2$ by the rule: $T_\theta(a_1, a_2)$ is the vector obtained by rotating $(a_1, a_2)$ counterclockwise by $\theta$ if $(a_1, a_2) \neq (0, 0)$, and $T_\theta(0, 0) = (0, 0)$.

Then $T_\theta: \mathbb{R}^2 \to \mathbb{R}^2$ is a linear transformation that is called the rotation by $\theta$.

We determine an explicit formula for $T_\theta$. Fix a nonzero vector $(a_1, a_2) \in \mathbb{R}^2$. Let $\alpha$ be the angle that $(a_1, a_2)$ makes with the positive $x$-axis, and let $r = \sqrt{a_1^2 + a_2^2}$. Then $a_1 = r\cos\alpha$ and $a_2 = r\sin\alpha$. Also, $T_\theta(a_1, a_2)$ has length $r$ and makes an angle $\alpha + \theta$ with the positive $x$-axis. It follows that

$$T_\theta(a_1, a_2) = (r\cos(\alpha + \theta),\ r\sin(\alpha + \theta)) = (r\cos\alpha\cos\theta - r\sin\alpha\sin\theta,\ r\cos\alpha\sin\theta + r\sin\alpha\cos\theta) = (a_1\cos\theta - a_2\sin\theta,\ a_1\sin\theta + a_2\cos\theta)$$

Finally, observe that this same formula is valid for $(a_1, a_2) = (0, 0)$. It is now easy to show, as in Example 1, that $T_\theta$ is linear.
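The formula above says $T_\theta$ is left-multiplication by a $2 \times 2$ matrix, so linearity can be spot-checked directly. A minimal numerical sketch of mine, assuming NumPy is available:

```python
import numpy as np

def rotation_matrix(theta: float) -> np.ndarray:
    """Matrix of T_theta in the standard basis, from the formula above."""
    return np.array([[np.cos(theta), -np.sin(theta)],
                     [np.sin(theta),  np.cos(theta)]])

R = rotation_matrix(np.pi / 6)
x, y, c = np.array([1.0, 2.0]), np.array([-3.0, 0.5]), 2.5

# T(cx + y) = cT(x) + T(y), since T_theta is matrix multiplication:
assert np.allclose(R @ (c * x + y), c * (R @ x) + R @ y)

# Rotating (1, 0) by 90 degrees gives (0, 1):
assert np.allclose(rotation_matrix(np.pi / 2) @ [1.0, 0.0], [0.0, 1.0])
```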

Example 3&4: Reflection

Define $T: \mathbb{R}^2 \to \mathbb{R}^2$ by $T(a_1, a_2) = (a_1, -a_2)$. $T$ is called the reflection about the $x$-axis.

Define $T: \mathbb{R}^2 \to \mathbb{R}^2$ by $T(a_1, a_2) = (-a_1, a_2)$. $T$ is called the reflection about the $y$-axis.

Example 5: Polynomial

Define $T: P_n(\mathbb{R}) \to P_{n-1}(\mathbb{R})$ by $T(f(x)) = f'(x)$, where $f'(x)$ denotes the derivative of $f(x)$. To show that $T$ is linear, let $g(x), h(x) \in P_n(\mathbb{R})$ and $a \in \mathbb{R}$. Now

$$T(ag(x) + h(x)) = (ag(x) + h(x))' = ag'(x) + h'(x) = aT(g(x)) + T(h(x)).$$

So by property 2 above, $T$ is linear.

Example 6: integration

Let $V = C(\mathbb{R})$, the vector space of continuous real-valued functions on $\mathbb{R}$. Let $a, b \in \mathbb{R}$, $a < b$. Define $T: V \to \mathbb{R}$ by

$$T(f) = \int_a^b f(t)\,dt$$

for all $f \in V$. Then $T$ is a linear transformation because the definite integral of a linear combination of functions is the same as the linear combination of the definite integrals of the functions.

Special transformations

For vector spaces $V, W$ (over $F$), we define the identity transformation $I_V: V \to V$ by $I_V(x) = x$ for all $x \in V$ and the zero transformation $T_0: V \to W$ by $T_0(x) = 0$ for all $x \in V$. It is clear that both of these transformations are linear. We often write $I$ instead of $I_V$.

We now turn our attention to two very important sets associated with linear transformations: the range and null space. The determination of these sets allows us to examine more closely the intrinsic properties of a linear transformation.

Definitions: null space / kernel; range / image [core]

Let $V, W$ be vector spaces, and let $T: V \to W$ be linear. We define the null space (or kernel) $N(T)$ of $T$ to be the set of all vectors $x$ in $V$ s.t. $T(x) = 0$; i.e., $N(T) = \{x \in V : T(x) = 0\}$.

We define the range (or image) $R(T)$ of $T$ to be the subset of $W$ consisting of all images (under $T$) of vectors in $V$; i.e., $R(T) = \{T(x) : x \in V\}$.

Example 7: identity and zero transformation

Let $V, W$ be vector spaces, and let $I: V \to V$ and $T_0: V \to W$ be the identity and zero transformations, respectively. Then $N(I) = \{0\}$, $R(I) = V$, $N(T_0) = V$, and $R(T_0) = \{0\}$.

Example 8

Let $T: \mathbb{R}^3 \to \mathbb{R}^2$ be the linear transformation defined by

$$T(a_1, a_2, a_3) = (a_1 - a_2,\ 2a_3).$$

It's easy to verify that

$$N(T) = \{(a, a, 0) : a \in \mathbb{R}\} \quad\text{and}\quad R(T) = \mathbb{R}^2.$$
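Both claims can be checked numerically. A sketch of mine (assuming NumPy), representing $T$ by its standard matrix:

```python
import numpy as np

# T(a1, a2, a3) = (a1 - a2, 2*a3) in the standard bases:
A = np.array([[1.0, -1.0, 0.0],
              [0.0,  0.0, 2.0]])

# Every vector of the form (a, a, 0) is in N(T):
assert np.allclose(A @ np.array([7.0, 7.0, 0.0]), 0.0)

# rank(A) = 2 = dim(R^2), so R(T) is all of R^2:
assert np.linalg.matrix_rank(A) == 2
```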

Theorem 2.1

Let $V, W$ be vector spaces and $T: V \to W$ be linear. Then $N(T)$ and $R(T)$ are subspaces of $V$ and $W$, respectively.

Theorem 2.2

Let $V, W$ be vector spaces and $T: V \to W$ be linear. If $\beta = \{v_1, v_2, \ldots, v_n\}$ is a basis for $V$, then

$$R(T) = \mathrm{span}(T(\beta)) = \mathrm{span}(\{T(v_1), T(v_2), \ldots, T(v_n)\}).$$

Recall from Theorem 1.5 in §1 the relation between subspaces and spans.

Example 9

Define the linear transformation $T: P_2(\mathbb{R}) \to M_{2\times 2}(\mathbb{R})$ by

$$T(f(x)) = \begin{pmatrix} f(1) - f(2) & 0 \\ 0 & f(0) \end{pmatrix}$$

Since $\beta = \{1, x, x^2\}$ is a basis for $P_2(\mathbb{R})$, we have

$$R(T) = \mathrm{span}(T(\beta)) = \mathrm{span}(\{T(1), T(x), T(x^2)\}) = \mathrm{span}\left(\left\{\begin{pmatrix} 0 & 0 \\ 0 & 1 \end{pmatrix}, \begin{pmatrix} -1 & 0 \\ 0 & 0 \end{pmatrix}, \begin{pmatrix} -3 & 0 \\ 0 & 0 \end{pmatrix}\right\}\right) = \mathrm{span}\left(\left\{\begin{pmatrix} 0 & 0 \\ 0 & 1 \end{pmatrix}, \begin{pmatrix} -1 & 0 \\ 0 & 0 \end{pmatrix}\right\}\right)$$

Thus, we have found a basis for $R(T)$, so $\dim(R(T)) = 2$.

Definition and Theorem

As in §1.4, we measure the "size" of a subspace by its dimension. The null space and range are so important that we attach special names to their respective dimensions.

Definition: nullity; rank [core]

Let V and W be vector spaces, and let T:VW be linear. If N(T) and R(T) are finite-dimensional, then we define:

  • Nullity of T, denoted nullity(T), as the dimension of N(T).
  • Rank of T, denoted rank(T), as the dimension of R(T).

Theorem 2.3: Dimension Theorem [core]

Let $V$ and $W$ be vector spaces, and $T: V \to W$ linear. If $V$ is finite-dimensional, then

$$\mathrm{nullity}(T) + \mathrm{rank}(T) = \dim(V)$$

Theorem 2.4: Null Space and One-to-one [core]

Let $V$ and $W$ be vector spaces, and $T: V \to W$ linear. Then $T$ is one-to-one iff

$$N(T) = \{0\}$$

Theorem 2.5 [core]

Let $V$ and $W$ be finite-dimensional vector spaces of equal dimension, and $T: V \to W$ linear. The following are equivalent:

  • T is one-to-one.
  • T is onto.
  • rank(T)=dim(V).

Example 10

Let $T: P_2(\mathbb{R}) \to P_3(\mathbb{R})$ be the linear transformation defined by

$$T(f(x)) = 2f'(x) + \int_0^x 3f(t)\,dt$$

Now

$$R(T) = \mathrm{span}(\{T(1), T(x), T(x^2)\}) = \mathrm{span}(\{3x,\ 2 + \tfrac{3}{2}x^2,\ 4x + x^3\}).$$
  • Since $\{3x,\ 2 + \tfrac{3}{2}x^2,\ 4x + x^3\}$ is linearly independent, $\mathrm{rank}(T) = 3$;
  • Since $\dim(P_3(\mathbb{R})) = 4$, $T$ is not onto;
  • From the dimension theorem, $\mathrm{nullity}(T) + 3 = 3$, so the nullity is $0$, and therefore $N(T) = \{0\}$.

So T is one-to-one but not onto.
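The rank and nullity here can also be verified numerically. A sketch of mine (assuming NumPy), encoding $T$ by the matrix whose columns are the coordinate vectors of $T(1), T(x), T(x^2)$ in the standard basis $\{1, x, x^2, x^3\}$ of $P_3(\mathbb{R})$:

```python
import numpy as np

# Columns: T(1) = 3x, T(x) = 2 + (3/2)x^2, T(x^2) = 4x + x^3:
A = np.array([[0.0, 2.0, 0.0],
              [3.0, 0.0, 4.0],
              [0.0, 1.5, 0.0],
              [0.0, 0.0, 1.0]])

rank = np.linalg.matrix_rank(A)
nullity = A.shape[1] - rank   # dimension theorem: nullity + rank = dim(V) = 3
print(rank, nullity)          # 3 0 -> one-to-one, but not onto since dim(P_3) = 4
```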

Example 11

Let $T: F^2 \to F^2$ be the linear transformation defined by

$$T(a_1, a_2) = (a_1 + a_2,\ a_1)$$

It's easy to see that $N(T) = \{0\}$, so $T$ is one-to-one. Theorem 2.5 then tells us that $T$ must be onto.

Example 12

Let $T: P_2(\mathbb{R}) \to \mathbb{R}^3$ be the linear transformation defined by

$$T(a_0 + a_1x + a_2x^2) = (a_0, a_1, a_2)$$

Clearly $T$ is linear and one-to-one. Let $S = \{2 - x + 3x^2,\ x + x^2,\ 1 - 2x^2\}$. Then $S$ is linearly independent in $P_2(\mathbb{R})$ because

$$T(S) = \{(2, -1, 3),\ (0, 1, 1),\ (1, 0, -2)\}$$

is linearly independent in $\mathbb{R}^3$.

Theorem 2.6


Let $V$ and $W$ be vector spaces over $F$, and suppose that $\{v_1, v_2, \ldots, v_n\}$ is a basis for $V$. For $w_1, w_2, \ldots, w_n$ in $W$, there exists exactly one linear transformation $T: V \to W$ s.t. $T(v_i) = w_i$ for $i = 1, 2, \ldots, n$.

Corollary

Let $V$ and $W$ be vector spaces, and suppose that $V$ has a finite basis $\{v_1, v_2, \ldots, v_n\}$. If $U, T: V \to W$ are linear and $U(v_i) = T(v_i)$ for $i = 1, 2, \ldots, n$, then $U = T$.

Example 13

Let $T: \mathbb{R}^2 \to \mathbb{R}^2$ be the linear transformation defined by

$$T(a_1, a_2) = (2a_2 - a_1,\ 3a_1),$$

and suppose that $U: \mathbb{R}^2 \to \mathbb{R}^2$ is linear. If we know that $U(1, 2) = (3, 3)$ and $U(1, 1) = (1, 3)$, then $U = T$. This follows from the corollary and from the fact that $\{(1, 2), (1, 1)\}$ is a basis for $\mathbb{R}^2$.
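Theorem 2.6 is constructive: the prescribed values on a basis pin down the matrix of the transformation. A small sketch of mine (assuming NumPy) recovers the standard matrix of $U$ from $U(1,2)$ and $U(1,1)$ and confirms it agrees with $T$:

```python
import numpy as np

P = np.array([[1.0, 1.0],    # columns: the basis vectors (1,2) and (1,1)
              [2.0, 1.0]])
B = np.array([[3.0, 1.0],    # columns: their images (3,3) and (1,3)
              [3.0, 3.0]])

# U maps P's columns to B's columns, forcing the standard matrix M = B P^{-1}:
M = B @ np.linalg.inv(P)
print(M)                     # [[-1. 2.] [ 3. 0.]], i.e. U(a1,a2) = (2a2 - a1, 3a1) = T
```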

2.2 The Matrix Representation of a Linear Transformation

coordinate vector

Definition: ordered basis

Let V be a finite-dimensional vector space. An ordered basis for V is a basis for V endowed with a specific order; that is, an ordered basis for V is a finite sequence of linearly independent vectors in V that generates V.

Example 1

In $F^3$, $\beta = \{e_1, e_2, e_3\}$ can be considered an ordered basis. Also $\gamma = \{e_2, e_1, e_3\}$ is an ordered basis, but $\beta \neq \gamma$ as ordered bases.

For the vector space $F^n$, we call $\{e_1, e_2, \ldots, e_n\}$ the standard ordered basis for $F^n$. Similarly, for the vector space $P_n(F)$, we call $\{1, x, \ldots, x^n\}$ the standard ordered basis for $P_n(F)$.

Definition: coordinate vector

Let $\beta = \{v_1, v_2, \ldots, v_n\}$ be an ordered basis for a finite-dimensional vector space $V$. For $x \in V$, let $a_1, a_2, \ldots, a_n$ be the unique scalars s.t.

$$x = \sum_{i=1}^n a_i v_i$$

We define the coordinate vector of $x$ relative to $\beta$, denoted $[x]_\beta$ [core], by

$$[x]_\beta = \begin{pmatrix} a_1 \\ a_2 \\ \vdots \\ a_n \end{pmatrix}.$$

Example 2: coordinate vector for polynomials

Let $V = P_2(\mathbb{R})$, and let $\beta = \{1, x, x^2\}$ be the standard ordered basis for $V$. If $f(x) = 4 + 6x - 7x^2$, then

$$[f]_\beta = \begin{pmatrix} 4 \\ 6 \\ -7 \end{pmatrix}.$$

matrix representation

Definition: matrix representation [core]

Suppose that $V$ and $W$ are finite-dimensional vector spaces with ordered bases $\beta = \{v_1, v_2, \ldots, v_n\}$ and $\gamma = \{w_1, w_2, \ldots, w_m\}$, respectively, and let $T: V \to W$ be linear. Then there exist unique scalars $a_{ij} \in F$ such that

$$T(v_j) = \sum_{i=1}^m a_{ij} w_i \quad\text{for } 1 \le j \le n.$$

Using this notation, we call the $m \times n$ matrix $A$ defined by $A_{ij} = a_{ij}$ the matrix representation of $T$ in the ordered bases $\beta$ and $\gamma$ and write $A = [T]_\beta^\gamma$.

If $V = W$ and $\beta = \gamma$, then we write $A = [T]_\beta$.

Notice that the $j$th column of $A$ is simply $[T(v_j)]_\gamma$. Also observe that if $U: V \to W$ is a linear transformation s.t. $[U]_\beta^\gamma = [T]_\beta^\gamma$, then $U = T$ by the corollary to Theorem 2.6.

Example 3: matrix representation for tuples

Let $T: \mathbb{R}^2 \to \mathbb{R}^3$ be defined by

$$T(a_1, a_2) = (a_1 + 3a_2,\ 0,\ 2a_1 - 4a_2)$$

Let $\beta, \gamma$ be the standard ordered bases for $\mathbb{R}^2$ and $\mathbb{R}^3$, respectively. Now

$$T(1, 0) = (1, 0, 2) = 1e_1 + 0e_2 + 2e_3$$

and

$$T(0, 1) = (3, 0, -4) = 3e_1 + 0e_2 - 4e_3.$$

Hence

$$[T]_\beta^\gamma = \begin{pmatrix} 1 & 3 \\ 0 & 0 \\ 2 & -4 \end{pmatrix}$$

If we let $\gamma' = \{e_3, e_2, e_1\}$, then

$$[T]_\beta^{\gamma'} = \begin{pmatrix} 2 & -4 \\ 0 & 0 \\ 1 & 3 \end{pmatrix}$$

Example 4: matrix representation for polynomials

Let $T: P_3(\mathbb{R}) \to P_2(\mathbb{R})$ be the linear transformation defined by $T(f(x)) = f'(x)$. Let $\beta, \gamma$ be the standard ordered bases for $P_3(\mathbb{R})$ and $P_2(\mathbb{R})$, respectively. Then

$$\begin{aligned} T(1) &= 0 \cdot 1 + 0x + 0x^2 \\ T(x) &= 1 \cdot 1 + 0x + 0x^2 \\ T(x^2) &= 0 \cdot 1 + 2x + 0x^2 \\ T(x^3) &= 0 \cdot 1 + 0x + 3x^2 \end{aligned}$$

So

$$\left[\frac{d}{dx}\right] = [T]_\beta^\gamma = \begin{pmatrix} 0 & 1 & 0 & 0 \\ 0 & 0 & 2 & 0 \\ 0 & 0 & 0 & 3 \end{pmatrix}$$

i.e.

$$f(x) = a + bx + cx^2 + dx^3 \implies T(f(x)) = f'(x) = b + 2cx + 3dx^2 \implies \begin{pmatrix} b \\ 2c \\ 3d \end{pmatrix} = \begin{pmatrix} 0 & 1 & 0 & 0 \\ 0 & 0 & 2 & 0 \\ 0 & 0 & 0 & 3 \end{pmatrix} \begin{pmatrix} a \\ b \\ c \\ d \end{pmatrix}$$

Note that when $T(x^j)$ is written as a linear combination of the vectors of $\gamma$, its coefficients give the entries of the $j$th column of $[T]_\beta^\gamma$.
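This column-by-column recipe is easy to automate. A small sketch of mine (assuming NumPy) that builds the derivative matrix above and applies it to a coefficient vector:

```python
import numpy as np

n = 3  # T : P_3(R) -> P_2(R)

# Column j is [T(x^j)]_gamma; the derivative of x^j is j * x^(j-1):
D = np.zeros((n, n + 1))
for j in range(1, n + 1):
    D[j - 1, j] = j

f = np.array([1.0, -2.0, 5.0, 4.0])   # f(x) = 1 - 2x + 5x^2 + 4x^3
print(D @ f)                          # [-2. 10. 12.] -> f'(x) = -2 + 10x + 12x^2
```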

operations on linear transformations

Definition

Let $T, U: V \to W$ be arbitrary functions, where $V, W$ are vector spaces over $F$, and let $a \in F$. We define:

  • $T + U: V \to W$ by $(T + U)(x) = T(x) + U(x)$ for all $x \in V$;
  • $aT: V \to W$ by $(aT)(x) = aT(x)$ for all $x \in V$.

Theorem 2.7

Let $V, W$ be vector spaces over a field $F$, and let $T, U: V \to W$ be linear.

  • For all $a \in F$, $aT + U$ is linear.
  • Using the operations of addition and scalar multiplication in the preceding definition, the collection of all linear transformations from $V$ to $W$ is a vector space over $F$.

Definition

Let $V, W$ be vector spaces over $F$. We denote the vector space of all linear transformations from $V$ to $W$ by $\mathcal{L}(V, W)$. In the case that $V = W$, we write $\mathcal{L}(V)$ instead of $\mathcal{L}(V, W)$.

Theorem 2.8

Let $V, W$ be finite-dimensional vector spaces with ordered bases $\beta, \gamma$, respectively, and let $T, U: V \to W$ be linear transformations. Then

  • $[T + U]_\beta^\gamma = [T]_\beta^\gamma + [U]_\beta^\gamma$
  • $[aT]_\beta^\gamma = a[T]_\beta^\gamma$ for all scalars $a$.

Example 5

Let $T: \mathbb{R}^2 \to \mathbb{R}^3$ and $U: \mathbb{R}^2 \to \mathbb{R}^3$ be the linear transformations respectively defined by

$$T(a_1, a_2) = (a_1 + 3a_2,\ 0,\ 2a_1 - 4a_2) \quad\text{and}\quad U(a_1, a_2) = (a_1 - a_2,\ 2a_1,\ 3a_1 + 2a_2)$$

Let $\beta, \gamma$ be the standard ordered bases of $\mathbb{R}^2$ and $\mathbb{R}^3$, respectively. Then

$$[T]_\beta^\gamma = \begin{pmatrix} 1 & 3 \\ 0 & 0 \\ 2 & -4 \end{pmatrix}$$

(as computed in Example 3), and

$$[U]_\beta^\gamma = \begin{pmatrix} 1 & -1 \\ 2 & 0 \\ 3 & 2 \end{pmatrix}$$

If we compute $T + U$ using the preceding definitions, we obtain

$$(T + U)(a_1, a_2) = (2a_1 + 2a_2,\ 2a_1,\ 5a_1 - 2a_2).$$

So

$$[T + U]_\beta^\gamma = \begin{pmatrix} 2 & 2 \\ 2 & 0 \\ 5 & -2 \end{pmatrix}$$

which is simply $[T]_\beta^\gamma + [U]_\beta^\gamma$, illustrating Theorem 2.8.

2.3 Composition of Linear Transformations and Matrix Multiplication

matrix product

Theorem 2.9

Let $V, W, Z$ be vector spaces over the same field $F$, and let $T: V \to W$ and $U: W \to Z$ be linear. Then $UT: V \to Z$ is linear.

Theorem 2.10

Let $V$ be a vector space. Let $T, U_1, U_2 \in \mathcal{L}(V)$. Then

  • $T(U_1 + U_2) = TU_1 + TU_2$ and $(U_1 + U_2)T = U_1T + U_2T$
  • $T(U_1U_2) = (TU_1)U_2$
  • $TI = IT = T$
  • $a(U_1U_2) = (aU_1)U_2 = U_1(aU_2)$ for all scalars $a$.

A more general result holds for linear transformations that have domains unequal to their codomains.

Definition: product

Let $T: V \to W$ and $U: W \to Z$ be linear transformations, and let $A = [U]_\beta^\gamma$ and $B = [T]_\alpha^\beta$, where $\alpha = \{v_1, v_2, \ldots, v_n\}$, $\beta = \{w_1, w_2, \ldots, w_m\}$, and $\gamma = \{z_1, z_2, \ldots, z_p\}$ are ordered bases for $V, W$, and $Z$, respectively. We would like to define the product $AB$ of two matrices so that $AB = [UT]_\alpha^\gamma$. Consider the matrix $[UT]_\alpha^\gamma$. For $1 \le j \le n$, we have

$$(UT)(v_j) = U(T(v_j)) = U\left(\sum_{k=1}^m B_{kj} w_k\right) = \sum_{k=1}^m B_{kj} U(w_k) = \sum_{k=1}^m B_{kj} \left(\sum_{i=1}^p A_{ik} z_i\right) = \sum_{i=1}^p \left(\sum_{k=1}^m A_{ik} B_{kj}\right) z_i = \sum_{i=1}^p C_{ij} z_i$$

where

$$C_{ij} = \sum_{k=1}^m A_{ik} B_{kj}.$$

This computation motivates the following definition of matrix multiplication.

This is why matrix multiplication is defined the way it is: it exactly encodes the composition of linear transformations. The definition is not arbitrary, but a natural consequence of how linear transformations compose.

Definition: product of matrices

Let $A$ be an $m \times n$ matrix and $B$ be an $n \times p$ matrix. We define the product of $A$ and $B$, denoted $AB$, to be the $m \times p$ matrix s.t.

$$(AB)_{ij} = \sum_{k=1}^n A_{ik} B_{kj} \quad\text{for } 1 \le i \le m,\ 1 \le j \le p$$

Example 2.3.1

We have

$$\begin{pmatrix} 1 & 2 & 1 \\ 0 & 4 & -1 \end{pmatrix} \begin{pmatrix} 4 \\ 2 \\ 5 \end{pmatrix} = \begin{pmatrix} 1 \cdot 4 + 2 \cdot 2 + 1 \cdot 5 \\ 0 \cdot 4 + 4 \cdot 2 + (-1) \cdot 5 \end{pmatrix} = \begin{pmatrix} 13 \\ 3 \end{pmatrix}$$

Notice again the symbolic relationship (2×3)(3×1)=2×1.
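The entrywise definition translates directly into a triple loop. A sketch of mine (using NumPy only for storage and comparison) that implements $(AB)_{ij} = \sum_k A_{ik}B_{kj}$ and reproduces the example:

```python
import numpy as np

def matmul(A, B):
    """Entrywise definition: (AB)_ij = sum_k A_ik * B_kj."""
    m, n = A.shape
    n2, p = B.shape
    assert n == n2, "inner dimensions must agree"
    C = np.zeros((m, p))
    for i in range(m):
        for j in range(p):
            for k in range(n):
                C[i, j] += A[i, k] * B[k, j]
    return C

A = np.array([[1.0, 2.0, 1.0],
              [0.0, 4.0, -1.0]])
B = np.array([[4.0], [2.0], [5.0]])
print(matmul(A, B))                     # [[13.] [3.]], a (2x3)(3x1) = 2x1 product
assert np.allclose(matmul(A, B), A @ B)
```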

As with composition of functions, matrix multiplication is not commutative: it need not be true that $AB = BA$.

transpose

The transpose $A^t$ of an $m \times n$ matrix $A$ is the $n \times m$ matrix obtained from $A$ by interchanging the rows with the columns; that is, $(A^t)_{ij} = A_{ji}$. For example,

$$\begin{pmatrix} 1 & 2 & 3 \\ 0 & 5 & -1 \end{pmatrix}^t = \begin{pmatrix} 1 & 0 \\ 2 & 5 \\ 3 & -1 \end{pmatrix} \quad\text{and}\quad \begin{pmatrix} 1 & 2 \\ 2 & 3 \end{pmatrix}^t = \begin{pmatrix} 1 & 2 \\ 2 & 3 \end{pmatrix}$$

It can be shown that if $A$ is an $m \times n$ matrix and $B$ is an $n \times p$ matrix, then $(AB)^t = B^tA^t$. Since

$$(AB)^t_{ij} = (AB)_{ji} = \sum_{k=1}^n A_{jk} B_{ki}$$

and

$$(B^tA^t)_{ij} = \sum_{k=1}^n (B^t)_{ik}(A^t)_{kj} = \sum_{k=1}^n B_{ki} A_{jk},$$

the two sides agree entrywise. Therefore the transpose of a product is the product of the transposes in the opposite order.
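A quick random spot check of $(AB)^t = B^tA^t$ (my sketch, assuming NumPy):

```python
import numpy as np

rng = np.random.default_rng(0)
A = rng.integers(-5, 6, size=(2, 3)).astype(float)
B = rng.integers(-5, 6, size=(3, 4)).astype(float)

# (AB)^t equals B^t A^t; note A^t B^t (3x2 times 4x3) is not even defined here.
assert np.allclose((A @ B).T, B.T @ A.T)
```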

The following theorem is an immediate consequence of our definition of matrix multiplication:

Theorem 2.11 [core]

Let $V, W$, and $Z$ be finite-dimensional vector spaces with ordered bases $\alpha, \beta$, and $\gamma$, respectively. Let $T: V \to W$ and $U: W \to Z$ be linear transformations. Then

$$[UT]_\alpha^\gamma = [U]_\beta^\gamma [T]_\alpha^\beta.$$

Corollary

Let $V$ be a finite-dimensional vector space with an ordered basis $\beta$. Let $T, U \in \mathcal{L}(V)$. Then $[UT]_\beta = [U]_\beta [T]_\beta$.

Example 2.3.2 [core]

Let $U: P_3(\mathbb{R}) \to P_2(\mathbb{R})$ and $T: P_2(\mathbb{R}) \to P_3(\mathbb{R})$ be the linear transformations respectively defined by

$$U(f(x)) = f'(x) \quad\text{and}\quad T(f(x)) = \int_0^x f(t)\,dt$$

Let $\alpha$ and $\beta$ be the standard ordered bases of $P_3(\mathbb{R})$ and $P_2(\mathbb{R})$, respectively. From calculus, it follows that $UT = I$, the identity transformation on $P_2(\mathbb{R})$. To illustrate Theorem 2.11, observe that

$$[UT]_\beta = [U]_\alpha^\beta [T]_\beta^\alpha = \begin{pmatrix} 0 & 1 & 0 & 0 \\ 0 & 0 & 2 & 0 \\ 0 & 0 & 0 & 3 \end{pmatrix} \begin{pmatrix} 0 & 0 & 0 \\ 1 & 0 & 0 \\ 0 & \tfrac{1}{2} & 0 \\ 0 & 0 & \tfrac{1}{3} \end{pmatrix} = \begin{pmatrix} 1 & 0 & 0 \\ 0 & 1 & 0 \\ 0 & 0 & 1 \end{pmatrix} = [I]_\beta$$

The preceding 3×3 diagonal matrix is called an identity matrix and is defined next, along with a very useful notation, the Kronecker delta.

Definition and theorem

Definition: identity matrix

We define the Kronecker delta $\delta_{ij}$ by $\delta_{ij} = 1$ if $i = j$ and $\delta_{ij} = 0$ if $i \neq j$. The $n \times n$ identity matrix $I_n$ is defined by $(I_n)_{ij} = \delta_{ij}$.

Theorem 2.12.

Let $A$ be an $m \times n$ matrix, $B$ and $C$ be $n \times p$ matrices, and $D$ and $E$ be $q \times m$ matrices. Then

  • $A(B + C) = AB + AC$ and $(D + E)A = DA + EA$.
  • $a(AB) = (aA)B = A(aB)$ for any scalar $a$.
  • $I_mA = A = AI_n$.
  • If $V$ is an $n$-dimensional vector space with an ordered basis $\beta$, then $[I_V]_\beta = I_n$.

Corollary.

Let $A$ be an $m \times n$ matrix, $B_1, B_2, \ldots, B_k$ be $n \times p$ matrices, $C_1, C_2, \ldots, C_k$ be $q \times m$ matrices, and $a_1, a_2, \ldots, a_k$ be scalars. Then

$$A\left(\sum_{i=1}^k a_i B_i\right) = \sum_{i=1}^k a_i AB_i$$

and

$$\left(\sum_{i=1}^k a_i C_i\right)A = \sum_{i=1}^k a_i C_iA$$

For an $n \times n$ matrix $A$, we define $A^1 = A$, $A^2 = AA$, $A^3 = A^2A$, and, in general, $A^k = A^{k-1}A$ for $k = 2, 3, \ldots$. We define $A^0 = I_n$.

With this notation, we see that if

$$A = \begin{pmatrix} 0 & 0 \\ 1 & 0 \end{pmatrix}$$

then $A^2 = O$ (the zero matrix) even though $A \neq O$. Thus the cancellation property for multiplication in fields is not valid for matrices. To see why, assume that the cancellation law is valid. Then, from $AA = A^2 = O = AO$, we would conclude that $A = O$, which is false.
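Checking this nilpotent example numerically (my sketch, assuming NumPy):

```python
import numpy as np

A = np.array([[0.0, 0.0],
              [1.0, 0.0]])

assert np.allclose(A @ A, 0.0)   # A^2 = O ...
assert not np.allclose(A, 0.0)   # ... even though A != O
```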

Theorem 2.13

Let $A$ be an $m \times n$ matrix and $B$ be an $n \times p$ matrix. For each $j$ $(1 \le j \le p)$, let $u_j$ and $v_j$ denote the $j$th columns of $AB$ and $B$, respectively. Then

  • $u_j = Av_j$
  • $v_j = Be_j$, where $e_j$ is the $j$th standard vector of $F^p$.

Theorem 2.14 [core]

Let $V$ and $W$ be finite-dimensional vector spaces having ordered bases $\beta$ and $\gamma$, respectively, and let $T: V \to W$ be linear. Then, for each $u \in V$, we have

$$[T(u)]_\gamma = [T]_\beta^\gamma [u]_\beta.$$

Example 3

Let $T: P_3(\mathbb{R}) \to P_2(\mathbb{R})$ be the linear transformation defined by $T(f(x)) = f'(x)$, and let $\beta$ and $\gamma$ be the standard ordered bases for $P_3(\mathbb{R})$ and $P_2(\mathbb{R})$, respectively. If $A = [T]_\beta^\gamma$, then, from Example 4 of Section 2.2, we have

$$A = \begin{pmatrix} 0 & 1 & 0 & 0 \\ 0 & 0 & 2 & 0 \\ 0 & 0 & 0 & 3 \end{pmatrix}$$

We illustrate Theorem 2.14 by verifying that $[T(p(x))]_\gamma = [T]_\beta^\gamma [p(x)]_\beta$, where $p(x) \in P_3(\mathbb{R})$ is the polynomial $p(x) = 2 - 4x + x^2 + 3x^3$. Let $q(x) = T(p(x))$; then $q(x) = p'(x) = -4 + 2x + 9x^2$. Hence

$$[T(p(x))]_\gamma = [q(x)]_\gamma = \begin{pmatrix} -4 \\ 2 \\ 9 \end{pmatrix}$$

but also

$$[T]_\beta^\gamma [p(x)]_\beta = A[p(x)]_\beta = \begin{pmatrix} 0 & 1 & 0 & 0 \\ 0 & 0 & 2 & 0 \\ 0 & 0 & 0 & 3 \end{pmatrix} \begin{pmatrix} 2 \\ -4 \\ 1 \\ 3 \end{pmatrix} = \begin{pmatrix} -4 \\ 2 \\ 9 \end{pmatrix}.$$

left-multiplication transformation

Definition: left-multiplication transformation

Let $A$ be an $m \times n$ matrix with entries from a field $F$. We denote by $L_A$ the mapping $L_A: F^n \to F^m$ defined by $L_A(x) = Ax$ (the matrix product of $A$ and $x$) for each column vector $x \in F^n$. We call $L_A$ a left-multiplication transformation.

Example 4

Let

$$A = \begin{pmatrix} 1 & 2 & 1 \\ 0 & 1 & 2 \end{pmatrix}$$

Then $A \in M_{2\times 3}(\mathbb{R})$ and $L_A: \mathbb{R}^3 \to \mathbb{R}^2$. If

$$x = \begin{pmatrix} 1 \\ 3 \\ -1 \end{pmatrix}$$

then

$$L_A(x) = Ax = \begin{pmatrix} 1 & 2 & 1 \\ 0 & 1 & 2 \end{pmatrix} \begin{pmatrix} 1 \\ 3 \\ -1 \end{pmatrix} = \begin{pmatrix} 6 \\ 1 \end{pmatrix}$$

Theorem 2.15

Let $A$ be an $m \times n$ matrix with entries from $F$. Then the left-multiplication transformation $L_A: F^n \to F^m$ is linear. Furthermore, if $B$ is any other $m \times n$ matrix (with entries from $F$) and $\beta$ and $\gamma$ are the standard ordered bases for $F^n$ and $F^m$, respectively, then we have the following properties.

  • $[L_A]_\beta^\gamma = A$.
  • $L_A = L_B$ if and only if $A = B$.
  • $L_{A+B} = L_A + L_B$ and $L_{aA} = aL_A$ for all $a \in F$.
  • If $T: F^n \to F^m$ is linear, then there exists a unique $m \times n$ matrix $C$ such that $T = L_C$. In fact, $C = [T]_\beta^\gamma$.
  • If $E$ is an $n \times p$ matrix, then $L_{AE} = L_AL_E$.
  • If $m = n$, then $L_{I_n} = I_{F^n}$.

Theorem 2.16: Associativity of Matrix Multiplication

Let A,B, and C be matrices such that A(BC) is defined. Then (AB)C is also defined and A(BC)=(AB)C; that is, matrix multiplication is associative.

2.4 Invertibility and Isomorphisms

inverse

Definition: inverse

Let $V$ and $W$ be vector spaces, and let $T: V \to W$ be linear. A function $U: W \to V$ is said to be an inverse of $T$ if $TU = I_W$ and $UT = I_V$. If $T$ has an inverse, then $T$ is said to be invertible. The inverse of $T$ is unique and is denoted by $T^{-1}$.
The following facts hold for invertible functions $T$ and $U$:

  1. $(TU)^{-1} = U^{-1}T^{-1}$.
  2. $(T^{-1})^{-1} = T$; in particular, $T^{-1}$ is invertible.

We often use the fact that a function is invertible if and only if it is both one-to-one and onto. We can therefore restate Theorem 2.5 as follows.

  1. Let $T: V \to W$ be a linear transformation, where $V$ and $W$ are finite-dimensional spaces of equal dimension. Then $T$ is invertible if and only if $\mathrm{rank}(T) = \dim(V)$.

Example 1

Let $T: P_1(\mathbb{R}) \to \mathbb{R}^2$ be the linear transformation defined by $T(a + bx) = (a, a + b)$. The reader can verify directly that $T^{-1}: \mathbb{R}^2 \to P_1(\mathbb{R})$ is defined by $T^{-1}(c, d) = c + (d - c)x$. Observe that $T^{-1}$ is also linear. As Theorem 2.17 demonstrates, this is true in general.

Theorem

Theorem 2.17

Let $V$ and $W$ be vector spaces, and let $T: V \to W$ be linear and invertible. Then $T^{-1}: W \to V$ is linear.

Definition: invertible

Let $A$ be an $n \times n$ matrix. Then $A$ is invertible if there exists an $n \times n$ matrix $B$ such that $AB = BA = I$.

If $A$ is invertible, then the matrix $B$ such that $AB = BA = I$ is unique. The matrix $B$ is called the inverse of $A$ and is denoted by $A^{-1}$.

Example 2

The reader should verify that the inverse of

$$\begin{pmatrix} 5 & 7 \\ 2 & 3 \end{pmatrix} \quad\text{is}\quad \begin{pmatrix} 3 & -7 \\ -2 & 5 \end{pmatrix}$$

dimension

Lemma

Let T be an invertible linear transformation from V to W. Then V is finite-dimensional if and only if W is finite-dimensional. In this case, dim(V)=dim(W).

Theorem 2.18

Let $V$ and $W$ be finite-dimensional vector spaces with ordered bases $\beta$ and $\gamma$, respectively. Let $T: V \to W$ be linear. Then $T$ is invertible if and only if $[T]_\beta^\gamma$ is invertible. Furthermore, $[T^{-1}]_\gamma^\beta = ([T]_\beta^\gamma)^{-1}$.

Example 2.4.3

Let $\beta$ and $\gamma$ be the standard ordered bases of $P_1(\mathbb{R})$ and $\mathbb{R}^2$, respectively.

$$T: P_1(\mathbb{R}) \to \mathbb{R}^2,\ T(a + bx) = (a,\ a + b) \qquad T^{-1}: \mathbb{R}^2 \to P_1(\mathbb{R}),\ T^{-1}(c, d) = c + (d - c)x$$

For $T$ as in Example 1, we have

$$[T]_\beta^\gamma = \begin{pmatrix} 1 & 0 \\ 1 & 1 \end{pmatrix}, \quad [T^{-1}]_\gamma^\beta = \begin{pmatrix} 1 & 0 \\ -1 & 1 \end{pmatrix}$$

It can be verified by matrix multiplication that each matrix is the inverse of the other.
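The verification is a two-line computation (my sketch, assuming NumPy):

```python
import numpy as np

T    = np.array([[1.0, 0.0], [ 1.0, 1.0]])   # [T]_beta^gamma
Tinv = np.array([[1.0, 0.0], [-1.0, 1.0]])   # [T^{-1}]_gamma^beta

# Theorem 2.18: each matrix is the inverse of the other.
assert np.allclose(T @ Tinv, np.eye(2))
assert np.allclose(Tinv @ T, np.eye(2))
```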

Corollary 1

Let $V$ be a finite-dimensional vector space with an ordered basis $\beta$, and let $T: V \to V$ be linear. Then $T$ is invertible if and only if $[T]_\beta$ is invertible. Furthermore, $[T^{-1}]_\beta = ([T]_\beta)^{-1}$.

Corollary 2

Let $A$ be an $n \times n$ matrix. Then $A$ is invertible if and only if $L_A$ is invertible. Furthermore, $(L_A)^{-1} = L_{A^{-1}}$.

isomorphism

Definitions: isomorphism

Let $V$ and $W$ be vector spaces. We say that $V$ is isomorphic to $W$ if there exists a linear transformation $T: V \to W$ that is invertible. Such a linear transformation is called an isomorphism from $V$ onto $W$.

Example 2.4.4

Define $T: F^2 \to P_1(F)$ by $T(a_1, a_2) = a_1 + a_2x$. It is easily checked that $T$ is an isomorphism; so $F^2$ is isomorphic to $P_1(F)$.

Example 2.4.5

Define

$$T: P_3(\mathbb{R}) \to M_{2\times 2}(\mathbb{R}) \quad\text{by}\quad T(f) = \begin{pmatrix} f(1) & f(2) \\ f(3) & f(4) \end{pmatrix}$$

It is easily verified that T is linear. By the Lagrange interpolation formula (Section 1.6), T(f)=O only when f is the zero polynomial. Thus, T is one-to-one.

Moreover, since dim(P3(R))=dim(M2×2(R)), it follows that T is invertible. We conclude that P3(R) is isomorphic to M2×2(R).
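In coordinates (flattening each $2 \times 2$ matrix to $(f(1), f(2), f(3), f(4))$), $T$ is given by a $4 \times 4$ Vandermonde matrix, and the distinct evaluation points $1, 2, 3, 4$ make it invertible. A sketch of mine, assuming NumPy:

```python
import numpy as np

# Row k evaluates a cubic's coefficient vector (a0, a1, a2, a3) at x = k:
V = np.vander([1.0, 2.0, 3.0, 4.0], increasing=True)

# Full rank <=> T is one-to-one <=> T is an isomorphism (equal dimensions):
assert np.linalg.matrix_rank(V) == 4
```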

*orphism

a homomorphism preserves the structure, and some types of homomorphisms are:

  • Epimorphism: a homomorphism that is surjective (AKA onto)
  • Monomorphism: a homomorphism that is injective (AKA one-to-one, 1-1, or univalent)
  • Isomorphism: a homomorphism that is bijective (AKA 1-1 and onto); isomorphic objects are equivalent, but perhaps defined in different ways
  • Endomorphism: a homomorphism from an object to itself
  • Automorphism: a bijective endomorphism (an isomorphism from an object onto itself, essentially just a re-labeling of elements)

Theorem about isomorphism

Theorem 2.19

Let V and W be finite-dimensional vector spaces (over the same field). Then V is isomorphic to W iff dim(V)=dim(W).

Corollary

Let $V$ be a vector space over $F$. Then $V$ is isomorphic to $F^n$ iff $\dim(V) = n$.

Theorem 2.20

Let $V$ and $W$ be finite-dimensional vector spaces of dimensions $n$ and $m$, respectively, with ordered bases $\beta$ and $\gamma$. Then the function $\Phi: \mathcal{L}(V, W) \to M_{m\times n}(F)$, defined by $\Phi(T) = [T]_\beta^\gamma$ for $T \in \mathcal{L}(V, W)$, is an isomorphism.

Corollary

L(V,W) is finite-dimensional with dimension mn.

Definition: standard representation

Let $\beta$ be an ordered basis for an $n$-dimensional vector space $V$ over the field $F$. The standard representation of $V$ w.r.t. $\beta$ is the function $\phi_\beta: V \to F^n$ defined by $\phi_\beta(x) = [x]_\beta$ for each $x \in V$.

Example 2.4.6

Let $\beta = \{(1, 0), (0, 1)\}$ and $\gamma = \{(1, 2), (3, 4)\}$. It's easily observed that both are ordered bases for $\mathbb{R}^2$. For $x = (1, -2)$, we have

$$\phi_\beta(x) = \begin{pmatrix} 1 \\ -2 \end{pmatrix}, \quad \phi_\gamma(x) = \begin{pmatrix} -5 \\ 2 \end{pmatrix} \quad\text{i.e.}\quad -5(1, 2) + 2(3, 4) = (1, -2)$$

We observed earlier that $\phi_\beta$ is a linear transformation. The next theorem tells us much more.

Theorem 2.21

For any finite-dimensional vector space V with ordered basis β,ϕβ is an isomorphism.

Relationship

Let $V$ and $W$ be vector spaces of dimension $n$ and $m$, respectively, and let $T: V \to W$ be a linear transformation. Define $A = [T]_\beta^\gamma$, where $\beta$ and $\gamma$ are arbitrary ordered bases of $V$ and $W$, respectively. We are now able to use $\phi_\beta$ and $\phi_\gamma$ to study the relationship between the linear transformations $T$ and $L_A: F^n \to F^m$.

Consider the following two composites of linear transformations that map V into Fm:

  1. Map $V$ into $F^n$ with $\phi_\beta$ and follow this transformation with $L_A$; this yields the composite $L_A\phi_\beta$.

  2. Map $V$ into $W$ with $T$ and follow it by $\phi_\gamma$ to obtain the composite $\phi_\gamma T$.

(Diagram: $T: V \to W$ on top and $L_A: F^n \to F^m$ on the bottom, connected by $\phi_\beta: V \to F^n$ and $\phi_\gamma: W \to F^m$.)

These two composites are depicted by the dashed arrows in the diagram. By a simple reformulation of Theorem 2.14, we may conclude that

$$L_A\phi_\beta = \phi_\gamma T$$

$$([T(u)]_\gamma = [T]_\beta^\gamma [u]_\beta)$$

That is, the diagram "commutes." Heuristically, this relationship indicates that after $V$ and $W$ are identified with $F^n$ and $F^m$ via $\phi_\beta$ and $\phi_\gamma$, respectively, we may "identify" $T$ with $L_A$. This diagram allows us to transfer operations on abstract vector spaces to ones on $F^n$ and $F^m$.

Example 2.4.7

Recall the linear transformation $T: P_3(\mathbb{R}) \to P_2(\mathbb{R})$ defined by $T(f(x)) = f'(x)$. Let $\beta$ and $\gamma$ be the standard ordered bases for $P_3(\mathbb{R})$ and $P_2(\mathbb{R})$, respectively, and let $\phi_\beta: P_3(\mathbb{R}) \to \mathbb{R}^4$ and $\phi_\gamma: P_2(\mathbb{R}) \to \mathbb{R}^3$ be the corresponding standard representations.

If $A = [T]_\beta^\gamma$, then

$$A = \begin{pmatrix} 0 & 1 & 0 & 0 \\ 0 & 0 & 2 & 0 \\ 0 & 0 & 0 & 3 \end{pmatrix}$$

Consider the polynomial $p(x) = 2 + x - 3x^2 + 5x^3$. We show that

$$L_A\phi_\beta(p(x)) = \begin{pmatrix} 0 & 1 & 0 & 0 \\ 0 & 0 & 2 & 0 \\ 0 & 0 & 0 & 3 \end{pmatrix} \begin{pmatrix} 2 \\ 1 \\ -3 \\ 5 \end{pmatrix} = \begin{pmatrix} 1 \\ -6 \\ 15 \end{pmatrix}$$

But since $T(p(x)) = p'(x) = 1 - 6x + 15x^2$, we have

$$\phi_\gamma T(p(x)) = \begin{pmatrix} 1 \\ -6 \\ 15 \end{pmatrix}$$

So

$$L_A\phi_\beta(p(x)) = \phi_\gamma T(p(x))$$
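The commutativity of the diagram on this example is a one-line assertion (my sketch, assuming NumPy):

```python
import numpy as np

A = np.array([[0.0, 1.0, 0.0, 0.0],
              [0.0, 0.0, 2.0, 0.0],
              [0.0, 0.0, 0.0, 3.0]])

phi_beta_p   = np.array([2.0, 1.0, -3.0, 5.0])  # [p]_beta, p = 2 + x - 3x^2 + 5x^3
phi_gamma_Tp = np.array([1.0, -6.0, 15.0])      # [p']_gamma, p' = 1 - 6x + 15x^2

# L_A(phi_beta(p)) = phi_gamma(T(p)): the diagram commutes.
assert np.allclose(A @ phi_beta_p, phi_gamma_Tp)
```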

2.5 The Change of Coordinate Matrix

Theorem 2.22

Let $\beta$ and $\beta'$ be two ordered bases for a finite-dimensional vector space $V$, and let $Q = [I_V]_{\beta'}^{\beta}$. Then

  • $Q$ is invertible.
  • For any $v \in V$, $[v]_\beta = Q[v]_{\beta'}$.

The matrix $Q = [I_V]_{\beta'}^{\beta}$ is called a change of coordinate matrix. Because of part (b) of the theorem, we say that $Q$ changes $\beta'$-coordinates into $\beta$-coordinates. Observe that if $\beta = \{x_1, x_2, \ldots, x_n\}$ and $\beta' = \{x_1', x_2', \ldots, x_n'\}$, then

$$x_j' = \sum_{i=1}^n Q_{ij} x_i$$

for $j = 1, 2, \ldots, n$; that is, the $j$th column of $Q$ is $[x_j']_\beta$.

Example 2.5.1

In $\mathbb{R}^2$, let

$$\beta = \{(1, 1), (1, -1)\}, \quad \beta' = \{(2, 4), (3, 1)\}$$

Since

$$(2, 4) = 3(1, 1) - 1(1, -1), \quad (3, 1) = 2(1, 1) + 1(1, -1),$$

the change of coordinate matrix $Q$ changing $\beta'$-coordinates into $\beta$-coordinates is

$$Q = \begin{pmatrix} 3 & 2 \\ -1 & 1 \end{pmatrix}$$

For instance,

$$[(2, 4)]_\beta = Q[(2, 4)]_{\beta'} = Q\begin{pmatrix} 1 \\ 0 \end{pmatrix} = \begin{pmatrix} 3 \\ -1 \end{pmatrix}$$
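The change of coordinate matrix can be computed by solving one linear system per vector of $\beta'$. A sketch of mine, assuming NumPy:

```python
import numpy as np

beta       = np.column_stack([(1.0, 1.0), (1.0, -1.0)])  # beta vectors as columns
beta_prime = np.column_stack([(2.0, 4.0), (3.0, 1.0)])   # beta' vectors as columns

# Column j of Q solves beta @ q = x'_j, i.e. Q = beta^{-1} beta':
Q = np.linalg.solve(beta, beta_prime)
print(Q)                                  # [[ 3. 2.] [-1. 1.]]

# [v]_beta = Q [v]_{beta'}; (2,4) has beta'-coordinates (1, 0):
assert np.allclose(Q @ [1.0, 0.0], [3.0, -1.0])
```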

For the remainder of this section, we consider only linear transformations that map a vector space $V$ into itself. Such a linear transformation is called a linear operator on $V$.

Theorem 2.23

Let $T$ be a linear operator on a finite-dimensional vector space $V$, and let $\beta$ and $\beta'$ be ordered bases for $V$. Suppose $Q$ is the change of coordinate matrix that changes $\beta'$-coordinates into $\beta$-coordinates. Then

$$[T]_{\beta'} = Q^{-1}[T]_\beta Q$$

Proof:

$$Q[T]_{\beta'} = [I]_{\beta'}^{\beta}[T]_{\beta'}^{\beta'} = [IT]_{\beta'}^{\beta} = [TI]_{\beta'}^{\beta} = [T]_{\beta}^{\beta}[I]_{\beta'}^{\beta} = [T]_\beta Q,$$

and multiplying on the left by $Q^{-1}$ gives $[T]_{\beta'} = Q^{-1}[T]_\beta Q$.

Example 2.5.2: Calculation with a change of coordinates

Let $T$ be the linear operator on $\mathbb{R}^2$ defined by

$$T\begin{pmatrix} a \\ b \end{pmatrix} = \begin{pmatrix} 3a - b \\ a + 3b \end{pmatrix}$$

Let

$$\beta = \{(1, 1), (1, -1)\}, \quad \beta' = \{(2, 4), (3, 1)\}$$

be the ordered bases. It can be shown that

$$[T]_\beta = \begin{pmatrix} 3 & 1 \\ -1 & 3 \end{pmatrix}$$

(Here are the details of the computation above.)

Given that $[T]_\beta = \tilde{Q}^{-1}[T]_\gamma \tilde{Q}$, where $\gamma$ is the standard ordered basis and $\tilde{Q}$ is the change of coordinate matrix that changes $\beta$-coordinates into $\gamma$-coordinates, we have

$$[T]_\gamma = \begin{pmatrix} 3 & -1 \\ 1 & 3 \end{pmatrix}$$

and

$$\tilde{Q} = \begin{pmatrix} 1 & 1 \\ 1 & -1 \end{pmatrix}, \quad \tilde{Q}^{-1} = \frac{1}{2}\begin{pmatrix} 1 & 1 \\ 1 & -1 \end{pmatrix}.$$

So

$$[T]_\beta = \frac{1}{2}\begin{pmatrix} 1 & 1 \\ 1 & -1 \end{pmatrix} \begin{pmatrix} 3 & -1 \\ 1 & 3 \end{pmatrix} \begin{pmatrix} 1 & 1 \\ 1 & -1 \end{pmatrix} = \begin{pmatrix} 3 & 1 \\ -1 & 3 \end{pmatrix}$$

The change of coordinate matrix that changes $\beta'$-coordinates into $\beta$-coordinates is

$$Q = \begin{pmatrix} 3 & 2 \\ -1 & 1 \end{pmatrix}$$

and

$$Q^{-1} = \frac{1}{5}\begin{pmatrix} 1 & -2 \\ 1 & 3 \end{pmatrix}$$

Hence, by Theorem 2.23,

$$[T]_{\beta'} = Q^{-1}[T]_\beta Q = \begin{pmatrix} 4 & 1 \\ -2 & 2 \end{pmatrix}$$

To show that this is the correct matrix, we can verify that the image under $T$ of each vector of $\beta'$ is the linear combination of the vectors of $\beta'$ with the entries of the corresponding column as its coefficients. For example, the image of the second vector in $\beta'$ is

$$T\begin{pmatrix} 3 \\ 1 \end{pmatrix} = \begin{pmatrix} 8 \\ 6 \end{pmatrix} = 1\begin{pmatrix} 2 \\ 4 \end{pmatrix} + 2\begin{pmatrix} 3 \\ 1 \end{pmatrix}$$

Note that the coefficients of the linear combination are the entries of the second column of $[T]_{\beta'}$.
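Theorem 2.23 applied to this example, numerically (my sketch, assuming NumPy):

```python
import numpy as np

T_beta = np.array([[3.0, 1.0], [-1.0, 3.0]])   # [T]_beta
Q      = np.array([[3.0, 2.0], [-1.0, 1.0]])   # changes beta'- into beta-coordinates

T_beta_prime = np.linalg.inv(Q) @ T_beta @ Q   # [T]_{beta'} = Q^{-1} [T]_beta Q
print(T_beta_prime)                            # [[ 4. 1.] [-2. 2.]]
```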

Example 2.5.3: An application of the change of coordinates

Recall the reflection about the $x$-axis in Example 3 of Section 2.1. The rule $(x, y) \mapsto (x, -y)$ is easy to obtain. Let $T$ be the reflection about the line $y = 2x$. We wish to find an expression for $T(a, b)$ for any $(a, b) \in \mathbb{R}^2$. Since $T$ is linear, it is determined by its values on a basis for $\mathbb{R}^2$. Clearly,

$$T(1, 2) = (1, 2), \quad T(2, -1) = -(2, -1) = (-2, 1).$$

Therefore, if we let

$$\beta' = \left\{ \begin{pmatrix} 1 \\ 2 \end{pmatrix}, \begin{pmatrix} 2 \\ -1 \end{pmatrix} \right\},$$

then $\beta'$ is an ordered basis for $\mathbb{R}^2$, and the matrix representing $T$ in the basis $\beta'$ is

$$[T]_{\beta'} = \begin{pmatrix} 1 & 0 \\ 0 & -1 \end{pmatrix}$$

Let $\beta$ be the standard ordered basis for $\mathbb{R}^2$, and let $Q$ be the matrix that changes $\beta'$-coordinates into $\beta$-coordinates. Then

$$Q = \begin{pmatrix} 1 & 2 \\ 2 & -1 \end{pmatrix}$$

and $Q^{-1}[T]_\beta Q = [T]_{\beta'}$. We can solve this equation for $[T]_\beta$ to obtain $[T]_\beta = Q[T]_{\beta'}Q^{-1}$. Because

$$Q^{-1} = \frac{1}{5}\begin{pmatrix} 1 & 2 \\ 2 & -1 \end{pmatrix},$$

it can be verified that

$$[T]_\beta = \frac{1}{5}\begin{pmatrix} -3 & 4 \\ 4 & 3 \end{pmatrix}$$

Since $\beta$ is the standard ordered basis, it follows that $T$ is left-multiplication by $[T]_\beta$. Thus for any $(a, b) \in \mathbb{R}^2$, we have

$$T\begin{pmatrix} a \\ b \end{pmatrix} = \frac{1}{5}\begin{pmatrix} -3 & 4 \\ 4 & 3 \end{pmatrix}\begin{pmatrix} a \\ b \end{pmatrix} = \frac{1}{5}\begin{pmatrix} -3a + 4b \\ 4a + 3b \end{pmatrix}$$
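The same computation as code (my sketch, assuming NumPy), with sanity checks that points on the line $y = 2x$ are fixed and the normal direction flips:

```python
import numpy as np

T_bp = np.array([[1.0, 0.0], [0.0, -1.0]])  # [T]_{beta'}, beta' = {(1,2), (2,-1)}
Q    = np.array([[1.0, 2.0], [2.0, -1.0]])  # columns: beta' in standard coordinates

T_std = Q @ T_bp @ np.linalg.inv(Q)         # [T]_beta = Q [T]_{beta'} Q^{-1}
print(T_std)                                # [[-0.6 0.8] [ 0.8 0.6]] = (1/5)[[-3,4],[4,3]]

assert np.allclose(T_std @ [1.0, 2.0], [1.0, 2.0])    # y = 2x is fixed pointwise
assert np.allclose(T_std @ [2.0, -1.0], [-2.0, 1.0])  # normal direction flips
```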

Corollary

Let $A \in M_{n\times n}(F)$, and let $\gamma$ be an ordered basis for $F^n$. Then

$$[L_A]_\gamma = Q^{-1}AQ$$

where $Q$ is the $n \times n$ matrix whose $j$th column is the $j$th vector of $\gamma$.

Example 2.5.4

Let

$$A = \begin{pmatrix} 2 & 1 & 0 \\ 1 & 1 & 3 \\ 0 & -1 & 0 \end{pmatrix}$$

and let

$$\gamma = \left\{ \begin{pmatrix} 1 \\ 0 \\ 0 \end{pmatrix}, \begin{pmatrix} 2 \\ 1 \\ 0 \end{pmatrix}, \begin{pmatrix} 1 \\ 1 \\ 1 \end{pmatrix} \right\}$$

which is an ordered basis for $\mathbb{R}^3$. Let $Q$ be the $3 \times 3$ matrix whose $j$th column is the $j$th vector of $\gamma$. Then

$$Q = \begin{pmatrix} 1 & 2 & 1 \\ 0 & 1 & 1 \\ 0 & 0 & 1 \end{pmatrix} \quad\text{and}\quad Q^{-1} = \begin{pmatrix} 1 & -2 & 1 \\ 0 & 1 & -1 \\ 0 & 0 & 1 \end{pmatrix}$$

So by the preceding corollary,

$$[L_A]_\gamma = Q^{-1}AQ = \begin{pmatrix} 0 & -2 & -8 \\ 1 & 4 & 6 \\ 0 & -1 & -1 \end{pmatrix}$$

Definition

Let A,B be matrices in Mn×n(F). We say B is similar to A if there exists an invertible matrix Q such that

$$B = Q^{-1}AQ$$
