
5 Eigenvalues and Eigenvectors

5.1 Eigenvalues and Eigenvectors

Definition and Theorem

This chapter is concerned with the so-called diagonalization problem. For a given linear operator T on a finite-dimensional vector space V, we seek answers to the following questions.

  1. Does there exist an ordered basis β for V such that [T]β is a diagonal matrix?
  2. If such a basis exists, how can it be found?

Definitions: diagonalizable

A linear operator T on a finite-dimensional vector space V is called diagonalizable if there is an ordered basis β for V such that [T]β is a diagonal matrix. A square matrix A is called diagonalizable if LA is diagonalizable.

Definitions: eigenvalue & eigenvector

Let T be a linear operator on a vector space V. A nonzero vector v ∈ V is called an eigenvector of T if there exists a scalar λ such that T(v) = λv. The scalar λ is called the eigenvalue corresponding to the eigenvector v.

Let A be in Mn×n(F). A nonzero vector v ∈ F^n is called an eigenvector of A if v is an eigenvector of LA; that is, if Av = λv for some scalar λ. The scalar λ is called the eigenvalue of A corresponding to the eigenvector v.

Theorem 5.1

A linear operator T on a finite-dimensional vector space V is diagonalizable if and only if there exists an ordered basis β for V consisting of eigenvectors of T.

Furthermore, if T is diagonalizable, β = {v1, v2, ..., vn} is an ordered basis of eigenvectors of T, and D = [T]_β, then D is a diagonal matrix and D_jj is the eigenvalue corresponding to v_j for 1 ≤ j ≤ n.

Corollary

A matrix A ∈ Mn×n(F) is diagonalizable if and only if there exists an ordered basis for F^n consisting of eigenvectors of A. Furthermore, if {v1, v2, ..., vn} is an ordered basis for F^n consisting of eigenvectors of A and Q is the n×n matrix whose jth column is v_j for j = 1, 2, ..., n, then D = Q^{-1}AQ is a diagonal matrix such that D_jj is the eigenvalue of A corresponding to v_j. Hence A is diagonalizable if and only if it is similar to a diagonal matrix.

$$[T]_\beta = \begin{bmatrix} \lambda_1 & 0 & \cdots & 0 \\ 0 & \lambda_2 & \cdots & 0 \\ \vdots & \vdots & \ddots & \vdots \\ 0 & 0 & \cdots & \lambda_n \end{bmatrix}$$

Example 5.1.1

Let

$$A = \begin{bmatrix} 1 & 3 \\ 4 & 2 \end{bmatrix}, \quad v_1 = \begin{bmatrix} 1 \\ -1 \end{bmatrix}, \quad v_2 = \begin{bmatrix} 3 \\ 4 \end{bmatrix}.$$

Since

$$L_A(v_1) = Av_1 = \begin{bmatrix} 1 & 3 \\ 4 & 2 \end{bmatrix}\begin{bmatrix} 1 \\ -1 \end{bmatrix} = \begin{bmatrix} -2 \\ 2 \end{bmatrix} = -2\begin{bmatrix} 1 \\ -1 \end{bmatrix} = -2v_1,$$

v1 is an eigenvector of LA, and hence of A. Here λ1 = −2 is the eigenvalue corresponding to v1. Furthermore,

$$L_A(v_2) = Av_2 = \begin{bmatrix} 1 & 3 \\ 4 & 2 \end{bmatrix}\begin{bmatrix} 3 \\ 4 \end{bmatrix} = \begin{bmatrix} 15 \\ 20 \end{bmatrix} = 5\begin{bmatrix} 3 \\ 4 \end{bmatrix} = 5v_2,$$

and so v2 is an eigenvector of LA, and hence of A, with the corresponding eigenvalue λ2=5. Note that β={v1,v2} is an ordered basis for R2 consisting of eigenvectors of both A and LA, and therefore A and LA are diagonalizable. Moreover, by Theorem 5.1 and its corollary, if

$$Q = \begin{bmatrix} 1 & 3 \\ -1 & 4 \end{bmatrix},$$

then

$$Q^{-1}AQ = [L_A]_\beta = \begin{bmatrix} -2 & 0 \\ 0 & 5 \end{bmatrix}.$$
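As a quick numerical sanity check (a NumPy sketch, not part of the original text), we can verify the eigenvector equations and the diagonalization of Example 5.1.1:

```python
import numpy as np

# Matrix and eigenvectors from Example 5.1.1
A = np.array([[1.0, 3.0],
              [4.0, 2.0]])
v1 = np.array([1.0, -1.0])   # eigenvector for lambda_1 = -2
v2 = np.array([3.0, 4.0])    # eigenvector for lambda_2 = 5

# A v = lambda v for each eigenpair
assert np.allclose(A @ v1, -2 * v1)
assert np.allclose(A @ v2, 5 * v2)

# Q has the eigenvectors as columns, so Q^{-1} A Q is diagonal
Q = np.column_stack([v1, v2])
D = np.linalg.inv(Q) @ A @ Q
assert np.allclose(D, np.diag([-2.0, 5.0]))
```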

Example 5.1.2

Let T be the linear operator on R2 that rotates each vector in the plane through an angle of π/2.

It is clear geometrically that for any nonzero vector v, the vector T(v) does not lie on the line through 0 determined by v (this line is the geometric picture of an eigenvector, and no such line is preserved by this T); hence T(v) is not a multiple of v. Therefore T has no eigenvectors and, consequently, no eigenvalues. Thus there exist operators (and matrices) with no eigenvalues or eigenvectors. Of course, such operators and matrices are not diagonalizable.

Over the field of complex numbers, however, eigenvalues do exist. For the rotation matrix,

$$A = \begin{bmatrix} \cos\theta & -\sin\theta \\ \sin\theta & \cos\theta \end{bmatrix}, \quad p_A(t) = \det(A - tI) = t^2 - 2(\cos\theta)t + 1,$$

whose roots are λ1 = e^{iθ} = cosθ + i sinθ and λ2 = e^{−iθ} = cosθ − i sinθ.

For a basis β = {x1, x2} of corresponding eigenvectors,

$$[A]_\beta = \begin{bmatrix} e^{i\theta} & 0 \\ 0 & e^{-i\theta} \end{bmatrix}.$$

Let

$$x = c_1x_1 + c_2x_2, \qquad [x]_\beta = (c_1, c_2)^T.$$

Then

$$[Ax]_\beta = [A]_\beta[x]_\beta = (c_1e^{i\theta},\ c_2e^{-i\theta})^T.$$
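The complex diagonalization can likewise be checked numerically; this NumPy sketch (an illustration, with θ chosen arbitrarily) confirms that the rotation matrix has eigenvalues e^{±iθ}:

```python
import numpy as np

theta = 0.7  # arbitrary angle, an assumption for illustration
A = np.array([[np.cos(theta), -np.sin(theta)],
              [np.sin(theta),  np.cos(theta)]])

# Over C the eigenvalues are e^{i theta} and e^{-i theta}
eigvals = np.sort_complex(np.linalg.eigvals(A))
expected = np.sort_complex(np.array([np.exp(1j * theta), np.exp(-1j * theta)]))
assert np.allclose(eigvals, expected)
```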

Example 5.1.3

Let C^∞(R) denote the set of all functions f: R → R having derivatives of all orders (including the polynomial functions, the sine and cosine functions, the exponential functions, etc.). Clearly, C^∞(R) is a subspace of the vector space F(R, R) of all functions from R to R as defined in Section 1.2.

Let T: C^∞(R) → C^∞(R) be the function defined by T(f) = f′, the derivative of f. It is easily verified that T is a linear operator on C^∞(R). We determine the eigenvalues and eigenvectors of T.

Suppose f is an eigenvector of T with eigenvalue λ. Then f′ = λf. This is a first-order differential equation whose solutions are of the form f(t) = ce^{λt} for some constant c. Consequently, every real number λ is an eigenvalue of T, and it corresponds to eigenvectors of the form ce^{λt} with c ≠ 0. Note that for λ = 0, the eigenvectors are the nonzero constant functions.

(Here, the function f plays the role of the eigenvector v in Av = λv.)

Theorem 5.2

Let AMn×n(F). A scalar λ is an eigenvalue of A if and only if

$$\det(A - \lambda I_n) = 0.$$

Definition: characteristic polynomial (For matrix)

The polynomial f(t) = det(A − tI_n) is called the characteristic polynomial of A.

Definition: characteristic polynomial (For linear transformation)

Let T be a linear operator on an n-dimensional vector space V with ordered basis β. We define the characteristic polynomial f(t) of T to be the characteristic polynomial of A=[T]β. That is,

$$f(t) = \det(A - tI_n).$$

Example 5.1.4

To find the eigenvalues of

$$A = \begin{bmatrix} 1 & 1 \\ 4 & 1 \end{bmatrix} \in M_{2\times 2}(\mathbb{R}),$$

we compute its characteristic polynomial:

$$\det(A - tI_2) = \det\begin{bmatrix} 1-t & 1 \\ 4 & 1-t \end{bmatrix} = (1-t)^2 - 4 = t^2 - 2t - 3 = (t-3)(t+1).$$

Hence, the only eigenvalues of A are 3 and -1.
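This computation is easy to cross-check numerically; a NumPy sketch (not part of the text) confirming that the roots of the characteristic polynomial are exactly the eigenvalues:

```python
import numpy as np

A = np.array([[1.0, 1.0],
              [4.0, 1.0]])

# The roots of det(A - tI) are the eigenvalues 3 and -1
eigvals = np.sort(np.linalg.eigvals(A).real)
assert np.allclose(eigvals, [-1.0, 3.0])

# Each eigenvalue makes A - tI singular
for t in (-1.0, 3.0):
    assert abs(np.linalg.det(A - t * np.eye(2))) < 1e-10
```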

Example 5.1.5

Let T be the linear operator on P2(R) defined by

$$T(f(x)) = f(x) + (x+1)f'(x),$$

let β be the standard ordered basis for P2(R), and let A=[T]β. Then

$$A = \begin{bmatrix} 1 & 1 & 0 \\ 0 & 2 & 2 \\ 0 & 0 & 3 \end{bmatrix}.$$

(Explanation:

$$f(x) = a + bx + cx^2 \implies f'(x) = b + 2cx,$$
$$T(f(x)) = (a + bx + cx^2) + (x+1)(b + 2cx) = a + bx + cx^2 + bx + 2cx^2 + b + 2cx = (a+b) + (2b+2c)x + 3cx^2.$$

)

The characteristic polynomial of T is

$$\det(A - tI_3) = \begin{vmatrix} 1-t & 1 & 0 \\ 0 & 2-t & 2 \\ 0 & 0 & 3-t \end{vmatrix} = (1-t)(2-t)(3-t) = -(t-1)(t-2)(t-3).$$

Hence λ is an eigenvalue of T if and only if λ=1,2, or 3.

Theorem 5.3

Let AMn×n(F).

  • The characteristic polynomial of A is a polynomial of degree n with leading coefficient (−1)^n.
  • A has at most n distinct eigenvalues.

Theorem 5.4

Let T be a linear operator on a vector space V, and let λ be an eigenvalue of T. A vector v ∈ V is an eigenvector of T corresponding to λ if and only if v ≠ 0 and v ∈ N(T − λI).

Example 5.1.6 [core]

To find all eigenvectors of the matrix

$$A = \begin{bmatrix} 1 & 1 \\ 4 & 1 \end{bmatrix},$$

recall that A has two eigenvalues, λ1 = 3 and λ2 = −1. We begin with the eigenvectors corresponding to λ1 = 3. Let

$$B_1 = A - 3I = \begin{bmatrix} 1-3 & 1 \\ 4 & 1-3 \end{bmatrix} = \begin{bmatrix} -2 & 1 \\ 4 & -2 \end{bmatrix}.$$

A vector x = (x1, x2)^T ∈ R² is an eigenvector corresponding to λ1 = 3 iff x ≠ 0 and x ∈ N(L_{B1}); i.e.,

$$B_1x = \begin{bmatrix} -2 & 1 \\ 4 & -2 \end{bmatrix}\begin{bmatrix} x_1 \\ x_2 \end{bmatrix} = \begin{bmatrix} -2x_1 + x_2 \\ 4x_1 - 2x_2 \end{bmatrix} = 0.$$

Clearly the set of all solutions to this equation is

$$\left\{ t\begin{bmatrix} 1 \\ 2 \end{bmatrix} : t \in \mathbb{R} \right\}.$$

Now, suppose x is an eigenvector corresponding to λ2 = −1. Let

$$B_2 = A - (-1)I = A + I = \begin{bmatrix} 1+1 & 1 \\ 4 & 1+1 \end{bmatrix} = \begin{bmatrix} 2 & 1 \\ 4 & 2 \end{bmatrix}.$$

Then x ∈ N(L_{B2}) iff

$$B_2x = 0 \iff \begin{cases} 2x_1 + x_2 = 0 \\ 4x_1 + 2x_2 = 0. \end{cases}$$

Hence

$$N(L_{B_2}) = \left\{ t\begin{bmatrix} 1 \\ -2 \end{bmatrix} : t \in \mathbb{R} \right\}.$$

Thus x is an eigenvector corresponding to λ2 = −1 iff

$$x = t(1, -2)^T \quad \text{for some } t \neq 0.$$

Observe that

$$\{(1, 2),\ (1, -2)\}$$

is a basis for R2 consisting of eigenvectors of A. Thus LA, and hence A, is diagonalizable.

Suppose that β is a basis for F^n consisting of eigenvectors of A. The corollary to Theorem 2.23 ensures that if Q is the n×n matrix whose columns are the vectors in β, then Q^{-1}AQ is a diagonal matrix. For example, in Example 5.1.6, if

$$Q = \begin{bmatrix} 1 & 1 \\ 2 & -2 \end{bmatrix}, \quad \text{then} \quad D = Q^{-1}AQ = \begin{bmatrix} 3 & 0 \\ 0 & -1 \end{bmatrix}.$$

The diagonal entries of this matrix are the eigenvalues of A that correspond to the respective columns of Q.
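This diagonalization can be confirmed directly; a NumPy sketch (not part of the text) using the eigenvectors found in Example 5.1.6:

```python
import numpy as np

A = np.array([[1.0, 1.0],
              [4.0, 1.0]])
# Columns of Q are the eigenvectors (1, 2) and (1, -2) from Example 5.1.6
Q = np.array([[1.0,  1.0],
              [2.0, -2.0]])
D = np.linalg.inv(Q) @ A @ Q
assert np.allclose(D, np.diag([3.0, -1.0]))
```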

To find the eigenvectors of a linear operator T on an n-dimensional vector space V, select an ordered basis β for V and let A = [T]_β. Then for v ∈ V, φ_β(v) = [v]_β, the coordinate vector of v relative to β. We show that v ∈ V is an eigenvector of T corresponding to λ if and only if φ_β(v) is an eigenvector of A corresponding to λ. Suppose v is an eigenvector of T with eigenvalue λ, so that T(v) = λv. Then

$$A\varphi_\beta(v) = L_A(\varphi_\beta(v)) = \varphi_\beta(T(v)) = \varphi_\beta(\lambda v) = \lambda\varphi_\beta(v).$$

Now φ_β(v) ≠ 0, since φ_β is an isomorphism; hence φ_β(v) is an eigenvector of A. This argument is reversible, and so we can also establish that if φ_β(v) is an eigenvector of A corresponding to λ, then v is an eigenvector of T corresponding to λ.

An equivalent formulation of the result discussed in the preceding paragraph is that for an eigenvalue λ of A (and hence of T), a vector y ∈ F^n is an eigenvector of A corresponding to λ if and only if φ_β^{-1}(y) is an eigenvector of T corresponding to λ.

We can choose to solve the problem using whichever path is easier.

Example 5.1.7

Let T be the linear operator on P2(R) defined in Example 5.1.5, i.e., T(f(x))=f(x)+(x+1)f(x), and let β be the standard ordered basis for P2(R). Recall that T has eigenvalues 1,2,3 and that

$$A = [T]_\beta = \begin{pmatrix} 1 & 1 & 0 \\ 0 & 2 & 2 \\ 0 & 0 & 3 \end{pmatrix}.$$

We consider each eigenvalue separately. Let λ1=1, and define

$$B_1 = A - \lambda_1 I = \begin{pmatrix} 0 & 1 & 0 \\ 0 & 1 & 2 \\ 0 & 0 & 2 \end{pmatrix}.$$

Then

$$x = \begin{pmatrix} x_1 \\ x_2 \\ x_3 \end{pmatrix} \in \mathbb{R}^3$$

is an eigenvector corresponding to λ1=1 if and only if x0 and xN(LB1); that is, x is a nonzero solution to the system

$$\begin{cases} x_2 = 0 \\ x_2 + 2x_3 = 0 \\ 2x_3 = 0. \end{cases}$$

Notice that this system has three unknowns, x1,x2, and x3, but one of these, x1, does not actually appear in the system. Since the values of x1 do not affect the system, we assign x1 a parametric value, say x1=a, and solve the system for x2 and x3. Clearly, x2=x3=0, and so the eigenvectors of A corresponding to λ1=1 are of the form

$$a\begin{pmatrix} 1 \\ 0 \\ 0 \end{pmatrix} = ae_1$$

for a0. Consequently, the eigenvectors of T corresponding to λ1=1 are of the form

$$\varphi_\beta^{-1}(ae_1) = a\,\varphi_\beta^{-1}(e_1) = a \cdot 1 = a$$

for any a0. Hence the nonzero constant polynomials are the eigenvectors of T corresponding to λ1=1.

Next let λ2=2, and define

$$B_2 = A - \lambda_2 I = \begin{pmatrix} -1 & 1 & 0 \\ 0 & 0 & 2 \\ 0 & 0 & 1 \end{pmatrix}.$$

It is easily verified that

$$N(L_{B_2}) = \left\{ a\begin{pmatrix} 1 \\ 1 \\ 0 \end{pmatrix} : a \in \mathbb{R} \right\},$$

and hence the eigenvectors of T corresponding to λ2=2 are of the form

$$\varphi_\beta^{-1}\left(a\begin{pmatrix} 1 \\ 1 \\ 0 \end{pmatrix}\right) = a\,\varphi_\beta^{-1}(e_1 + e_2) = a(1 + x)$$

for a0. Finally, consider λ3=3 and

$$B_3 = A - \lambda_3 I = \begin{pmatrix} -2 & 1 & 0 \\ 0 & -1 & 2 \\ 0 & 0 & 0 \end{pmatrix}.$$

Since

$$N(L_{B_3}) = \left\{ a\begin{pmatrix} 1 \\ 2 \\ 1 \end{pmatrix} : a \in \mathbb{R} \right\},$$

the eigenvectors of T corresponding to λ3=3 are of the form

$$\varphi_\beta^{-1}\left(a\begin{pmatrix} 1 \\ 2 \\ 1 \end{pmatrix}\right) = a\,\varphi_\beta^{-1}(e_1 + 2e_2 + e_3) = a(1 + 2x + x^2)$$

for a0.

For each eigenvalue, select the corresponding eigenvector with a=1 in the preceding descriptions to obtain γ={1,1+x,1+2x+x2}, which is an ordered basis for P2(R) consisting of eigenvectors of T . Thus T is diagonalizable, and

$$[T]_\gamma = \begin{pmatrix} 1 & 0 & 0 \\ 0 & 2 & 0 \\ 0 & 0 & 3 \end{pmatrix}.$$
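Working in coordinates, the whole example reduces to a matrix computation; a NumPy sketch (not part of the text) verifying that the coordinate vectors of 1, 1 + x, and 1 + 2x + x² diagonalize A = [T]_β:

```python
import numpy as np

# [T]_beta for T(f(x)) = f(x) + (x+1)f'(x) on P_2(R), beta = {1, x, x^2}
A = np.array([[1.0, 1.0, 0.0],
              [0.0, 2.0, 2.0],
              [0.0, 0.0, 3.0]])
# Coordinate vectors of the eigenpolynomials 1, 1 + x, 1 + 2x + x^2
Q = np.array([[1.0, 1.0, 1.0],
              [0.0, 1.0, 2.0],
              [0.0, 0.0, 1.0]])
D = np.linalg.inv(Q) @ A @ Q
assert np.allclose(D, np.diag([1.0, 2.0, 3.0]))
```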

5.2 Diagonalizability

Theorem 5.5 and Corollary

What is still needed is a simple test to determine whether an operator or a matrix can be diagonalized, as well as a method for actually finding a basis of eigenvectors.

Theorem 5.5

Let T be a linear operator on a vector space, and let λ1, λ2, …, λk be distinct eigenvalues of T. For each i = 1, 2, …, k, let Si be a finite set of eigenvectors of T corresponding to λi. If each Si is linearly independent, then S1 ∪ S2 ∪ ⋯ ∪ Sk is also linearly independent.

Corollary

Let T be a linear operator on an n-dimensional vector space V. If T has n distinct eigenvalues, then T is diagonalizable.

Example 5.2.1

Let

$$A = \begin{bmatrix} 1 & 1 \\ 1 & 1 \end{bmatrix} \in M_{2\times 2}(\mathbb{R}).$$

The characteristic polynomial of A (and hence of LA) is t(t − 2), giving two distinct eigenvalues λ1 = 0 and λ2 = 2. Since LA is a linear operator on the 2-dimensional vector space R², the corollary to Theorem 5.5 shows that A is diagonalizable.

split

Definition: split over

A polynomial f(t) in P(F) splits over F if there are scalars c,a1,...,an (not necessarily distinct) in F such that

$$f(t) = c(t - a_1)(t - a_2)\cdots(t - a_n).$$

For example, t² − 1 = (t + 1)(t − 1) splits over R, but (t² + 1)(t − 2) does not split over R because t² + 1 cannot be factored into a product of linear factors. However, (t² + 1)(t − 2) does split over C because it factors into the product (t + i)(t − i)(t − 2).

If f(t) is the characteristic polynomial of a linear operator or a matrix over a field F, then the statement that f(t) splits is understood to mean that it splits over F.

Theorem 5.6

The characteristic polynomial of any diagonalizable linear operator splits.

From this theorem, it is clear that if T is a diagonalizable linear operator on an n-dimensional vector space that fails to have distinct eigenvalues, then the characteristic polynomial of T must have repeated zeros.

The converse of Theorem 5.6 is false; that is, the characteristic polynomial of T may split, but T need not be diagonalizable. (See Example 5.2.3, which follows.) The following concept helps us determine when an operator whose characteristic polynomial splits is diagonalizable.

Definition: multiplicity

Let λ be an eigenvalue of a linear operator or matrix with characteristic polynomial f(t). The multiplicity (sometimes called the algebraic multiplicity) of λ is the largest positive integer k for which (tλ)k is a factor of f(t).

Example 5.2.2

Let

$$A = \begin{bmatrix} 3 & 1 & 0 \\ 0 & 3 & 4 \\ 0 & 0 & 4 \end{bmatrix},$$

which has characteristic polynomial f(t) = −(t − 3)²(t − 4). Hence λ = 3 is an eigenvalue of A with multiplicity 2, and λ = 4 is an eigenvalue of A with multiplicity 1.

Remark

Let A be a square matrix. A (complex) number λ is an eigenvalue of A if and only if λ is a root of the characteristic equation of A.

The multiplicity of λ being a root of the characteristic equation is called the algebraic multiplicity of λ, denoted ma.

The dimension of the eigenspace associated to eigenvalue λ, that is, the maximum number of linearly independent eigenvectors associated with eigenvalue λ, is called the geometric multiplicity of λ, denoted mg. It can be shown that for each eigenvalue λ, we have

$$1 \le m_g \le m_a,$$

where mg and ma are the geometric and algebraic multiplicities of λ respectively.

Therefore A is diagonalizable if and only if the algebraic multiplicity and the geometric multiplicity are equal for every eigenvalue of A.

Since the eigenvectors of T corresponding to the eigenvalue λ are precisely the nonzero vectors in the null space of T − λI, we are led naturally to the study of this set.
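The comparison of the two multiplicities can be carried out mechanically; in the following NumPy sketch the function name and tolerance are my own choices for illustration:

```python
import numpy as np

def multiplicities(A, lam, tol=1e-8):
    """Return (algebraic, geometric) multiplicity of eigenvalue lam of A."""
    eigvals = np.linalg.eigvals(A)
    ma = int(np.sum(np.abs(eigvals - lam) < tol))           # algebraic multiplicity
    n = A.shape[0]
    mg = n - np.linalg.matrix_rank(A - lam * np.eye(n), tol=tol)  # nullity = geometric
    return ma, mg

# A diagonalizable matrix: ma == mg for every eigenvalue
assert multiplicities(np.diag([3.0, 3.0, 4.0]), 3.0) == (2, 2)

# A defective (non-diagonalizable) matrix: mg < ma
J = np.array([[3.0, 1.0],
              [0.0, 3.0]])
assert multiplicities(J, 3.0) == (2, 1)
```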

eigenspace

Definition: eigenspace

Let T be a linear operator on a vector space V, and let λ be an eigenvalue of T. Define E_λ = {x ∈ V : T(x) = λx} = N(T − λI_V). The set E_λ is called the eigenspace of T corresponding to the eigenvalue λ. Analogously, we define the eigenspace of a square matrix A corresponding to the eigenvalue λ to be the eigenspace of L_A corresponding to λ.

Theorem 5.7

Let T be a linear operator on a finite-dimensional vector space V, and let λ be an eigenvalue of T having multiplicity m. Then

$$1 \le \dim(E_\lambda) \le m,$$

where Eλ is the eigenspace corresponding to λ.

(that is, 1 ≤ m_g ≤ m_a)

Example 5.2.3

Let T be the linear operator on P2(R) defined by T(f(x)) = f′(x). The matrix representation of T with respect to the standard ordered basis β for P2(R) is

$$[T]_\beta = \begin{bmatrix} 0 & 1 & 0 \\ 0 & 0 & 2 \\ 0 & 0 & 0 \end{bmatrix}.$$

Consequently, the characteristic polynomial of T is

$$\det([T]_\beta - tI) = \det\begin{bmatrix} -t & 1 & 0 \\ 0 & -t & 2 \\ 0 & 0 & -t \end{bmatrix} = -t^3.$$

Thus T has only one eigenvalue (λ = 0), with multiplicity 3. Solving T(f(x)) = f′(x) = 0 shows that Eλ = N(T − λI) = N(T) is the subspace of P2(R) consisting of the constant polynomials. So {1} is a basis for Eλ, and therefore dim(Eλ) = 1. Consequently, there is no basis for P2(R) consisting of eigenvectors of T, and therefore T is not diagonalizable.

A matrix is diagonalizable if and only if the algebraic multiplicity and geometric multiplicity are equal for all eigenvalues.
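For the derivative operator of Example 5.2.3 this failure is visible numerically; a NumPy sketch (not part of the text):

```python
import numpy as np

# [T]_beta for the derivative operator on P_2(R) (Example 5.2.3)
N = np.array([[0.0, 1.0, 0.0],
              [0.0, 0.0, 2.0],
              [0.0, 0.0, 0.0]])

# The only eigenvalue is 0, with algebraic multiplicity 3 ...
assert np.allclose(np.linalg.eigvals(N), 0.0)

# ... but dim E_0 = nullity(N) = 3 - rank(N) = 1 < 3, so T is not diagonalizable
assert 3 - np.linalg.matrix_rank(N) == 1
```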

Example 5.2.4

Let T be the linear operator on R3 defined by

$$T\begin{bmatrix} a_1 \\ a_2 \\ a_3 \end{bmatrix} = \begin{bmatrix} 4a_1 + a_3 \\ 2a_1 + 3a_2 + 2a_3 \\ a_1 + 4a_3 \end{bmatrix}.$$

We determine the eigenspace of T corresponding to each eigenvalue. Let β be the standard ordered basis for R3. Then

$$[T]_\beta = \begin{bmatrix} 4 & 0 & 1 \\ 2 & 3 & 2 \\ 1 & 0 & 4 \end{bmatrix},$$

and hence the characteristic polynomial of T is

$$\det([T]_\beta - tI) = \det\begin{bmatrix} 4-t & 0 & 1 \\ 2 & 3-t & 2 \\ 1 & 0 & 4-t \end{bmatrix} = -(t-5)(t-3)^2.$$

So the eigenvalues of T are λ1=5 and λ2=3 with multiplicities 1 and 2, respectively.

Since

$$E_{\lambda_1} = N(T - \lambda_1 I) = \{x \in \mathbb{R}^3 : (T - 5I)(x) = 0\},$$

Eλ1 is the solution space of the system of linear equations

$$\begin{cases} -x_1 + x_3 = 0 \\ 2x_1 - 2x_2 + 2x_3 = 0 \\ x_1 - x_3 = 0, \end{cases}$$

which means Eλ1 is spanned by the basis {(1, 2, 1)}. Hence dim(Eλ1) = 1.

Similarly, Eλ2=N(T3I) is the solution space of the system

$$\begin{cases} x_1 + x_3 = 0 \\ 2x_1 + 2x_3 = 0 \\ x_1 + x_3 = 0. \end{cases}$$

Since the unknown x2 does not appear in the system, we assign it an arbitrary parameter s, and solve the system for x1 and x3, introducing another parameter t. The general solution to the system is

$$x = s\begin{bmatrix} 0 \\ 1 \\ 0 \end{bmatrix} + t\begin{bmatrix} -1 \\ 0 \\ 1 \end{bmatrix}, \quad \text{for } s, t \in \mathbb{R}.$$

So a basis for Eλ2 is {(0, 1, 0), (−1, 0, 1)} and dim(Eλ2) = 2.

In this case, the multiplicity of each eigenvalue λi is equal to the dimension of the corresponding eigenspace Eλi. The union of the two bases just derived, namely,

$$\{(1, 2, 1),\ (0, 1, 0),\ (-1, 0, 1)\},$$

is linearly independent and hence is a basis for R3 consisting of eigenvectors of T. Consequently, T is diagonalizable.
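A NumPy sketch (not part of the text) checking both eigenspace dimensions and the resulting diagonalization of this example:

```python
import numpy as np

A = np.array([[4.0, 0.0, 1.0],
              [2.0, 3.0, 2.0],
              [1.0, 0.0, 4.0]])

# Eigenspace dimensions match the multiplicities: 1 for lambda=5, 2 for lambda=3
assert 3 - np.linalg.matrix_rank(A - 5 * np.eye(3)) == 1
assert 3 - np.linalg.matrix_rank(A - 3 * np.eye(3)) == 2

# The union of the eigenspace bases diagonalizes A
Q = np.column_stack([[1.0, 2.0, 1.0],    # basis vector of E_5
                     [0.0, 1.0, 0.0],    # basis vectors of E_3 ...
                     [-1.0, 0.0, 1.0]])
assert np.allclose(np.linalg.inv(Q) @ A @ Q, np.diag([5.0, 3.0, 3.0]))
```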


Lemma

Let T be a linear operator, and let λ1, λ2, …, λk be distinct eigenvalues of T. For each i = 1, 2, …, k, let v_i ∈ E_{λ_i}, the eigenspace corresponding to λi. If

$$v_1 + v_2 + \cdots + v_k = 0,$$

then v_i = 0 for all i.

Theorem 5.5 says that eigenvectors corresponding to distinct eigenvalues are linearly independent, so the only possibility is that all the v_i are zero.

Theorem 5.8

Let T be a linear operator on a vector space V, and let λ1, λ2, …, λk be distinct eigenvalues of T. For each i = 1, …, k, let Si be a finite linearly independent subset of the eigenspace Eλi. Then S = S1 ∪ S2 ∪ ⋯ ∪ Sk is a linearly independent subset of V.

Theorem 5.9

Let T be a linear operator on an n-dimensional vector space V such that the characteristic polynomial of T splits. Let λ1,λ2,,λk be the distinct eigenvalues of T. Then

  • T is diagonalizable if and only if the algebraic multiplicity of λi equals dim(Eλi) for all i (i.e. algebraic multiplicity = geometric multiplicity).
  • If T is diagonalizable and βi is an ordered basis for Eλi for each i, then
β = β1 ∪ β2 ∪ ⋯ ∪ βk

is an ordered basis for V consisting of eigenvectors of T.

Test for Diagonalizability

Let T be a linear operator on an n-dimensional vector space V. Then T is diagonalizable if and only if

  1. The characteristic polynomial of T splits.
  2. For each eigenvalue λ of T, the multiplicity of λ equals the corresponding geometrical multiplicity:
multiplicity(λ) = nullity(T − λI) = n − rank(T − λI).

Example 5.2.5

We test the matrix

$$A = \begin{bmatrix} 3 & 1 & 0 \\ 0 & 3 & 0 \\ 0 & 0 & 4 \end{bmatrix} \in M_{3\times 3}(\mathbb{R})$$

for diagonalizability.

The characteristic polynomial of A is

$$\det(A - tI) = -(t-4)(t-3)^2,$$

which splits and so condition 1 of the test for diagonalization is satisfied. The eigenvalues are λ1=4 and λ2=3 with multiplicities 1 and 2, respectively.

Since λ1 has multiplicity 1, condition 2 is satisfied for λ1. For λ2:

$$A - \lambda_2 I = A - 3I = \begin{bmatrix} 0 & 1 & 0 \\ 0 & 0 & 0 \\ 0 & 0 & 1 \end{bmatrix};$$

the matrix A − 3I has rank 2, so the nullity is 3 − 2 = 1, which is not equal to the multiplicity 2 of λ2. Thus condition 2 fails for λ2, and A is not diagonalizable.

Example 5.2.6 [core]

Let T be the linear operator on P2(R) defined by

$$T(f(x)) = f(1) + f'(0)x + (f'(0) + f''(0))x^2.$$

With α={1,x,x2} as the standard ordered basis for P2(R), let B=[T]α. Then

$$B = \begin{bmatrix} 1 & 1 & 1 \\ 0 & 1 & 0 \\ 0 & 1 & 2 \end{bmatrix}.$$

The characteristic polynomial of B, and hence of T, is

$$-(t-1)^2(t-2),$$

which splits. Hence, condition 1 of the test for diagonalizability is satisfied.

Also, B has eigenvalues λ1=1 and λ2=2 with multiplicities 2 and 1, respectively.

Condition 2 is satisfied for λ2 because it has multiplicity 1.

Checking condition 2 for λ1=1:

$$\operatorname{rank}(B - \lambda_1 I) = \operatorname{rank}\begin{pmatrix} 0 & 1 & 1 \\ 0 & 0 & 0 \\ 0 & 1 & 1 \end{pmatrix} = 1,$$

so the nullity of B − I is 3 − 1 = 2, which equals the multiplicity of λ1. Therefore T is diagonalizable.

We now find an ordered basis γ for R3 of eigenvectors of B. For each eigenvalue:

  • The eigenspace corresponding to λ1=1 is
$$E_{\lambda_1} = \{x \in \mathbb{R}^3 : (B - I)x = 0\}$$

which is the solution space for the system

x2+x3=0,

and has

$$\gamma_1 = \left\{ \begin{bmatrix} 1 \\ 0 \\ 0 \end{bmatrix}, \begin{bmatrix} 0 \\ 1 \\ -1 \end{bmatrix} \right\}$$

as a basis.

  • The eigenspace corresponding to λ2=2 is
$$E_{\lambda_2} = \{x \in \mathbb{R}^3 : (B - 2I)x = 0\}$$

which is the solution space for the system

$$\begin{cases} -x_1 + x_2 + x_3 = 0 \\ x_2 = 0, \end{cases}$$

and has

$$\gamma_2 = \left\{ \begin{bmatrix} 1 \\ 0 \\ 1 \end{bmatrix} \right\}$$

as a basis.

Let

$$\gamma = \gamma_1 \cup \gamma_2 = \left\{ \begin{bmatrix} 1 \\ 0 \\ 0 \end{bmatrix}, \begin{bmatrix} 0 \\ 1 \\ -1 \end{bmatrix}, \begin{bmatrix} 1 \\ 0 \\ 1 \end{bmatrix} \right\}.$$

Then γ is an ordered basis for R³ consisting of eigenvectors of B.

Finally, observe that the vectors in γ are the coordinate vectors relative to α of the vectors in the set

$$\beta = \{1,\ x - x^2,\ 1 + x^2\},$$

which is an ordered basis for P2(R) consisting of eigenvectors of T. Thus

$$[T]_\beta = \begin{bmatrix} 1 & 0 & 0 \\ 0 & 1 & 0 \\ 0 & 0 & 2 \end{bmatrix}.$$

Example 5.2.7

Let

$$A = \begin{bmatrix} 0 & -2 \\ 1 & 3 \end{bmatrix}.$$

We show that A is diagonalizable and find a 2×2 matrix Q such that Q^{-1}AQ is diagonal. We then show how to use this result to compute A^n for any positive integer n.

First observe the characteristic polynomial of A is (t1)(t2), so A has two distinct eigenvalues, λ1=1 and λ2=2.

By applying the corollary to Theorem 5.5 to the operator LA, A is diagonalizable. Moreover,

$$\gamma_1 = \{(-2, 1)\}, \quad \gamma_2 = \{(-1, 1)\}$$

are bases for the eigenspaces Eλ1 and Eλ2, respectively. Therefore,

$$\gamma = \gamma_1 \cup \gamma_2 = \{(-2, 1),\ (-1, 1)\}$$

is an ordered basis for R² consisting of eigenvectors of A. Let

$$Q = \begin{bmatrix} -2 & -1 \\ 1 & 1 \end{bmatrix}$$

be the matrix whose columns are the vectors in γ. Then, by the corollary to Theorem 2.23,

$$D = Q^{-1}AQ = \begin{bmatrix} 1 & 0 \\ 0 & 2 \end{bmatrix}.$$

To compute An for any positive integer n,

$$A^n = (QDQ^{-1})^n = (QDQ^{-1})(QDQ^{-1})\cdots(QDQ^{-1}) = QD^nQ^{-1} = Q\begin{bmatrix} 1 & 0 \\ 0 & 2^n \end{bmatrix}Q^{-1} = \begin{bmatrix} -2 & -1 \\ 1 & 1 \end{bmatrix}\begin{bmatrix} 1 & 0 \\ 0 & 2^n \end{bmatrix}\begin{bmatrix} -1 & -1 \\ 1 & 2 \end{bmatrix} = \begin{bmatrix} 2 - 2^n & 2 - 2^{n+1} \\ -1 + 2^n & -1 + 2^{n+1} \end{bmatrix}.$$
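The closed form for A^n can be spot-checked numerically; a NumPy sketch (the exponent n = 10 is an arbitrary choice for illustration):

```python
import numpy as np

A = np.array([[0.0, -2.0],
              [1.0,  3.0]])
Q = np.array([[-2.0, -1.0],
              [ 1.0,  1.0]])

n = 10
# A^n via the diagonalization A = Q D Q^{-1}
An = Q @ np.diag([1.0, 2.0 ** n]) @ np.linalg.inv(Q)

# Matches the closed form [[2-2^n, 2-2^{n+1}], [-1+2^n, -1+2^{n+1}]]
closed = np.array([[ 2 - 2.0 ** n,  2 - 2.0 ** (n + 1)],
                   [-1 + 2.0 ** n, -1 + 2.0 ** (n + 1)]])
assert np.allclose(An, closed)

# ... and agrees with repeated matrix multiplication
assert np.allclose(An, np.linalg.matrix_power(A, n))
```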

Systems of Differential Equations

Exercise 5.2.8

Consider the system of differential equations

$$\begin{cases} x_1' = 3x_1 + x_2 + x_3 \\ x_2' = 2x_1 + 4x_2 + 2x_3 \\ x_3' = -x_1 - x_2 + x_3, \end{cases}$$

where each xi = xi(t) is a differentiable real-valued function of the real variable t. Clearly, this system has a solution, namely, the one in which each xi(t) is the zero function. We determine all of the solutions to this system.

Let x:RR3 be

$$x(t) = \begin{bmatrix} x_1(t) \\ x_2(t) \\ x_3(t) \end{bmatrix},$$

and let x′(t) denote the derivative of x.

Let

$$A = \begin{bmatrix} 3 & 1 & 1 \\ 2 & 4 & 2 \\ -1 & -1 & 1 \end{bmatrix}$$

be the coefficient matrix of the given system, so that we can rewrite the system as the matrix equation x′ = Ax.

It can be verified that for

$$Q = \begin{bmatrix} 1 & 0 & 1 \\ 0 & 1 & 2 \\ -1 & -1 & -1 \end{bmatrix} \quad \text{and} \quad D = \begin{bmatrix} 2 & 0 & 0 \\ 0 & 2 & 0 \\ 0 & 0 & 4 \end{bmatrix},$$

we have Q^{-1}AQ = D. Substitute A = QDQ^{-1} into x′ = Ax to obtain

$$x' = QDQ^{-1}x,$$

or equivalently,

$$Q^{-1}x' = DQ^{-1}x.$$

Define y(t) = Q^{-1}x(t), so that y′ = Q^{-1}x′. Hence,

$$y' = Dy,$$

which is a system of decoupled equations

$$\begin{cases} y_1' = 2y_1 \\ y_2' = 2y_2 \\ y_3' = 4y_3. \end{cases}$$

Solutions for each yi are

$$y_1(t) = c_1e^{2t}, \quad y_2(t) = c_2e^{2t}, \quad y_3(t) = c_3e^{4t},$$

with arbitrary constants c1,c2,c3.

Back-substituting,

$$x(t) = Qy(t) = \begin{bmatrix} 1 & 0 & 1 \\ 0 & 1 & 2 \\ -1 & -1 & -1 \end{bmatrix}\begin{bmatrix} c_1e^{2t} \\ c_2e^{2t} \\ c_3e^{4t} \end{bmatrix} = \begin{bmatrix} c_1e^{2t} + c_3e^{4t} \\ c_2e^{2t} + 2c_3e^{4t} \\ -(c_1 + c_2)e^{2t} - c_3e^{4t} \end{bmatrix}.$$

Thus the general solution is

$$x(t) = e^{2t}z_1 + e^{4t}z_2 = e^{2t}\left[c_1\begin{bmatrix} 1 \\ 0 \\ -1 \end{bmatrix} + c_2\begin{bmatrix} 0 \\ 1 \\ -1 \end{bmatrix}\right] + e^{4t}c_3\begin{bmatrix} 1 \\ 2 \\ -1 \end{bmatrix},$$

where z1 ∈ Eλ1 and z2 ∈ Eλ2, with λ1 = 2 and λ2 = 4.
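The general solution can be verified directly: since Az1 = 2z1 and Az2 = 4z2, differentiating x(t) term by term gives Ax(t). A NumPy sketch (the constants c1, c2, c3 are arbitrary choices for illustration):

```python
import numpy as np

A = np.array([[ 3.0,  1.0,  1.0],
              [ 2.0,  4.0,  2.0],
              [-1.0, -1.0,  1.0]])

c1, c2, c3 = 1.0, -2.0, 0.5   # arbitrary constants, for illustration
z1 = c1 * np.array([1.0, 0.0, -1.0]) + c2 * np.array([0.0, 1.0, -1.0])  # in E_2
z2 = c3 * np.array([1.0, 2.0, -1.0])                                    # in E_4

def x(t):
    return np.exp(2 * t) * z1 + np.exp(4 * t) * z2

def x_prime(t):  # exact derivative of x
    return 2 * np.exp(2 * t) * z1 + 4 * np.exp(4 * t) * z2

# x solves the system: x'(t) = A x(t) at several sample times
for t in (0.0, 0.3, 1.0):
    assert np.allclose(x_prime(t), A @ x(t))
```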

This concludes the main part of the example.

Here is a more general conclusion.

Application for ODE

Let n×n matrix A=(aij) be the coefficient matrix of a system of differential equations

$$x_i' = \sum_{j=1}^{n} a_{ij}x_j, \quad i = 1, \dots, n.$$

Suppose A is diagonalizable and the distinct eigenvalues of A are λ1,λ2,,λk.

Prove that a differentiable function x:RRn is a solution to the system if and only if

$$x(t) = e^{\lambda_1 t}z_1 + e^{\lambda_2 t}z_2 + \cdots + e^{\lambda_k t}z_k,$$

where z_i ∈ E_{λ_i} for i = 1, …, k. Use this result to prove that the set of solutions forms an n-dimensional real vector space.

Following the steps of the previous exercise, we may pick an invertible matrix Q whose columns are eigenvectors of A, and let D be the diagonal matrix Q^{-1}AQ. As before, the solution has the form x = Qu for some vector u whose i-th entry is c_i e^{λt} whenever the i-th column of Q is an eigenvector corresponding to λ. Denoting by D̄ the diagonal matrix with D̄_ii = e^{D_ii t}, we may write x = QD̄y, where the i-th entry of y is c_i. So every solution must be of the form described in the exercise.

For the second statement, we should know first that the set

$$\{e^{\lambda_1 t}, e^{\lambda_2 t}, \dots, e^{\lambda_k t}\}$$

is linearly independent in the space of real-valued functions. Since Q is invertible, the solution set

$$\{Q\bar{D}y : y \in \mathbb{R}^n\}$$

is an n-dimensional real vector space.

Released under the MIT License.