
3 Elementary Matrix Operations and Systems of Linear Equations

3.1 Elementary Matrix Operations and Elementary Matrices

Elementary Matrix Operations

Definitions: Elementary Matrix Operations

Let A be an m×n matrix. Any one of the following three operations on the rows [columns] of A is called an elementary row [column] operation:

  1. Interchanging any two rows [columns] of A;
  2. Multiplying any row [column] of A by a nonzero scalar;
  3. Adding any scalar multiple of a row [column] of A to another row [column].

Any of these three operations is called an elementary operation. Elementary operations are of type 1, type 2, or type 3 depending on whether they are obtained by (1), (2), or (3).

Example 3.1.1

Let

A = [ 1  2  3  4
      2  1  1  3
      4  0  1  2 ]

Interchanging the second row of A with the first row is an example of an elementary row operation of type 1. The resulting matrix is

B = [ 2  1  1  3
      1  2  3  4
      4  0  1  2 ]

Multiplying the second column of A by 3 is an example of an elementary column operation of type 2. The resulting matrix is

C = [ 1  6  3  4
      2  3  1  3
      4  0  1  2 ]

Adding 4 times the third row of A to the first row is an example of an elementary row operation of type 3. In this case, the resulting matrix is

M = [ 17  2  7  12
       2  1  1   3
       4  0  1   2 ]
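The three operations are easy to experiment with in code. The following is a minimal sketch in plain Python, applied to the matrix A of this example (the helper names are our own, not standard library functions):

```python
# The three elementary row/column operations of the definition, applied to the
# matrix A of Example 3.1.1. Plain Python lists; helper names are our own.

def swap_rows(M, i, j):                 # type 1: interchange rows i and j
    M = [row[:] for row in M]
    M[i], M[j] = M[j], M[i]
    return M

def scale_column(M, j, c):              # type 2: multiply column j by c != 0
    return [[c * x if k == j else x for k, x in enumerate(row)] for row in M]

def add_row_multiple(M, src, dst, c):   # type 3: add c * (row src) to row dst
    M = [row[:] for row in M]
    M[dst] = [x + c * y for x, y in zip(M[dst], M[src])]
    return M

A = [[1, 2, 3, 4],
     [2, 1, 1, 3],
     [4, 0, 1, 2]]

B = swap_rows(A, 0, 1)                  # interchange rows 1 and 2
C = scale_column(A, 1, 3)               # multiply column 2 by 3
M = add_row_multiple(A, 2, 0, 4)        # add 4 times row 3 to row 1

assert B == [[2, 1, 1, 3], [1, 2, 3, 4], [4, 0, 1, 2]]
assert C == [[1, 6, 3, 4], [2, 3, 1, 3], [4, 0, 1, 2]]
assert M == [[17, 2, 7, 12], [2, 1, 1, 3], [4, 0, 1, 2]]
```

Each helper returns a new matrix, so A itself is left unchanged and can be reused for each of the three operations.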


Definition: elementary matrix

An n×n elementary matrix is a matrix obtained by performing an elementary operation on I_n. The elementary matrix is said to be of type 1, 2, or 3 according to whether the elementary operation performed on I_n is a type 1, 2, or 3 operation, respectively.

Theorem 3.1

Let A ∈ M_{m×n}(F), and suppose that B is obtained from A by performing an elementary row [column] operation. Then there exists an m×m [n×n] elementary matrix E such that B = EA [B = AE]. In fact, E is obtained from I_m [I_n] by performing the same elementary row [column] operation as that which was performed on A to obtain B. Conversely, if E is an elementary m×m [n×n] matrix, then EA [AE] is the matrix obtained from A by performing the same elementary row [column] operation as that which produces E from I_m [I_n].

The proof is omitted; it amounts to verifying Theorem 3.1 for each type of elementary row operation. The proof for column operations can then be obtained by using the matrix transpose to convert a column operation into a row operation.

Example 3.1.2

Consider the matrices A and B in Example 3.1.1. In this case, B is obtained from A by interchanging the first two rows of A. Performing this same operation on I_3, we obtain the elementary matrix

E = [ 0  1  0
      1  0  0
      0  0  1 ]

Note that EA = B. (Left-multiplying by E performs the corresponding row operation; this is immediate from the definition of matrix multiplication.)

In the second part of Example 3.1.1, C is obtained from A by multiplying the second column of A by 3. Performing this same operation on I_4, we obtain the elementary matrix

E = [ 1  0  0  0
      0  3  0  0
      0  0  1  0
      0  0  0  1 ]

Observe that AE=C.
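Theorem 3.1 can be checked directly on both parts of this example. A small sketch in plain Python (the helpers are our own):

```python
# Theorem 3.1 on Example 3.1.2: a row operation on I_3 gives E with E A = B,
# and a column operation on I_4 gives E with A E = C. Helpers are our own.

def identity(n):
    return [[1 if i == j else 0 for j in range(n)] for i in range(n)]

def matmul(X, Y):
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*Y)]
            for row in X]

A = [[1, 2, 3, 4], [2, 1, 1, 3], [4, 0, 1, 2]]

E1 = identity(3)                 # type 1: interchange rows 1 and 2 of I_3
E1[0], E1[1] = E1[1], E1[0]
assert matmul(E1, A) == [[2, 1, 1, 3], [1, 2, 3, 4], [4, 0, 1, 2]]   # = B

E2 = identity(4)                 # type 2: multiply column 2 of I_4 by 3
E2[1][1] = 3
assert matmul(A, E2) == [[1, 6, 3, 4], [2, 3, 1, 3], [4, 0, 1, 2]]   # = C
```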

Theorem 3.2: inverse of elementary matrix

Elementary matrices are invertible, and the inverse of an elementary matrix is an elementary matrix of the same type.
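A quick sketch of Theorem 3.2 for one type 3 elementary matrix (plain Python; helpers are our own): the operation "add 4 times row 3 to row 1" is undone by "add −4 times row 3 to row 1", which is again a type 3 operation.

```python
# Theorem 3.2, checked for a type 3 elementary matrix: E adds 4 times row 3
# to row 1; its inverse adds -4 times row 3 to row 1 (again type 3).

def identity(n):
    return [[1 if i == j else 0 for j in range(n)] for i in range(n)]

def matmul(X, Y):
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*Y)]
            for row in X]

E = identity(3)
E[0][2] = 4            # add 4 * (row 3) to row 1
Einv = identity(3)
Einv[0][2] = -4        # add -4 * (row 3) to row 1

assert matmul(E, Einv) == identity(3)
assert matmul(Einv, E) == identity(3)
```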

3.2 The Rank of a Matrix and Matrix Inverses

Definition: rank

If A ∈ M_{m×n}(F), we define the rank of A, denoted rank(A), to be the rank of the linear transformation L_A : F^n → F^m.

Theorem 3.3

Let T : V → W be a linear transformation between finite-dimensional vector spaces, and let β, γ be ordered bases for V, W, respectively. Then rank(T) = rank([T]_β^γ).

Every matrix A is the matrix representation of the linear transformation L_A with respect to the appropriate standard ordered bases. Thus the rank of the linear transformation L_A is the same as the rank of one of its matrix representations, namely A.

Theorem 3.4

Let A be an m×n matrix. If P and Q are invertible m×m and n×n matrices, respectively, then

  • rank(AQ)=rank(A),
  • rank(PA)=rank(A), and therefore,
  • rank(PAQ)=rank(A).

Corollary

Let V, W be finite-dimensional vector spaces and T : V → W be an isomorphism. Let V_0 be a subspace of V. Then

  • T(V0) is a subspace of W;
  • dim(V0)=dim(T(V0)).

Corollary

Elementary row and column operations on a matrix are rank-preserving.

Theorem 3.5

The rank of any matrix equals the maximum number of its linearly independent columns; that is, the rank of a matrix is the dimension of the subspace generated by its columns.

Example 3.2.1 (Rank Determination)

Let

A = [ 1  0  1
      0  1  1
      1  0  1 ].

Observe that the first and second columns of A are linearly independent and that the third column is a linear combination of the first two. Thus,

rank(A) = dim(span{ (1, 0, 1)^t, (0, 1, 0)^t, (1, 1, 1)^t }) = 2.

Example 3.2.2

Let

A = [ 1  2  1
      1  0  3
      1  1  2 ].

If we subtract the first row of A from rows 2 and 3 (type 3 elementary row operations), the result is

[ 1  2  1
  0 -2  2
  0 -1  1 ].

If we now subtract twice the first column from the second and subtract the first column from the third (type 3 elementary column operations), we obtain

[ 1  0  0
  0 -2  2
  0 -1  1 ].

It's now obvious that the maximum number of linearly independent columns of this matrix is 2. Hence the rank of A is 2.
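Since elementary operations are rank-preserving, the rank can be computed by row-reducing and counting pivots. A minimal sketch using exact fractions (the function name is our own):

```python
# Rank by counting pivots of a row reduction; elementary operations preserve
# rank, so the pivot count is rank(A). The name `rank` is our own.
from fractions import Fraction

def rank(A):
    M = [[Fraction(x) for x in row] for row in A]
    rows, cols, r = len(M), len(M[0]) if M else 0, 0
    for c in range(cols):
        piv = next((i for i in range(r, rows) if M[i][c] != 0), None)
        if piv is None:
            continue                      # no pivot in this column
        M[r], M[piv] = M[piv], M[r]       # type 1: bring a nonzero entry up
        M[r] = [x / M[r][c] for x in M[r]]        # type 2: make the pivot 1
        for i in range(rows):             # type 3: clear the column
            if i != r and M[i][c] != 0:
                M[i] = [a - M[i][c] * b for a, b in zip(M[i], M[r])]
        r += 1
    return r

assert rank([[1, 2, 1], [1, 0, 3], [1, 1, 2]]) == 2   # Example 3.2.2
assert rank([[1, 0, 1], [0, 1, 1], [1, 0, 1]]) == 2   # Example 3.2.1
```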

Theorem 3.6

Let A be an m×n matrix of rank r. Then r ≤ m, r ≤ n, and, by means of a finite number of elementary row and column operations, A can be transformed into the matrix

D = [ I_r  O_1
      O_2  O_3 ],

where I_r is the r×r identity matrix and O_1, O_2, O_3 are zero matrices. Thus D_ii = 1 for i ≤ r and D_ij = 0 otherwise.

Example 3.2.3


Conclusion about rank

Corollary 1

Let A be an m×n matrix of rank r. Then there exist invertible matrices B (of size m×m) and C (of size n×n) such that D = BAC, where

D = [ I_r  O_1
      O_2  O_3 ]

is the m×n matrix in which O_1, O_2, O_3 are zero matrices.

Corollary 2

Let A be an m×n matrix. Then

  • rank(A^t) = rank(A).
  • The rank of any matrix equals the maximum number of its linearly independent rows; that is, the rank of a matrix is the dimension of the subspace generated by its rows.
  • The rows and columns of any matrix generate subspaces of the same dimension, numerically equal to the rank of the matrix.

Corollary 3

Every invertible matrix is a product of elementary matrices.

Theorem 3.7

Let T : V → W and U : W → Z be linear transformations on finite-dimensional vector spaces V, W, Z, and let A and B be matrices such that the product AB is defined. Then

  • rank(UT) ≤ rank(U)
  • rank(UT) ≤ rank(T)
  • rank(AB) ≤ rank(A)
  • rank(AB) ≤ rank(B)

Example 3.2.4


The Inverse of a Matrix

Definition: augmented matrix

Let A and B be m×n and m×p matrices, respectively. By the augmented matrix (A|B), we mean the m×(n+p) matrix (A B), that is, the matrix whose first n columns are the columns of A, and whose last p columns are the columns of B.

Let A be an invertible n×n matrix, and consider the n×2n augmented matrix C = (A|I_n). By Exercise 15, we have

A^{-1}C = (A^{-1}A | A^{-1}I_n) = (I_n | A^{-1}).    (1)

By Corollary 3 to Theorem 3.6, A^{-1} is a product of elementary matrices, say A^{-1} = E_p E_{p-1} ⋯ E_1. Thus (1) becomes

E_p E_{p-1} ⋯ E_1 (A|I_n) = A^{-1}C = (I_n | A^{-1}).

Because multiplying a matrix on the left by an elementary matrix transforms the matrix by an elementary row operation (Theorem 3.1, p. 149), we have the following result: If A is an invertible n×n matrix, then it is possible to transform the matrix (A|I_n) into the matrix (I_n|A^{-1}) by means of a finite number of elementary row operations.

Conversely, suppose that A is invertible and that, for some n×n matrix B, the matrix (A|I_n) can be transformed into the matrix (I_n|B) by a finite number of elementary row operations. Let E_1, E_2, …, E_p be the elementary matrices associated with these elementary row operations as in Theorem 3.1; then

E_p E_{p-1} ⋯ E_1 (A|I_n) = (I_n|B).    (2)

Letting M = E_p E_{p-1} ⋯ E_1, we have from (2) that (MA | MI_n) = (I_n | B). Hence MA = I_n and M = B, so M = A^{-1} and therefore B = A^{-1}. Thus we have the following result: If A is an invertible n×n matrix, and the matrix (A|I_n) is transformed into a matrix of the form (I_n|B) by means of a finite number of elementary row operations, then B = A^{-1}.

If, on the other hand, A is an n×n matrix that is not invertible, then rank(A) < n. Hence any attempt to transform (A|I_n) into a matrix of the form (I_n|B) by means of elementary row operations must fail, for otherwise A could be transformed into I_n by the same row operations. This is impossible, however, because elementary row operations preserve rank. In fact, A can be transformed into a matrix with a row containing only zero entries, yielding the following result: If A is an n×n matrix that is not invertible, then any attempt to transform (A|I_n) into a matrix of the form (I_n|B) produces a row whose first n entries are zeros.
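The whole procedure can be sketched as follows (a minimal pure-Python version with exact fractions; not intended as a robust implementation):

```python
# Invert A by row-reducing (A | I_n): reach (I_n | A^{-1}), or report failure
# when some pivot is missing (rank(A) < n). A minimal sketch, exact fractions.
from fractions import Fraction

def inverse(A):
    n = len(A)
    M = [[Fraction(x) for x in row] +
         [Fraction(1 if i == j else 0) for j in range(n)]
         for i, row in enumerate(A)]          # the augmented matrix (A | I_n)
    for c in range(n):
        piv = next((i for i in range(c, n) if M[i][c] != 0), None)
        if piv is None:
            return None                       # A is not invertible
        M[c], M[piv] = M[piv], M[c]
        M[c] = [x / M[c][c] for x in M[c]]
        for i in range(n):
            if i != c and M[i][c] != 0:
                M[i] = [a - M[i][c] * b for a, b in zip(M[i], M[c])]
    return [row[n:] for row in M]             # the right half is A^{-1}

assert inverse([[1, 1, 2], [0, 1, 2], [0, 0, 1]]) == \
    [[1, -1, 0], [0, 1, -2], [0, 0, 1]]       # the matrix of Example 3.2.7
assert inverse([[1, 2, 1], [2, 1, -1], [1, 5, 4]]) is None   # Example 3.2.6
```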

Example 3.2.5


Example 3.2.6

We determine whether the matrix

A = [ 1  2  1
      2  1 -1
      1  5  4 ]

is invertible, and if it is, we compute its inverse. Using a strategy similar to the one used in Example 3.2.5, we attempt to use elementary row operations to transform

(A|I) = [ 1  2  1 | 1  0  0
          2  1 -1 | 0  1  0
          1  5  4 | 0  0  1 ]

into a matrix of the form (I|B). We first add -2 times row 1 to row 2 and -1 times row 1 to row 3. We then add row 2 to row 3. The result,

[ 1  2  1 | 1  0  0        [ 1  2  1 |  1  0  0        [ 1  2  1 |  1  0  0
  2  1 -1 | 0  1  0    →     0 -3 -3 | -2  1  0    →     0 -3 -3 | -2  1  0
  1  5  4 | 0  0  1 ]        0  3  3 | -1  0  1 ]        0  0  0 | -3  1  1 ],

is a matrix with a row whose first 3 entries are zeros. Therefore A is not invertible.

Example 3.2.7 [core]

Let T:P2(R)P2(R) be defined by

T(f(x)) = f(x) + f'(x) + f''(x),

where f'(x) and f''(x) denote the first and second derivatives of f(x). We use Corollary 1 of Theorem 2.18 to test T for invertibility and to compute the inverse if T is invertible. Taking β to be the standard ordered basis of P_2(R), we have

[T]_β = [ 1  1  2
          0  1  2
          0  0  1 ].

Using the method of Examples 3.2.5 and 3.2.6, we find that [T]_β is invertible with inverse

([T]_β)^{-1} = [ 1 -1  0
                 0  1 -2
                 0  0  1 ].

Thus T is invertible, and ([T]_β)^{-1} = [T^{-1}]_β. Hence by Theorem 2.14 (p. 91), we have

[T^{-1}(a_0 + a_1x + a_2x^2)]_β = [ 1 -1  0 ] [ a_0 ]   [ a_0 - a_1  ]
                                  [ 0  1 -2 ] [ a_1 ] = [ a_1 - 2a_2 ]
                                  [ 0  0  1 ] [ a_2 ]   [ a_2        ]

Therefore

T^{-1}(a_0 + a_1x + a_2x^2) = (a_0 - a_1) + (a_1 - 2a_2)x + a_2x^2.

3.3 Systems of Linear Equations - Theoretical Aspects

The system of equations

a_11 x_1 + a_12 x_2 + ⋯ + a_1n x_n = b_1
a_21 x_1 + a_22 x_2 + ⋯ + a_2n x_n = b_2
        ⋮
a_m1 x_1 + a_m2 x_2 + ⋯ + a_mn x_n = b_m

where the a_ij and b_i are scalars in a field F and x_1, x_2, …, x_n are n variables taking values in F, is called a system (S) of m linear equations in n unknowns over the field F.

The m×n matrix

A = [ a_11  a_12  …  a_1n
      a_21  a_22  …  a_2n
        ⋮     ⋮          ⋮
      a_m1  a_m2  …  a_mn ]

is called the coefficient matrix of the system (S).

We can write the system as a single matrix equation

Ax=b,

where

x = (x_1, x_2, …, x_n)^t,   b = (b_1, b_2, …, b_m)^t.

To exploit the results that we have developed, we often consider a system of linear equations as a single matrix equation.

A solution to the system (S) is an n-tuple

s = (s_1, s_2, …, s_n) ∈ F^n

such that As=b. The set of all solutions to the system (S) is called the solution set of the system. System (S) is consistent if its solution set is nonempty; otherwise it is inconsistent.

Example 3.3.1

(a) Consider the system

x_1 + x_2 = 3
x_1 - x_2 = 1

The solution is unique: x_1 = 2, x_2 = 1.

In matrix form:

[ 1  1 ] [ x_1 ]   [ 3 ]
[ 1 -1 ] [ x_2 ] = [ 1 ].

(b) Consider the system

2x_1 + 3x_2 + x_3 = 1
x_1 - x_2 + 2x_3 = 6

which has infinitely many solutions, such as s = (-6, 2, 7) and s' = (8, -4, -3).

(c) Consider

x_1 + x_2 = 0
x_1 + x_2 = 1

that is

[ 1  1 ] [ x_1 ]   [ 0 ]
[ 1  1 ] [ x_2 ] = [ 1 ]

It's evident that this system has no solutions. Thus we see that a system of linear equations can have one, many, or no solutions.


Definitions: homogeneous

A system Ax = b is said to be homogeneous if b = 0; otherwise it is nonhomogeneous.

Theorem 3.8

Let Ax = 0 be a homogeneous system of m linear equations in n unknowns over a field F. Let K denote the set of all solutions to Ax = 0. Then K = N(L_A); hence K is a subspace of F^n of dimension n - rank(L_A) = n - rank(A).

Corollary

If m<n, the system Ax=0 has a nonzero solution.

Example 3.3.2 [core]

(a) Consider the system

x_1 + 2x_2 + x_3 = 0
x_1 - x_2 - x_3 = 0

Let

A = [ 1  2  1
      1 -1 -1 ]

be the coefficient matrix of this system. It is clear that rank(A) = 2. If K is the solution set of this system, then dim(K) = 3 - 2 = 1. Thus any nonzero solution constitutes a basis for K. For example, since

(1, -2, 3)

is a solution to the given system, it is a basis for K. Thus any vector in K is of the form

(t, -2t, 3t)

where t ∈ R.

(b) Consider the system x_1 - 2x_2 + x_3 = 0 of one equation in three unknowns. If A = [ 1 -2 1 ] is the coefficient matrix, then rank(A) = 1. Hence if K is the solution set, then dim(K) = 3 - 1 = 2. Note that

(2, 1, 0) and (-1, 0, 1)

are linearly independent vectors in K. Thus they constitute a basis for K, so that

K = { t_1(2, 1, 0) + t_2(-1, 0, 1) : t_1, t_2 ∈ R }.
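Both parts of this example can be reproduced mechanically: row-reduce A, then read off one basis vector of K per free variable. A sketch in plain Python using exact fractions (the function name is our own):

```python
# A basis for K = N(L_A): row-reduce A, then build one basis vector per free
# (non-pivot) column. Exact fractions; the function name is our own.
from fractions import Fraction

def null_space_basis(A):
    rows, cols = len(A), len(A[0])
    M = [[Fraction(x) for x in row] for row in A]
    pivots, r = [], 0
    for c in range(cols):
        piv = next((i for i in range(r, rows) if M[i][c] != 0), None)
        if piv is None:
            continue
        M[r], M[piv] = M[piv], M[r]
        M[r] = [x / M[r][c] for x in M[r]]
        for i in range(rows):
            if i != r and M[i][c] != 0:
                M[i] = [a - M[i][c] * b for a, b in zip(M[i], M[r])]
        pivots.append(c)
        r += 1
    basis = []
    for free in (c for c in range(cols) if c not in pivots):
        v = [Fraction(0)] * cols
        v[free] = Fraction(1)        # set the free variable to 1 ...
        for i, p in enumerate(pivots):
            v[p] = -M[i][free]       # ... and solve for the pivot variables
        basis.append(v)
    return basis

# Example 3.3.2(b): dim(K) = 3 - 1 = 2, with basis {(2, 1, 0), (-1, 0, 1)}.
assert null_space_basis([[1, -2, 1]]) == [[2, 1, 0], [-1, 0, 1]]

# Example 3.3.2(a): dim(K) = 3 - 2 = 1; the vector found spans the same line
# as (1, -2, 3).
(v,) = null_space_basis([[1, 2, 1], [1, -1, -1]])
assert [3 * x for x in v] == [1, -2, 3]
```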

Theorem 3.9

Let K be the solution set of a system Ax = b, and let K_H be the solution set of the corresponding homogeneous system Ax = 0. Then for any solution s to Ax = b,

K = { s } + K_H = { s + k : k ∈ K_H }.

Example 3.3.3

(a) Consider the system

x_1 + 2x_2 + x_3 = 7
x_1 - x_2 - x_3 = -4.

The corresponding homogeneous system is the system in Example 3.3.2(a). It is easily verified that

s = (1, 1, 4)

is a solution to the preceding nonhomogeneous system. So the solution set of the system is

K = { (1, 1, 4) + t(1, -2, 3) : t ∈ R }

by Theorem 3.9.

(b) Consider the system

x_1 - 2x_2 + x_3 = 4.

The corresponding homogeneous system is the system in Example 3.3.2(b). Since

s = (4, 0, 0)

is a solution to the given system, the solution set K can be written as

K = { (4, 0, 0) + t_1(2, 1, 0) + t_2(-1, 0, 1) : t_1, t_2 ∈ R }.

Theorem 3.10


Let Ax=b be a system of n linear equations in n unknowns. If A is invertible, then the system has exactly one solution, namely,

x = A^{-1}b.

Conversely, if the system has exactly one solution, then A is invertible.

Example 3.3.4

Consider the system:

2x_2 + 4x_3 = 2
2x_1 + 4x_2 + 2x_3 = 3
3x_1 + 3x_2 + x_3 = 1

Using the inverse of the coefficient matrix, the unique solution is

(x_1, x_2, x_3) = A^{-1}b = (-7/8, 5/4, -1/8).
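As a quick check (a sketch; exact fractions avoid any floating-point doubt), this solution indeed satisfies Ax = b:

```python
# Checking the unique solution of Example 3.3.4: A is invertible, so
# x = A^{-1} b is the only solution, and it must satisfy A x = b.
from fractions import Fraction

A = [[0, 2, 4], [2, 4, 2], [3, 3, 1]]
b = [2, 3, 1]
x = [Fraction(-7, 8), Fraction(5, 4), Fraction(-1, 8)]

assert [sum(a * xi for a, xi in zip(row, x)) for row in A] == b
```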

Theorem 3.11


Let Ax=b be a system of linear equations. Then the system is consistent if and only if

rank(A)=rank(A|b).

Example 3.3.5

Recall the system of equations

x_1 + x_2 = 0
x_1 + x_2 = 1

in Example 3.3.1(c). Since

A = [ 1  1       (A|b) = [ 1  1 | 0
      1  1 ],              1  1 | 1 ],

so rank(A) = 1 and rank(A|b) = 2.

Because the two ranks are unequal, the system has no solutions.

Example 3.3.6

We can use Theorem 3.11 to determine whether (3, 3, 2) is in the range of the linear transformation T : R^3 → R^3 defined by

T(a_1, a_2, a_3) = (a_1 + a_2 + a_3, a_1 - a_2 + a_3, a_1 + a_3).

We check whether (3, 3, 2) ∈ R(T) by solving the system

x_1 + x_2 + x_3 = 3,   x_1 - x_2 + x_3 = 3,   x_1 + x_3 = 2.

Since the rank of the coefficient matrix is 2 but the rank of the augmented matrix is 3, this system has no solutions. Hence (3, 3, 2) ∉ R(T).
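The rank test of Theorem 3.11 is easy to mechanize. A sketch with a small rank helper of our own (exact fractions), applied to this example:

```python
# Theorem 3.11 on Example 3.3.6: the system is consistent iff
# rank(A) = rank(A|b). The `rank` helper is our own.
from fractions import Fraction

def rank(A):
    M = [[Fraction(x) for x in row] for row in A]
    r = 0
    for c in range(len(M[0])):
        piv = next((i for i in range(r, len(M)) if M[i][c] != 0), None)
        if piv is None:
            continue
        M[r], M[piv] = M[piv], M[r]
        for i in range(len(M)):
            if i != r and M[i][c] != 0:
                f = M[i][c] / M[r][c]
                M[i] = [a - f * b for a, b in zip(M[i], M[r])]
        r += 1
    return r

A = [[1, 1, 1], [1, -1, 1], [1, 0, 1]]        # coefficient matrix of T
Ab = [row + [v] for row, v in zip(A, [3, 3, 2])]   # augmented with (3, 3, 2)

assert (rank(A), rank(Ab)) == (2, 3)   # unequal ranks: no solution exists
```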

3.4 Systems of Linear Equations - Computational Aspects

Definition: equivalent

Two systems of linear equations are called equivalent if they have the same solution set.

Theorem 3.13

Let Ax=b be a system of m linear equations in n unknowns, and let C be an invertible m×m matrix. Then (CA)x=Cb is equivalent to Ax=b.

Corollary

If (A'|b') is obtained from (A|b) by a finite number of elementary row operations, then the system A'x = b' is equivalent to the original system Ax = b.

Gaussian Elimination Method Outline:

  1. Form the augmented matrix (A|b).
  2. Use elementary row operations to transform (A|b) into reduced row echelon form.
  3. Solve the system from the reduced form using back substitution.

We now describe a method for solving any system of linear equations.

Consider the system:

3x_1 + 2x_2 + 3x_3 - 2x_4 = 1
x_1 + x_2 + x_3 = 3
x_1 + 2x_2 + x_3 - x_4 = 2.

First, form the augmented matrix:

[ 3  2  3 -2 | 1
  1  1  1  0 | 3
  1  2  1 -1 | 2 ].

Then perform elementary row operations to transform it into reduced row echelon form, using row interchanges, scaling, and elimination as follows.

  1. In the leftmost nonzero column, create a 1 in the first row (here, interchange rows 1 and 3):

     [ 1  2  1 -1 | 2
       1  1  1  0 | 3
       3  2  3 -2 | 1 ]

  2. By means of type 3 row operations, use the first row to obtain zeros in the remaining positions of the leftmost nonzero column:

     [ 1  2  1 -1 |  2
       0 -1  0  1 |  1
       0 -4  0  1 | -5 ]

  3. Create a 1 in the next row in the leftmost possible column, without using previous rows (multiply row 2 by -1):

     [ 1  2  1 -1 |  2
       0  1  0 -1 | -1
       0 -4  0  1 | -5 ]

  4. Use type 3 elementary row operations to obtain zeros below the 1 created in the preceding step (add 4 times row 2 to row 3):

     [ 1  2  1 -1 |  2
       0  1  0 -1 | -1
       0  0  0 -3 | -9 ]

  5. Repeat steps 3 and 4 until no nonzero rows remain (multiply row 3 by -1/3):

     [ 1  2  1 -1 |  2
       0  1  0 -1 | -1
       0  0  0  1 |  3 ]

  6. Work upward, beginning with the last nonzero row, and add multiples of each row to the rows above to create zeros above each leading 1. The reduction is complete after the process has been performed with the second row:

     [ 1  0  1  0 | 1
       0  1  0  0 | 2
       0  0  0  1 | 3 ]
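The steps above can be sketched as a short reduced-row-echelon-form routine (plain Python with exact fractions; names are our own):

```python
# RREF via the forward pass (create leading 1s, clear below) combined with
# clearing above each pivot; applied to the augmented matrix of this example.
from fractions import Fraction

def rref(A):
    M = [[Fraction(x) for x in row] for row in A]
    rows, cols, r = len(M), len(M[0]), 0
    for c in range(cols):
        piv = next((i for i in range(r, rows) if M[i][c] != 0), None)
        if piv is None:
            continue                         # no pivot in this column
        M[r], M[piv] = M[piv], M[r]          # bring a nonzero entry up
        M[r] = [x / M[r][c] for x in M[r]]   # create the leading 1
        for i in range(rows):                # clear the rest of the column
            if i != r and M[i][c] != 0:
                M[i] = [a - M[i][c] * b for a, b in zip(M[i], M[r])]
        r += 1
    return M

aug = [[3, 2, 3, -2, 1],
       [1, 1, 1, 0, 3],
       [1, 2, 1, -1, 2]]

assert rref(aug) == [[1, 0, 1, 0, 1],
                     [0, 1, 0, 0, 2],
                     [0, 0, 0, 1, 3]]
```

The intermediate matrices differ from the hand computation (the code scales row 1 instead of interchanging rows), but by the uniqueness of the reduced row echelon form the final matrix is the same.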

Definition: Reduced row echelon form

A matrix is said to be in reduced row echelon form if it satisfies the following conditions:

(a) Any row containing a nonzero entry precedes any row in which all the entries are zero.

(b) The first nonzero entry in each row is the only nonzero entry in its column.

(c) The first nonzero entry in each row is 1, and it occurs in a column to the right of the first nonzero entry in the preceding row.

The method described with forward elimination and back substitution is known as Gaussian elimination.

Theorem 3.14

Gaussian elimination transforms any matrix into its reduced row echelon form.

Theorem 3.15

Let Ax = b be a system of r nonzero equations in n unknowns. Suppose that rank(A) = rank(A|b) and that (A|b) is in reduced row echelon form. Then:

  • rank(A) = r.
  • If the general solution obtained by the procedure of this section is

s = s_0 + t_1 u_1 + ⋯ + t_{n-r} u_{n-r},

then {u_1, …, u_{n-r}} is a basis for the solution set of the corresponding homogeneous system, and s_0 is a solution to the original system.

Theorem 3.16

Let A be an m×n matrix of rank r>0, and B the reduced row echelon form of A. Then:

(a) The number of nonzero rows in B is r.

(b) For each i = 1, …, r, there is a column b_{j_i} of B such that b_{j_i} = e_i.

(c) The columns of A numbered j_1, …, j_r are linearly independent.

(d) For each k = 1, …, n, if column k of B is d_1 e_1 + ⋯ + d_r e_r, then column k of A is d_1 a_{j_1} + ⋯ + d_r a_{j_r}.

Corollary

The reduced row echelon form of a matrix is unique.

Example 3.4.2

Let

A = [ 2  4  6  2  4
      1  2  3  1  1
      2  4  8  0  0
      3  6  7  5  9 ].

The reduced row echelon form of A is

B = [ 1  2  0  4  0
      0  0  1 -1  0
      0  0  0  0  1
      0  0  0  0  0 ].

Since B has three nonzero rows, the rank of A is 3. The first, third, and fifth columns of B are e_1, e_2, and e_3; so Theorem 3.16(c) asserts that the first, third, and fifth columns of A are linearly independent.

Let the columns of A be denoted a_1, a_2, a_3, a_4, and a_5. Because the second column of B is 2e_1, it follows from Theorem 3.16(d) that a_2 = 2a_1, as is easily checked. Moreover, since the fourth column of B is 4e_1 + (-1)e_2, the same result shows that

a_4 = 4a_1 + (-1)a_3.
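Theorem 3.16 can be replayed on this example in code: the pivot columns of the reduced row echelon form pick out the independent columns of A, and the other columns of B record the dependencies. A sketch (function name is our own):

```python
# RREF plus the list of pivot columns, applied to the matrix A of Example
# 3.4.2. Exact fractions; `rref_pivots` is our own helper.
from fractions import Fraction

def rref_pivots(A):
    M = [[Fraction(x) for x in row] for row in A]
    rows, cols, r, pivots = len(M), len(M[0]), 0, []
    for c in range(cols):
        piv = next((i for i in range(r, rows) if M[i][c] != 0), None)
        if piv is None:
            continue
        M[r], M[piv] = M[piv], M[r]
        M[r] = [x / M[r][c] for x in M[r]]
        for i in range(rows):
            if i != r and M[i][c] != 0:
                M[i] = [a - M[i][c] * b for a, b in zip(M[i], M[r])]
        pivots.append(c)
        r += 1
    return M, pivots

A = [[2, 4, 6, 2, 4],
     [1, 2, 3, 1, 1],
     [2, 4, 8, 0, 0],
     [3, 6, 7, 5, 9]]
B, pivots = rref_pivots(A)

assert pivots == [0, 2, 4]            # columns 1, 3, 5 of A are independent
assert B == [[1, 2, 0, 4, 0], [0, 0, 1, -1, 0],
             [0, 0, 0, 0, 1], [0, 0, 0, 0, 0]]

cols_A = list(zip(*A))
assert [2 * x for x in cols_A[0]] == list(cols_A[1])                  # a2 = 2a1
assert [4 * x - y for x, y in zip(cols_A[0], cols_A[2])] == list(cols_A[3])  # a4 = 4a1 - a3
```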

Example 3.4.3

The set

S = { 2 + x + 2x^2 + 3x^3, 4 + 2x + 4x^2 + 6x^3, 6 + 3x + 8x^2 + 7x^3, 2 + x + 5x^3, 4 + x + 9x^3 }

generates a subspace V of P_3(R). To find a subset of S that is a basis for V, we consider the subset

S' = { (2, 1, 2, 3), (4, 2, 4, 6), (6, 3, 8, 7), (2, 1, 0, 5), (4, 1, 0, 9) }

consisting of the images of the polynomials in S under the standard representation of P_3(R) with respect to the standard ordered basis. Note that the 4×5 matrix whose columns are the vectors in S' is the matrix A in Example 3.4.2. From the reduced row echelon form of A, which is the matrix B in Example 3.4.2, we see that the first, third, and fifth columns of A are linearly independent and the second and fourth columns of A are linear combinations of the first, third, and fifth columns. Hence

{(2,1,2,3),(6,3,8,7),(4,1,0,9)}

is a basis for the subspace of R4 that is generated by S. It follows that

{ 2 + x + 2x^2 + 3x^3, 6 + 3x + 8x^2 + 7x^3, 4 + x + 9x^3 }

is a basis for the subspace V of P3(R).

Example 3.4.4 [mid-term]

Let

V = { (x_1, x_2, x_3, x_4, x_5) ∈ R^5 : x_1 + 7x_2 + 5x_3 - 4x_4 + 2x_5 = 0 }.

It is easily verified that V is a subspace of R^5 and that

S = { (2, 0, 0, 1, 1), (-1, -1, 2, 1, 1), (-5, 1, 0, 1, 1) }

is a linearly independent subset of V.

To extend S to a basis for V, we first obtain a basis β for V. To do so, we solve the system of linear equations that defines V. Since in this case V is defined by a single equation, we need only write the equation as

x_1 = -7x_2 - 5x_3 + 4x_4 - 2x_5

and assign parametric values to x_2, x_3, x_4, x_5. If x_2 = t_1, x_3 = t_2, x_4 = t_3, and x_5 = t_4, then the vectors in V have the form

(x_1, x_2, x_3, x_4, x_5) = (-7t_1 - 5t_2 + 4t_3 - 2t_4, t_1, t_2, t_3, t_4)
                          = t_1(-7, 1, 0, 0, 0) + t_2(-5, 0, 1, 0, 0) + t_3(4, 0, 0, 1, 0) + t_4(-2, 0, 0, 0, 1).

Hence

β = { (-7, 1, 0, 0, 0), (-5, 0, 1, 0, 0), (4, 0, 0, 1, 0), (-2, 0, 0, 0, 1) }

is a basis for V by Theorem 3.15.

The matrix whose columns are the vectors in S followed by those in β is

[ 2  -1  -5  -7  -5   4  -2
  0  -1   1   1   0   0   0
  0   2   0   0   1   0   0
  1   1   1   0   0   1   0
  1   1   1   0   0   0   1 ]

and its reduced row echelon form is

[ 1  0  0  -1  -1    0   1
  0  1  0   0   0.5  0   0
  0  0  1   1   0.5  0   0
  0  0  0   0   0    1  -1
  0  0  0   0   0    0   0 ].

Thus

{ (2, 0, 0, 1, 1), (-1, -1, 2, 1, 1), (-5, 1, 0, 1, 1), (4, 0, 0, 1, 0) }

is a basis for V containing S.
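The same computation can be done mechanically: row-reduce the matrix whose columns are the vectors of S followed by those of β, and keep the pivot columns. A sketch in plain Python with exact fractions (the helper name and the choice of signs for the vectors are our own reading of the example; the pivot positions do not depend on those sign choices):

```python
# Extend S to a basis of V (Example 3.4.4): the pivot columns of the RREF of
# [S | beta] select S itself plus (4, 0, 0, 1, 0). Helper name is our own.
from fractions import Fraction

def rref_pivots(A):
    M = [[Fraction(x) for x in row] for row in A]
    rows, cols, r, pivots = len(M), len(M[0]), 0, []
    for c in range(cols):
        piv = next((i for i in range(r, rows) if M[i][c] != 0), None)
        if piv is None:
            continue
        M[r], M[piv] = M[piv], M[r]
        M[r] = [x / M[r][c] for x in M[r]]
        for i in range(rows):
            if i != r and M[i][c] != 0:
                M[i] = [a - M[i][c] * b for a, b in zip(M[i], M[r])]
        pivots.append(c)
        r += 1
    return M, pivots

cols = [(2, 0, 0, 1, 1), (-1, -1, 2, 1, 1), (-5, 1, 0, 1, 1),          # S
        (-7, 1, 0, 0, 0), (-5, 0, 1, 0, 0), (4, 0, 0, 1, 0), (-2, 0, 0, 0, 1)]  # beta
M = [list(row) for row in zip(*cols)]     # the 5x7 matrix with these columns

_, pivots = rref_pivots(M)
assert pivots == [0, 1, 2, 5]   # the three vectors of S plus (4, 0, 0, 1, 0)
```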
