
6 Inner Product Spaces

6.1 Inner Products and Norms

Definition

Definition: Inner product [core]

Let $V$ be a vector space over a field $F$. An inner product on $V$ is a function that assigns to each ordered pair of vectors $(x, y) \in V \times V$ a scalar $\langle x, y \rangle \in F$, satisfying:

  • $\langle x + z, y \rangle = \langle x, y \rangle + \langle z, y \rangle$.
  • $\langle cx, y \rangle = c\langle x, y \rangle$ for $c \in F$.
  • $\langle x, y \rangle = \overline{\langle y, x \rangle}$, where the bar denotes complex conjugation.
  • $\langle x, x \rangle > 0$ if $x \neq 0$.

Note that:

  • Condition 3 reduces to $\langle x, y \rangle = \langle y, x \rangle$ if $F = \mathbb{R}$.
  • Conditions 1 and 2 simply require that the inner product be linear in the first component (combined with condition 3, they make it conjugate linear in the second component).
  • It is easily shown that if $a_1, a_2, \ldots, a_n \in F$ and $y, v_1, v_2, \ldots, v_n \in V$, then $\left\langle \sum_{i=1}^{n} a_i v_i,\, y \right\rangle = \sum_{i=1}^{n} a_i \langle v_i, y \rangle$.

The ideas of distance and length are missing from a bare vector space. We therefore need a richer structure, the so-called inner product space, obtained by equipping the vector space with an inner product.

Example 6.1.1

Example 1 For $x = (a_1, a_2, \ldots, a_n)$ and $y = (b_1, b_2, \ldots, b_n)$ in $F^n$, define

$$\langle x, y \rangle = \sum_{i=1}^{n} a_i \overline{b_i}.$$

The verification that $\langle \cdot, \cdot \rangle$ satisfies conditions 1 through 4 is easy. For example, if $z = (c_1, c_2, \ldots, c_n)$, we have for condition 1

$$\langle x + z, y \rangle = \sum_{i=1}^{n} (a_i + c_i)\overline{b_i} = \sum_{i=1}^{n} a_i \overline{b_i} + \sum_{i=1}^{n} c_i \overline{b_i} = \langle x, y \rangle + \langle z, y \rangle.$$

Thus, for $x = (1 + i, 4)$ and $y = (2 - 3i, 4 + 5i)$ in $\mathbb{C}^2$,

$$\langle x, y \rangle = (1 + i)\overline{(2 - 3i)} + 4\overline{(4 + 5i)} = (1 + i)(2 + 3i) + 4(4 - 5i) = 15 - 15i.$$

The inner product in Example 1 is called the standard inner product on $F^n$. When $F = \mathbb{R}$, the conjugations are not needed, and in early courses this standard inner product is usually called the dot product and is denoted by $x \cdot y$ instead of $\langle x, y \rangle$.
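The computation above can be sketched with Python's built-in complex numbers; the helper name `inner` is mine, not from the text.

```python
def inner(x, y):
    """Standard inner product on F^n: sum of a_i * conjugate(b_i)."""
    return sum(a * b.conjugate() for a, b in zip(x, y))

x = [1 + 1j, 4]
y = [2 - 3j, 4 + 5j]
print(inner(x, y))  # (15-15j), the value computed above
```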

Example 6.1.2

If $\langle \cdot, \cdot \rangle$ is any inner product on a vector space $V$ and $r > 0$, we may define another inner product by $\langle x, y \rangle' = r\langle x, y \rangle$. If $r \leq 0$, then condition 4 of the definition of inner product would not hold.

Example 6.1.3

Let $V = C([0, 1])$, the vector space of real-valued continuous functions on $[0, 1]$. For $f, g \in V$, define

$$\langle f, g \rangle = \int_0^1 f(t)g(t)\,dt.$$

  • Since the preceding integral is linear in $f$, conditions 1 and 2 are immediate.
  • Condition 3 is trivial, since the functions are real-valued.
  • If $f \neq 0$, then $f^2$ is bounded away from zero on some subinterval of $[0, 1]$ (continuity is used here), and hence $\langle f, f \rangle = \int_0^1 [f(t)]^2\,dt > 0$.

conjugate transpose

Definition: conjugate transpose

Let $A \in M_{m \times n}(F)$. Define the conjugate transpose or adjoint of $A$ as the $n \times m$ matrix $A^*$ such that $(A^*)_{ij} = \overline{A_{ji}}$ for all $i, j$.

Example 6.1.4

If

$$A = \begin{bmatrix} i & 1 + 2i \\ 2 & 3 + 4i \end{bmatrix},$$

then

$$A^* = \begin{bmatrix} -i & 2 \\ 1 - 2i & 3 - 4i \end{bmatrix}.$$

Notice that:

  • If $x, y \in F^n$ are viewed as column vectors, then $\langle x, y \rangle = y^* x$.
  • The conjugate transpose of a matrix plays a very important role in the remainder of this chapter. In the case that $A$ has only real entries, $A^*$ is simply the transpose of $A$.

Example 6.1.5

Let $V = M_{n \times n}(F)$, and define $\langle A, B \rangle = \operatorname{tr}(B^* A)$ for $A, B \in V$. (Recall that the trace of a matrix $A$ is defined by $\operatorname{tr}(A) = \sum_{i=1}^{n} A_{ii}$.) We verify that conditions 1 and 4 of the definition of inner product hold and leave conditions 2 and 3 to the reader. For this purpose, let $A, B, C \in V$. Then

$$\langle A + B, C \rangle = \operatorname{tr}(C^*(A + B)) = \operatorname{tr}(C^* A + C^* B) = \operatorname{tr}(C^* A) + \operatorname{tr}(C^* B) = \langle A, C \rangle + \langle B, C \rangle.$$

Also,

$$\langle A, A \rangle = \operatorname{tr}(A^* A) = \sum_{i=1}^{n} (A^* A)_{ii} = \sum_{i=1}^{n} \sum_{k=1}^{n} (A^*)_{ik} A_{ki} = \sum_{i=1}^{n} \sum_{k=1}^{n} \overline{A_{ki}}\, A_{ki} = \sum_{i=1}^{n} \sum_{k=1}^{n} |A_{ki}|^2.$$

Now if $A \neq 0$, then $A_{ki} \neq 0$ for some $k$ and $i$. So $\langle A, A \rangle > 0$.
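The trace computation above can be sketched in pure Python, with matrices as lists of rows; all helper names (`conj_transpose`, `frobenius`, etc.) are illustrative.

```python
def conj_transpose(A):
    """A* with (A*)_ij = conjugate(A_ji)."""
    return [[A[i][j].conjugate() for i in range(len(A))] for j in range(len(A[0]))]

def matmul(A, B):
    return [[sum(A[i][k] * B[k][j] for k in range(len(B)))
             for j in range(len(B[0]))] for i in range(len(A))]

def trace(A):
    return sum(A[i][i] for i in range(len(A)))

def frobenius(A, B):
    """<A, B> = tr(B* A)."""
    return trace(matmul(conj_transpose(B), A))

A = [[1j, 1 + 2j], [2, 3 + 4j]]
# <A, A> is the sum of |A_ki|^2 = 1 + 5 + 4 + 25
print(frobenius(A, A))  # (35+0j)
```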

inner product space

The inner product on Mn×n(F) in Example 6.1.5 is called the Frobenius inner product.

A vector space V over F endowed with a specific inner product is called an inner product space. If F=C, we call V a complex inner product space, whereas if F=R, we call V a real inner product space.

For the remainder of this chapter, Fn denotes the inner product space with the standard inner product as defined in Example 6.1.1. Likewise, Mn×n(F) denotes the inner product space with the Frobenius inner product as defined in Example 6.1.5. The reader is cautioned that two distinct inner products on a given vector space yield two distinct inner product spaces. For instance, it can be shown that both

$$\langle f(x), g(x) \rangle_1 = \int_{-1}^{1} f(t)g(t)\,dt \quad \text{and} \quad \langle f(x), g(x) \rangle_2 = \int_0^1 f(t)g(t)\,dt$$

are inner products on the vector space $P(\mathbb{R})$. Even though the underlying vector space is the same, these two inner products yield two different inner product spaces. For example, the polynomials $f(x) = x$ and $g(x) = x^2$ are orthogonal in the first inner product space (since $\int_{-1}^{1} t^3\,dt = 0$), but not in the second (since $\int_0^1 t^3\,dt = \frac{1}{4}$). Orthogonality is defined later in this section.

A very important inner product space that resembles C([0,1]) is the space H of continuous complex-valued functions defined on the interval [0,2π] with the inner product

$$\langle f, g \rangle = \frac{1}{2\pi} \int_0^{2\pi} f(t)\overline{g(t)}\,dt.$$

Show that the vector space $H$ with $\langle \cdot, \cdot \rangle$ defined above is an inner product space.

Check the conditions one by one:

$$\langle f + h, g \rangle = \frac{1}{2\pi} \int_0^{2\pi} (f(t) + h(t))\overline{g(t)}\,dt = \frac{1}{2\pi} \int_0^{2\pi} f(t)\overline{g(t)}\,dt + \frac{1}{2\pi} \int_0^{2\pi} h(t)\overline{g(t)}\,dt = \langle f, g \rangle + \langle h, g \rangle.$$

$$\langle cf, g \rangle = \frac{1}{2\pi} \int_0^{2\pi} (cf(t))\overline{g(t)}\,dt = c \cdot \frac{1}{2\pi} \int_0^{2\pi} f(t)\overline{g(t)}\,dt = c\,\langle f, g \rangle.$$

$$\langle f, g \rangle = \overline{\langle g, f \rangle}.$$

$$\langle f, f \rangle = \frac{1}{2\pi} \int_0^{2\pi} |f(t)|^2\,dt > 0 \quad \text{if } f \neq 0.$$

At this point, we mention a few facts about integration of complex-valued functions.

  1. The imaginary number $i$ can be treated as a constant under the integration sign.
  2. Every complex-valued function $f$ may be written as $f = f_1 + i f_2$, where $f_1$ and $f_2$ are real-valued functions. Thus we have

$$\int f = \int f_1 + i \int f_2 \quad \text{and} \quad \overline{\int f} = \int \overline{f}.$$

Theorem 6.1

Let $V$ be an inner product space. Then for $x, y, z \in V$ and $c \in F$, the following statements are true.

  • $\langle x, y + z \rangle = \langle x, y \rangle + \langle x, z \rangle$.
  • $\langle x, cy \rangle = \overline{c}\,\langle x, y \rangle$.
  • $\langle x, 0 \rangle = \langle 0, x \rangle = 0$.
  • $\langle x, x \rangle = 0$ if and only if $x = 0$.
  • If $\langle x, y \rangle = \langle x, z \rangle$ for all $x \in V$, then $y = z$.

Statements 1 and 2 of Theorem 6.1 show that the inner product is conjugate linear in the second component.

Norms

Definition: Norm / Length

Let $V$ be an inner product space. For $x \in V$, we define the norm or length of $x$ by $\|x\| = \sqrt{\langle x, x \rangle}$.

Example 6.1.6

Let $V = F^n$. If $x = (a_1, a_2, \ldots, a_n)$, then

$$\|x\| = \|(a_1, a_2, \ldots, a_n)\| = \left[ \sum_{i=1}^{n} |a_i|^2 \right]^{1/2}$$

is the Euclidean definition of length. Note that if $n = 1$, we have $\|a\| = |a|$.

Theorem 6.2

Let $V$ be an inner product space over $F$. Then for all $x, y \in V$ and $c \in F$, the following statements are true.

  • $\|cx\| = |c| \cdot \|x\|$.
  • $\|x\| = 0$ if and only if $x = 0$. In any case, $\|x\| \geq 0$.
  • (Cauchy-Schwarz Inequality) $|\langle x, y \rangle| \leq \|x\| \cdot \|y\|$.
  • (Triangle Inequality) $\|x + y\| \leq \|x\| + \|y\|$.

Example 6.1.7

For $F^n$, we may apply parts 3 and 4 of Theorem 6.2 to the standard inner product to obtain the following well-known inequalities:

$$\left| \sum_{i=1}^{n} a_i \overline{b_i} \right| \leq \left[ \sum_{i=1}^{n} |a_i|^2 \right]^{1/2} \left[ \sum_{i=1}^{n} |b_i|^2 \right]^{1/2}$$

and

$$\left[ \sum_{i=1}^{n} |a_i + b_i|^2 \right]^{1/2} \leq \left[ \sum_{i=1}^{n} |a_i|^2 \right]^{1/2} + \left[ \sum_{i=1}^{n} |b_i|^2 \right]^{1/2}.$$
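As a quick numeric illustration (mine, not from the text), both inequalities can be checked for concrete vectors in $\mathbb{C}^2$:

```python
import math

def inner(x, y):
    # standard inner product on C^n
    return sum(a * b.conjugate() for a, b in zip(x, y))

def norm(x):
    return math.sqrt(inner(x, x).real)

x = [1 + 1j, 4 + 0j]
y = [2 - 3j, 4 + 5j]

assert abs(inner(x, y)) <= norm(x) * norm(y)                     # Cauchy-Schwarz
assert norm([a + b for a, b in zip(x, y)]) <= norm(x) + norm(y)  # triangle inequality
print("both inequalities hold")
```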

Orthogonal [core]

Definitions: orthogonal & orthonormal

Let $V$ be an inner product space. Vectors $x$ and $y$ in $V$ are orthogonal (perpendicular) if $\langle x, y \rangle = 0$.

A subset $S$ of $V$ is orthogonal if any two distinct vectors in $S$ are orthogonal. A vector $x$ in $V$ is a unit vector if $\|x\| = 1$. Finally, a subset $S$ of $V$ is orthonormal if $S$ is orthogonal and consists entirely of unit vectors.

Note that if $S = \{v_1, v_2, \ldots\}$ (possibly infinite), then $S$ is orthonormal if and only if $\langle v_i, v_j \rangle = \delta_{ij}$, where $\delta_{ij}$ denotes the Kronecker delta. Also, observe that multiplying vectors by nonzero scalars does not affect their orthogonality, and that if $x$ is any nonzero vector, then $(1/\|x\|)x$ is a unit vector. The process of multiplying a nonzero vector by the reciprocal of its length is called normalizing.

Example 6.1.8

In $F^3$, $\{(1, 1, 0), (1, -1, 1), (-1, 1, 2)\}$ is an orthogonal set of nonzero vectors, but it is not orthonormal; however, if we normalize the vectors in the set, we obtain the orthonormal set

$$\left\{ \frac{1}{\sqrt{2}}(1, 1, 0),\ \frac{1}{\sqrt{3}}(1, -1, 1),\ \frac{1}{\sqrt{6}}(-1, 1, 2) \right\}.$$

Example 6.1.9

Recall the inner product space H (defined above). We introduce an important orthonormal subset S of H.

For what follows, $i$ is the imaginary number such that $i^2 = -1$. For any integer $n$, let $f_n(t) = e^{int}$, where $0 \leq t \leq 2\pi$. (Recall that $e^{int} = \cos nt + i \sin nt$.)

Now define $S = \{f_n : n \text{ is an integer}\}$. Clearly $S$ is a subset of $H$. Using the property that $\overline{e^{it}} = e^{-it}$ for every real number $t$, we have, for $m \neq n$,

$$\langle f_m, f_n \rangle = \frac{1}{2\pi} \int_0^{2\pi} e^{imt}\, \overline{e^{int}}\,dt = \frac{1}{2\pi} \int_0^{2\pi} e^{i(m-n)t}\,dt = 0.$$

Also,

$$\langle f_n, f_n \rangle = \frac{1}{2\pi} \int_0^{2\pi} 1\,dt = 1.$$

In other words, $\langle f_m, f_n \rangle = \delta_{mn}$, the Kronecker delta.


6.2 The Gram-Schmidt Orthogonalization Process and Orthogonal Complements

Orthonormal basis

Definition: orthonormal basis

Let V be an inner product space. A subset of V is an orthonormal basis for V if it is an ordered basis that is orthonormal.

Example 6.2.1

The standard ordered basis for Fn is an orthonormal basis for Fn.

Example 6.2.2

The set $\left\{ \frac{1}{\sqrt{5}}(1, 2),\ \frac{1}{\sqrt{5}}(2, -1) \right\}$ is an orthonormal basis for $\mathbb{R}^2$.

Theorem 6.3

Let $V$ be an inner product space and $S = \{v_1, v_2, \ldots, v_k\}$ be an orthogonal subset of $V$ consisting of nonzero vectors. If $y \in \operatorname{span}(S)$, then

$$y = \sum_{i=1}^{k} a_i v_i,$$

where

$$a_i = \frac{\langle y, v_i \rangle}{\|v_i\|^2} \quad \text{for } i = 1, 2, \ldots, k.$$

Corollary of Theorem 6.3

Corollary

  1. If, in addition to the hypotheses of Theorem 6.3, $S$ is orthonormal and $y \in \operatorname{span}(S)$, then
$$y = \sum_{i=1}^{k} \langle y, v_i \rangle v_i.$$
  2. Let $V$ be an inner product space, and let $S$ be an orthogonal subset of $V$ consisting of nonzero vectors. Then $S$ is linearly independent.

If $V$ possesses a finite orthonormal basis, then Corollary 1 allows us to compute the coefficients in a linear combination very easily.

Example 6.2.3

By Corollary 2, the orthonormal set

$$\left\{ \frac{1}{\sqrt{2}}(1, 1, 0),\ \frac{1}{\sqrt{3}}(1, -1, 1),\ \frac{1}{\sqrt{6}}(-1, 1, 2) \right\}$$

obtained in Example 6.1.8 is an orthonormal basis for $\mathbb{R}^3$. Let $x = (2, 1, 3)$. The coefficients given by Corollary 1 to Theorem 6.3 that express $x$ as a linear combination of the basis vectors $v_1, v_2, v_3$ are

$$a_1 = \langle x, v_1 \rangle = \frac{2 + 1}{\sqrt{2}} = \frac{3}{\sqrt{2}}, \quad a_2 = \langle x, v_2 \rangle = \frac{2 - 1 + 3}{\sqrt{3}} = \frac{4}{\sqrt{3}}, \quad a_3 = \langle x, v_3 \rangle = \frac{-2 + 1 + 6}{\sqrt{6}} = \frac{5}{\sqrt{6}}.$$

As a check, we have

$$(2, 1, 3) = \frac{3}{2}(1, 1, 0) + \frac{4}{3}(1, -1, 1) + \frac{5}{6}(-1, 1, 2).$$
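The coefficient computation above can be sketched in plain Python; `dot` and `normalize` are illustrative helper names.

```python
import math

def dot(x, y):
    # real vectors, so no conjugation is needed
    return sum(a * b for a, b in zip(x, y))

def normalize(v):
    n = math.sqrt(dot(v, v))
    return [a / n for a in v]

# the orthogonal set of Example 6.1.8, normalized into an orthonormal basis
basis = [normalize(v) for v in [(1, 1, 0), (1, -1, 1), (-1, 1, 2)]]
x = (2, 1, 3)

# Corollary 1: with an orthonormal basis, a_i = <x, v_i>
coeffs = [dot(x, u) for u in basis]   # 3/sqrt(2), 4/sqrt(3), 5/sqrt(6)
recon = [sum(c * u[i] for c, u in zip(coeffs, basis)) for i in range(3)]
print(coeffs)
print(recon)  # recovers (2, 1, 3) up to rounding
```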

Gram-Schmidt process

Before stating this theorem, let us consider a simple case. Suppose that {w1,w2} is a linearly independent subset of an inner product space (and hence a basis for some two-dimensional subspace). We want to construct an orthogonal set from {w1,w2} that spans the same subspace.

[Figure 6.1: constructing $v_2 = w_2 - cw_1$ orthogonal to $v_1 = w_1$]

Figure 6.1 suggests that the set $\{v_1, v_2\}$, where $v_1 = w_1$ and $v_2 = w_2 - cw_1$, has this property if $c$ is chosen so that $v_2$ is orthogonal to $w_1$. To find $c$, we solve

$$0 = \langle v_2, w_1 \rangle = \langle w_2 - cw_1, w_1 \rangle = \langle w_2, w_1 \rangle - c\langle w_1, w_1 \rangle.$$

So

$$c = \frac{\langle w_2, w_1 \rangle}{\|w_1\|^2}.$$

Thus

$$v_2 = w_2 - \frac{\langle w_2, w_1 \rangle}{\|w_1\|^2}\, w_1.$$

Theorem 6.4: Gram-Schmidt process [core]

Let $V$ be an inner product space and $S = \{w_1, w_2, \ldots, w_n\}$ be a linearly independent subset of $V$. Define $S' = \{v_1, v_2, \ldots, v_n\}$, where $v_1 = w_1$, and for $2 \leq k \leq n$,

$$v_k = w_k - \sum_{j=1}^{k-1} \frac{\langle w_k, v_j \rangle}{\|v_j\|^2}\, v_j.$$

Then $S'$ is an orthogonal set of nonzero vectors such that $\operatorname{span}(S') = \operatorname{span}(S)$.

This construction of {v1,v2,,vn} by the use of Theorem 6.4 is called the Gram-Schmidt process.

Example 6.2.4 [core]

In R4, let w1=(1,0,1,0), w2=(1,1,1,1), and w3=(0,1,2,1). Then {w1,w2,w3} is linearly independent. We use the Gram-Schmidt process to compute orthogonal vectors v1,v2,v3, then normalize them to obtain an orthonormal set.

Take $v_1 = w_1 = (1, 0, 1, 0)$. Then

$$v_2 = w_2 - \frac{\langle w_2, v_1 \rangle}{\|v_1\|^2} v_1 = (1, 1, 1, 1) - \frac{2}{2}(1, 0, 1, 0) = (0, 1, 0, 1).$$

Finally,

$$v_3 = w_3 - \frac{\langle w_3, v_1 \rangle}{\|v_1\|^2} v_1 - \frac{\langle w_3, v_2 \rangle}{\|v_2\|^2} v_2 = (0, 1, 2, 1) - \frac{2}{2}(1, 0, 1, 0) - \frac{2}{2}(0, 1, 0, 1) = (-1, 0, 1, 0).$$

Normalization yields the orthonormal basis $\{u_1, u_2, u_3\}$, where

$$u_1 = \frac{v_1}{\|v_1\|} = \frac{1}{\sqrt{2}}(1, 0, 1, 0), \quad u_2 = \frac{v_2}{\|v_2\|} = \frac{1}{\sqrt{2}}(0, 1, 0, 1), \quad u_3 = \frac{v_3}{\|v_3\|} = \frac{1}{\sqrt{2}}(-1, 0, 1, 0).$$
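Theorem 6.4 translates almost line for line into code. The sketch below (helper names are mine) reruns this example:

```python
def dot(x, y):
    # real vectors, so no conjugation is needed
    return sum(a * b for a, b in zip(x, y))

def gram_schmidt(ws):
    """Theorem 6.4: v_k = w_k - sum_j (<w_k, v_j> / ||v_j||^2) v_j."""
    vs = []
    for w in ws:
        v = list(w)
        for u in vs:
            c = dot(w, u) / dot(u, u)
            v = [a - c * b for a, b in zip(v, u)]
        vs.append(v)
    return vs

vs = gram_schmidt([(1, 0, 1, 0), (1, 1, 1, 1), (0, 1, 2, 1)])
print(vs)  # v1 = (1,0,1,0), v2 = (0,1,0,1), v3 = (-1,0,1,0), as in the example
```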

Example 6.2.5

Let V=P(R) with the inner product

$$\langle f(x), g(x) \rangle = \int_{-1}^{1} f(t)g(t)\,dt,$$

and consider the subspace P2(R) with the standard ordered basis β={1,x,x2}. We use the Gram-Schmidt process to replace β with an orthogonal basis {v1,v2,v3} for P2(R), then obtain an orthonormal basis.

Take $v_1 = 1$. Then $\|v_1\|^2 = \int_{-1}^{1} 1^2\,dt = 2$ and $\langle x, v_1 \rangle = \int_{-1}^{1} t \cdot 1\,dt = 0$. Thus

$$v_2 = x - \frac{\langle x, v_1 \rangle}{\|v_1\|^2} v_1 = x - \frac{0}{2} \cdot 1 = x.$$

Furthermore,

$$\langle x^2, v_1 \rangle = \int_{-1}^{1} t^2 \cdot 1\,dt = \frac{2}{3} \quad \text{and} \quad \langle x^2, v_2 \rangle = \int_{-1}^{1} t^2 \cdot t\,dt = 0.$$

Therefore

$$v_3 = x^2 - \frac{\langle x^2, v_1 \rangle}{\|v_1\|^2} v_1 - \frac{\langle x^2, v_2 \rangle}{\|v_2\|^2} v_2 = x^2 - \frac{2/3}{2} \cdot 1 - 0 \cdot x = x^2 - \frac{1}{3}.$$

We conclude that $\left\{1,\ x,\ x^2 - \frac{1}{3}\right\}$ is an orthogonal basis for $P_2(\mathbb{R})$.

To obtain an orthonormal basis, we normalize $v_1$, $v_2$, and $v_3$ to obtain

$$u_1 = \frac{1}{\sqrt{\int_{-1}^{1} 1^2\,dt}} = \frac{1}{\sqrt{2}}, \quad u_2 = \frac{x}{\sqrt{\int_{-1}^{1} t^2\,dt}} = \sqrt{\frac{3}{2}}\,x,$$

and similarly,

$$u_3 = \frac{v_3}{\|v_3\|} = \sqrt{\frac{5}{8}}(3x^2 - 1).$$

Thus {u1,u2,u3} is the desired orthonormal basis for P2(R).

If we continue applying the Gram-Schmidt orthogonalization process to the basis {1,x,x2,} for P(R), we obtain an orthogonal basis whose elements are called the Legendre polynomials. The orthogonal polynomials v1,v2, and v3 in Example 6.2.5 are the first three Legendre polynomials.
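Since the inner products here are integrals of polynomials, the construction can be carried out exactly with rational arithmetic. A sketch, assuming polynomials are represented as coefficient lists $[c_0, c_1, \ldots]$ (a representation of my choosing):

```python
from fractions import Fraction

def poly_mul(p, q):
    r = [Fraction(0)] * (len(p) + len(q) - 1)
    for i, a in enumerate(p):
        for j, b in enumerate(q):
            r[i + j] += Fraction(a) * Fraction(b)
    return r

def inner(p, q):
    # integral of p*q over [-1, 1]: t^k contributes 2/(k+1) for even k, 0 for odd k
    prod = poly_mul(p, q)
    return sum(2 * c / (k + 1) for k, c in enumerate(prod) if k % 2 == 0)

def gram_schmidt(ps):
    # assumes the inputs are listed by increasing degree
    vs = []
    for p in ps:
        v = [Fraction(c) for c in p]
        for u in vs:
            c = inner(p, u) / inner(u, u)
            u_padded = u + [Fraction(0)] * (len(v) - len(u))
            v = [a - c * b for a, b in zip(v, u_padded)]
        vs.append(v)
    return vs

vs = gram_schmidt([[1], [0, 1], [0, 0, 1]])
print(vs)  # 1, x, and x^2 - 1/3: the first three Legendre polynomials
```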

Theorem 6.5

Let $V$ be a nonzero finite-dimensional inner product space. Then $V$ has an orthonormal basis $\beta$. Furthermore, if $\beta = \{v_1, v_2, \ldots, v_n\}$ and $x \in V$, then

$$x = \sum_{i=1}^{n} \langle x, v_i \rangle v_i.$$

Example 6.2.6

We use Theorem 6.5 to represent the polynomial $f(x) = 1 + 2x + 3x^2$ as a linear combination of the vectors in the orthonormal basis $\{u_1, u_2, u_3\}$ for $P_2(\mathbb{R})$ obtained in Example 6.2.5. Observe that:

$$\langle f, u_1 \rangle = \int_{-1}^{1} \frac{1}{\sqrt{2}}(1 + 2t + 3t^2)\,dt = 2\sqrt{2},$$
$$\langle f, u_2 \rangle = \int_{-1}^{1} \sqrt{\frac{3}{2}}\,t\,(1 + 2t + 3t^2)\,dt = \frac{2\sqrt{6}}{3},$$
$$\langle f, u_3 \rangle = \int_{-1}^{1} \sqrt{\frac{5}{8}}(3t^2 - 1)(1 + 2t + 3t^2)\,dt = \frac{2\sqrt{10}}{5}.$$

Therefore,

$$f(x) = 2\sqrt{2}\,u_1 + \frac{2\sqrt{6}}{3}\,u_2 + \frac{2\sqrt{10}}{5}\,u_3.$$

Corollary of Theorem 6.5

Corollary

Let $V$ be a finite-dimensional inner product space with an orthonormal basis $\beta = \{v_1, v_2, \ldots, v_n\}$. Let $T$ be a linear operator on $V$, and let $A = [T]_\beta$. Then for any $i$ and $j$,

$$A_{ij} = \langle T(v_j), v_i \rangle.$$

Fourier coefficients [core]

Definition: Fourier coefficients

Let $\beta$ be an orthonormal subset (possibly infinite) of an inner product space $V$, and let $x \in V$. We define the Fourier coefficients of $x$ relative to $\beta$ to be the scalars $\langle x, y \rangle$, where $y \in \beta$.

Example 6.2.7

Let $S = \{e^{int} : n \text{ is an integer}\}$. In Example 6.1.9, $S$ was shown to be an orthonormal set in $H$. We compute the Fourier coefficients of $f(t) = t$ relative to $S$. Using integration by parts, for $n \neq 0$,

$$\langle f, f_n \rangle = \frac{1}{2\pi} \int_0^{2\pi} t\,\overline{e^{int}}\,dt = \frac{1}{2\pi} \int_0^{2\pi} t\,e^{-int}\,dt = \frac{i}{n},$$

and for $n = 0$,

$$\langle f, 1 \rangle = \frac{1}{2\pi} \int_0^{2\pi} t\,dt = \pi.$$
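These coefficients can be sanity-checked numerically; the sketch below (function name illustrative) approximates $\langle f, f_n \rangle$ with a midpoint Riemann sum and compares against $i/n$:

```python
import cmath
import math

def fourier_coeff(n, samples=100_000):
    """Midpoint-rule approximation of (1/2pi) * integral of t*e^{-int} over [0, 2pi]."""
    dt = 2 * math.pi / samples
    total = 0j
    for k in range(samples):
        t = (k + 0.5) * dt
        total += t * cmath.exp(-1j * n * t) * dt
    return total / (2 * math.pi)

for n in (1, 2, 3):
    assert abs(fourier_coeff(n) - 1j / n) < 1e-6   # matches i/n
print("coefficients match i/n")
```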

Orthogonal complement

Definition: orthogonal complement

Let $S$ be a nonempty subset of an inner product space $V$. We define $S^\perp$ (read "S perp") to be the set of all vectors in $V$ that are orthogonal to every vector in $S$; that is,

$$S^\perp = \{x \in V : \langle x, y \rangle = 0 \text{ for all } y \in S\}.$$

The set $S^\perp$ is called the orthogonal complement of $S$. It is easily seen that $S^\perp$ is a subspace of $V$ for any subset $S$ of $V$.

Example 6.2.8

The reader should verify that $\{0\}^\perp = V$ and $V^\perp = \{0\}$ for any inner product space $V$.

Example 6.2.9

If $V = \mathbb{R}^3$ and $S = \{e_3\}$, then $S^\perp$ equals the $xy$-plane.

Let $S_0 = \{x_0\}$, where $x_0$ is a nonzero vector in $\mathbb{R}^3$. Describe $S_0^\perp$ geometrically. Now suppose $S = \{x_1, x_2\}$ is a linearly independent subset of $\mathbb{R}^3$. Describe $S^\perp$ geometrically.

We may think of $S_0^\perp$ as the plane through the origin orthogonal to $x_0$; and since $S$ spans a plane, $S^\perp$ is the line through the origin orthogonal to that plane.

Consider the problem in R3 of finding the distance from a point P to a plane W. (See Figure 6.2.) If we let y be the vector determined by 0 and P, we may restate the problem as follows:

Determine the vector $u$ in $W$ that is "closest" to $y$. The desired distance is clearly given by $\|y - u\|$. Notice from the figure that the vector $z = y - u$ is orthogonal to every vector in $W$, so $z \in W^\perp$.

[Figure 6.2: the distance from a point $P$ to a plane $W$]

Orthogonal projection

Theorem 6.6

Let $W$ be a finite-dimensional subspace of an inner product space $V$, and let $y \in V$. Then there exist unique vectors $u \in W$ and $z \in W^\perp$ such that $y = u + z$. Furthermore, if $\{v_1, v_2, \ldots, v_k\}$ is an orthonormal basis for $W$, then

$$u = \sum_{i=1}^{k} \langle y, v_i \rangle v_i.$$

P.S. From Corollary 1 of Theorem 6.3, since $u \in W = \operatorname{span}(\{v_1, \ldots, v_k\})$:

$$u = \sum_{i=1}^{k} \langle u, v_i \rangle v_i.$$

Corollary

In the notation of Theorem 6.6, the vector $u$ is the unique vector in $W$ that is "closest" to $y$; that is, for any $x \in W$, $\|y - x\| \geq \|y - u\|$, and this inequality is an equality if and only if $x = u$.

The vector u is called the orthogonal projection of y onto W.

Example 6.2.10

Let V=P3(R) with inner product

$$\langle f(x), g(x) \rangle = \int_{-1}^{1} f(t)g(t)\,dt \quad \text{for all } f, g \in V.$$

We compute the orthogonal projection f1(x) of f(x)=x3 on P2(R).

By Example 6.2.5, $\{u_1, u_2, u_3\}$ is an orthonormal basis for $P_2(\mathbb{R})$, where

$$u_1 = \frac{1}{\sqrt{2}}, \quad u_2 = \sqrt{\frac{3}{2}}\,x, \quad u_3 = \sqrt{\frac{5}{8}}(3x^2 - 1).$$

Computing inner products:

$$\langle f, u_1 \rangle = 0, \quad \langle f, u_2 \rangle = \int_{-1}^{1} t^3 \sqrt{\frac{3}{2}}\,t\,dt = \frac{\sqrt{6}}{5}, \quad \langle f, u_3 \rangle = 0.$$

Hence

$$f_1(x) = \sum_{i=1}^{3} \langle f(x), u_i \rangle u_i = \frac{\sqrt{6}}{5} \sqrt{\frac{3}{2}}\,x = \frac{3}{5}x.$$
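This projection can be verified exactly with rational arithmetic by using the orthogonal (rather than orthonormal) basis $\{1, x, x^2 - \tfrac{1}{3}\}$ together with the coefficient formula of Theorem 6.3, which avoids square roots; the coefficient-list representation is my own:

```python
from fractions import Fraction as Fr

def inner(p, q):
    """<p, q> = integral over [-1, 1] of p*q, for coefficient lists [c0, c1, ...]."""
    prod = [Fr(0)] * (len(p) + len(q) - 1)
    for i, a in enumerate(p):
        for j, b in enumerate(q):
            prod[i + j] += Fr(a) * Fr(b)
    # t^k integrates to 2/(k+1) over [-1, 1] for even k, and to 0 for odd k
    return sum(2 * c / (k + 1) for k, c in enumerate(prod) if k % 2 == 0)

f = [0, 0, 0, 1]                                              # x^3
basis = [[Fr(1)], [Fr(0), Fr(1)], [Fr(-1, 3), Fr(0), Fr(1)]]  # 1, x, x^2 - 1/3

proj = [Fr(0)] * 3
for v in basis:
    c = inner(f, v) / inner(v, v)   # a_i = <y, v_i> / ||v_i||^2 (Theorem 6.3)
    for k, b in enumerate(v):
        proj[k] += c * b
print(proj)  # coefficients [0, 3/5, 0], i.e. f1(x) = (3/5)x
```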

Theorem 6.7

Suppose that $S = \{v_1, v_2, \ldots, v_k\}$ is an orthonormal set in an $n$-dimensional inner product space $V$. Then

  • $S$ can be extended to an orthonormal basis $\{v_1, v_2, \ldots, v_k, v_{k+1}, \ldots, v_n\}$ for $V$.

  • If $W = \operatorname{span}(S)$, then

$$S_1 = \{v_{k+1}, v_{k+2}, \ldots, v_n\}$$

is an orthonormal basis for $W^\perp$.

  • If $W$ is any subspace of $V$, then
$$\dim(V) = \dim(W) + \dim(W^\perp).$$

Example 6.2.11

Let $W = \operatorname{span}(\{e_1, e_2\})$ in $F^3$. Then $x = (a, b, c) \in W^\perp$ if and only if

$$\langle x, e_1 \rangle = a = 0 \quad \text{and} \quad \langle x, e_2 \rangle = b = 0.$$

So $x = (0, 0, c)$, and therefore $W^\perp = \operatorname{span}(\{e_3\})$. Thus $e_3 \in W^\perp$, and from the third claim of Theorem 6.7, $\dim(W^\perp) = 3 - 2 = 1$.

Released under the MIT License.