
2 Controllability, bang-bang control

2.1 Definitions

Controllability question

Given the initial point $x^0$ and a “target” set $S \subseteq \mathbb{R}^n$, does there exist a control steering the system to $S$ in finite time?

For the time being we will therefore not introduce any payoff criterion characterizing an “optimal” control, but will instead focus on whether there exist controls steering the system to a given goal. In this chapter we will mostly consider the problem of driving the system to the origin, $S = \{0\}$.

Definition

Definition: reachable set

  • reachable set for time t: $\mathcal{C}(t)$ = set of initial points $x^0$ for which there exists a control such that $x(t) = 0$;

  • reachable set: $\mathcal{C}$ = set of initial points $x^0$ for which there exists a control such that $x(t) = 0$ for some finite time $t$;

    $$\mathcal{C} = \bigcup_{t \ge 0} \mathcal{C}(t)$$

Let $\mathbb{M}^{n \times m}$ denote the set of all $n \times m$ matrices. Assume in this chapter and the next that the ODE is linear in both the state $x(\cdot)$ and the control $\alpha(\cdot)$, and has the form

$$\begin{cases} \dot x(t) = Mx(t) + N\alpha(t)\\ x(0) = x^0 \end{cases}$$

where $M \in \mathbb{M}^{n \times n}$ and $N \in \mathbb{M}^{n \times m}$. Assume the set $A$ of control parameters is a cube in $\mathbb{R}^m$:

$$A = [-1,1]^m = \{a \in \mathbb{R}^m \mid |a_i| \le 1,\ i = 1, \dots, m\}$$

2.2 Quick review of linear ODE

Definition: fundamental solution

Let $\mathbf{X}(\cdot) : \mathbb{R} \to \mathbb{M}^{n \times n}$ be the unique solution of the matrix ODE

$$\begin{cases} \dot{\mathbf{X}}(t) = M\mathbf{X}(t) & (t \in \mathbb{R})\\ \mathbf{X}(0) = I. \end{cases}$$

We call $\mathbf{X}(t)$ the fundamental solution and sometimes write

$$\mathbf{X}(t) = e^{tM} := \sum_{k=0}^{\infty} \frac{t^k M^k}{k!},$$

the last formula being the definition of the exponential $e^{tM}$. Observe that

$$\mathbf{X}^{-1}(t) = \mathbf{X}(-t)$$
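The series definition of $e^{tM}$ and the identity $\mathbf{X}^{-1}(t) = \mathbf{X}(-t)$ can be checked numerically. Below is a minimal sketch in plain Python; the truncation level `K=30` and the example matrix are illustrative choices, not from the text:

```python
def mat_mul(A, B):
    """Product of two square matrices given as lists of rows."""
    n = len(A)
    return [[sum(A[i][k] * B[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]

def expm(M, t, K=30):
    """Approximate X(t) = e^{tM} by the truncated series sum_{k<=K} (tM)^k / k!."""
    n = len(M)
    term = [[float(i == j) for j in range(n)] for i in range(n)]  # k = 0 term: I
    total = [row[:] for row in term]
    for k in range(1, K + 1):
        term = mat_mul(term, M)                            # multiply by M ...
        term = [[t * x / k for x in row] for row in term]  # ... and scale by t/k
        total = [[a + b for a, b in zip(ra, rb)] for ra, rb in zip(total, term)]
    return total

# For M = [[0,1],[-1,0]] the flow is a rotation: e^{tM} = [[cos t, sin t], [-sin t, cos t]],
# so X(t) X(-t) should come out as the identity, matching X^{-1}(t) = X(-t).
M = [[0.0, 1.0], [-1.0, 0.0]]
P = mat_mul(expm(M, 1.0), expm(M, -1.0))   # approximately the identity matrix
```

At $t = 1$ the tail of the series beyond $k = 30$ is far below floating-point precision, so the truncation is harmless for this example.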

Theorem 2.1: Solving linear systems of ODE

  1. The unique solution of the homogeneous system of ODE

    $$\begin{cases} \dot x(t) = Mx(t)\\ x(0) = x^0 \end{cases}$$

is

$$x(t) = \mathbf{X}(t)x^0 = e^{tM}x^0$$

  2. The unique solution of the nonhomogeneous system

    $$\begin{cases} \dot x(t) = Mx(t) + f(t)\\ x(0) = x^0 \end{cases}$$

is

$$x(t) = \mathbf{X}(t)x^0 + \mathbf{X}(t)\int_0^t \mathbf{X}^{-1}(s) f(s)\,ds$$

This expression is the variation of parameters formula.
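As a sanity check, the variation of parameters formula can be compared against direct numerical integration of the ODE. A sketch in plain Python for the matrix $M = \begin{pmatrix}0 & 1\\ -1 & 0\end{pmatrix}$, chosen because its fundamental solution is a rotation matrix in closed form; the forcing $f$ and the step counts are arbitrary illustrative choices:

```python
import math

def X(t):
    # closed-form fundamental solution for M = [[0, 1], [-1, 0]]
    return [[math.cos(t), math.sin(t)], [-math.sin(t), math.cos(t)]]

def mv(A, v):
    # 2x2 matrix-vector product
    return [A[0][0]*v[0] + A[0][1]*v[1], A[1][0]*v[0] + A[1][1]*v[1]]

def variation_of_parameters(x0, f, t, steps=4000):
    """x(t) = X(t) x0 + X(t) * int_0^t X(-s) f(s) ds, with midpoint quadrature."""
    h = t / steps
    acc = [0.0, 0.0]
    for k in range(steps):
        s = (k + 0.5) * h
        w = mv(X(-s), f(s))        # X^{-1}(s) = X(-s)
        acc[0] += w[0] * h
        acc[1] += w[1] * h
    return mv(X(t), [x0[0] + acc[0], x0[1] + acc[1]])

def euler(x0, f, t, steps=100000):
    """Direct forward-Euler integration of x' = Mx + f(t)."""
    h = t / steps
    x = list(x0)
    for k in range(steps):
        fs = f(k * h)
        x = [x[0] + h * (x[1] + fs[0]), x[1] + h * (-x[0] + fs[1])]
    return x

f = lambda s: [1.0, 0.0]
a = variation_of_parameters([1.0, 0.0], f, 1.0)
b = euler([1.0, 0.0], f, 1.0)      # the two answers should agree closely
```

The quadrature in `variation_of_parameters` is far more accurate than the Euler integration here, so agreement to a few decimal places is the expected outcome.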

2.3 Controllability of linear equations

According to the variation of parameters formula, the solution of (linear ODE) for a given control $\alpha(\cdot)$ is

$$x(t) = \mathbf{X}(t)x^0 + \mathbf{X}(t)\int_0^t \mathbf{X}^{-1}(s)N\alpha(s)\,ds.$$

Consequently,

$$\begin{aligned} x^0 \in \mathcal{C}(t) &\iff \text{there exists a control } \alpha(\cdot) \in \mathcal{A} \text{ s.t. } x(t) = 0\\ &\iff 0 = \mathbf{X}(t)x^0 + \mathbf{X}(t)\int_0^t \mathbf{X}^{-1}(s)N\alpha(s)\,ds \ \text{ for some control } \alpha(\cdot) \in \mathcal{A}\\ &\iff x^0 = -\int_0^t \mathbf{X}^{-1}(s)N\alpha(s)\,ds \ \text{ for some control } \alpha(\cdot) \in \mathcal{A} \end{aligned}$$

Theorem 2.2: Structure of reachable set

  1. The reachable set $\mathcal{C}$ is symmetric and convex.
  2. Also, if $x^0 \in \mathcal{C}(\bar t)$, then $x^0 \in \mathcal{C}(t)$ for all times $t \ge \bar t$.

Definition

Definition: symmetric & convex

  1. A set $S$ is symmetric if $x \in S$ implies $-x \in S$.
  2. The set $S$ is convex if $x, \hat x \in S$ and $0 \le \lambda \le 1$ imply $\lambda x + (1-\lambda)\hat x \in S$.

Proof of theorem 2.2:

  1. (Symmetry) Let $t \ge 0$ and $x^0 \in \mathcal{C}(t)$. Then $x^0 = -\int_0^t \mathbf{X}^{-1}(s)N\alpha(s)\,ds$ for some admissible control $\alpha \in \mathcal{A}$.

Therefore $-x^0 = -\int_0^t \mathbf{X}^{-1}(s)N(-\alpha(s))\,ds$, and $-\alpha(s) \in A$ since the set $A$ is symmetric.

Therefore $-x^0 \in \mathcal{C}(t)$, and so each set $\mathcal{C}(t)$ is symmetric. It follows that $\mathcal{C}$ is symmetric.

  2. (Convexity) Take $x^0, \hat x^0 \in \mathcal{C}$, so that $x^0 \in \mathcal{C}(t)$ and $\hat x^0 \in \mathcal{C}(\hat t)$ for appropriate times $t, \hat t \ge 0$. Assume $t \le \hat t$. Then

$$\begin{aligned} x^0 &= -\int_0^t \mathbf{X}^{-1}(s)N\alpha(s)\,ds &&\text{for some control } \alpha(\cdot) \in \mathcal{A}\\ \hat x^0 &= -\int_0^{\hat t} \mathbf{X}^{-1}(s)N\hat\alpha(s)\,ds &&\text{for some control } \hat\alpha(\cdot) \in \mathcal{A} \end{aligned}$$

Define a new control

$$\tilde\alpha(s) := \begin{cases} \alpha(s) & \text{if } 0 \le s \le t\\ 0 & \text{if } s > t \end{cases}$$

Then

$$x^0 = -\int_0^{\hat t} \mathbf{X}^{-1}(s)N\tilde\alpha(s)\,ds$$

and hence $x^0 \in \mathcal{C}(\hat t)$. Now let $0 \le \lambda \le 1$, and observe

$$\lambda x^0 + (1-\lambda)\hat x^0 = -\int_0^{\hat t} \mathbf{X}^{-1}(s)N\big(\lambda\tilde\alpha(s) + (1-\lambda)\hat\alpha(s)\big)\,ds$$

Therefore $\lambda x^0 + (1-\lambda)\hat x^0 \in \mathcal{C}(\hat t) \subseteq \mathcal{C}$.

  3. Assertion (ii) follows from the argument in step 2, which shows that $x^0 \in \mathcal{C}(\bar t)$ implies $x^0 \in \mathcal{C}(t)$ for any $t \ge \bar t$ (extend the control by zero).

A simple example

Let $n = 2$, $m = 1$, $A = [-1,1]$, and write $x(t) = (x_1(t), x_2(t))^T$. Suppose

$$\begin{cases} \dot x_1 = 0\\ \dot x_2 = \alpha(t). \end{cases}$$

This is a system of the form $\dot x = Mx + N\alpha$, for

$$M = \begin{pmatrix} 0 & 0\\ 0 & 0 \end{pmatrix}, \qquad N = \begin{pmatrix} 0\\ 1 \end{pmatrix}$$

Clearly $\mathcal{C} = \{(x_1, x_2) \mid x_1 = 0\}$, the $x_2$-axis.

We next wish to establish some general algebraic conditions ensuring that $\mathcal{C}$ contains a neighborhood of the origin.

Controllability

Definition: controllability matrix

The controllability matrix is

$$G = G(M,N) := \underbrace{[N,\, MN,\, M^2N,\, \dots,\, M^{n-1}N]}_{n \times (mn) \text{ matrix}}$$
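For small systems, $G$ and its rank are easy to compute exactly. A sketch in plain Python using exact rational arithmetic; the two test pairs $(M, N)$ are the simple example above (where $\operatorname{rank} G = 1 < n$) and a second illustrative pair with full rank:

```python
from fractions import Fraction

def mat_mul(A, B):
    return [[sum(A[i][k] * B[k][j] for k in range(len(B)))
             for j in range(len(B[0]))] for i in range(len(A))]

def controllability_matrix(M, N):
    """G = [N, MN, M^2 N, ..., M^{n-1} N], an n x (mn) matrix."""
    n = len(M)
    blocks, P = [], [row[:] for row in N]      # P runs through M^k N
    for _ in range(n):
        blocks.append(P)
        P = mat_mul(M, P)
    # horizontally concatenate the blocks
    return [sum((blk[i] for blk in blocks), []) for i in range(n)]

def rank(A):
    """Rank via Gaussian elimination over the rationals (exact)."""
    A = [[Fraction(x) for x in row] for row in A]
    r = 0
    for c in range(len(A[0])):
        piv = next((i for i in range(r, len(A)) if A[i][c] != 0), None)
        if piv is None:
            continue
        A[r], A[piv] = A[piv], A[r]
        for i in range(r + 1, len(A)):
            m = A[i][c] / A[r][c]
            A[i] = [a - m * b for a, b in zip(A[i], A[r])]
        r += 1
    return r

# Simple example above: M = 0, N = (0, 1)^T gives G = [N, 0] of rank 1 < n = 2.
G1 = controllability_matrix([[0, 0], [0, 0]], [[0], [1]])
# By contrast, M = [[0, 1], [0, 0]] with the same N gives G = [N, MN] of full rank 2.
G2 = controllability_matrix([[0, 1], [0, 0]], [[0], [1]])
```

Exact `Fraction` arithmetic avoids the usual pitfall of deciding rank from floating-point pivots.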

Theorem 2.3: Controllability matrix

$$\operatorname{rank} G = n \iff 0 \in \mathcal{C}^\circ$$

Notation

  • $\mathcal{C}^\circ$: the interior of the set $\mathcal{C}$, i.e., the points of $\mathcal{C}$ that have a neighborhood contained in $\mathcal{C}$
  • rank of $G$ = number of linearly independent rows (equivalently, columns) of $G$; $\operatorname{rank} G \le n$

Proof:

  1. Suppose first that $\operatorname{rank} G < n$. This means that the linear span of the columns of $G$ has dimension at most $n - 1$. Thus there exists a vector $b \in \mathbb{R}^n$, $b \ne 0$, orthogonal to each column of $G$. This implies $b^T G = 0$, and so

$$b^T N = b^T MN = \dots = b^T M^{n-1}N = 0$$

  2. In fact, $b^T M^k N = 0$ for every integer $k \ge 0$. To confirm this, recall that

$$p(\lambda) := \det(\lambda I - M)$$

is the characteristic polynomial of $M$. The Cayley–Hamilton Theorem states that $p(M) = 0$. So if we write

$$p(\lambda) = \lambda^n + \beta_{n-1}\lambda^{n-1} + \dots + \beta_1\lambda + \beta_0,$$

then

$$p(M) = M^n + \beta_{n-1}M^{n-1} + \dots + \beta_1 M + \beta_0 I = 0$$

Therefore

$$M^n = -\beta_{n-1}M^{n-1} - \beta_{n-2}M^{n-2} - \dots - \beta_1 M - \beta_0 I$$

and so

$$b^T M^n N = b^T\big(-\beta_{n-1}M^{n-1} - \dots - \beta_0 I\big)N = 0$$

Similarly, $b^T M^{n+1}N = 0$, etc.

Now notice that

$$b^T \mathbf{X}^{-1}(s)N = b^T e^{-sM}N = b^T \sum_{k=0}^{\infty} \frac{(-s)^k M^k}{k!} N = \sum_{k=0}^{\infty} \frac{(-s)^k}{k!}\, b^T M^k N = 0$$
  3. Assume next that $x^0 \in \mathcal{C}(t)$. This is equivalent to having

$$x^0 = -\int_0^t \mathbf{X}^{-1}(s)N\alpha(s)\,ds \quad \text{for some control } \alpha(\cdot) \in \mathcal{A}$$

Then

$$b \cdot x^0 = -\int_0^t b^T \mathbf{X}^{-1}(s)N\alpha(s)\,ds = 0$$

This says that $b$ is orthogonal to $x^0$. In other words, $\mathcal{C}$ must lie in the hyperplane orthogonal to $b \ne 0$. Consequently $\mathcal{C}^\circ = \emptyset$, so in particular $0 \notin \mathcal{C}^\circ$.

(How to understand this: a set contained in a hyperplane contains no open ball, so its interior is empty.)

  4. Conversely, assume that $0 \notin \mathcal{C}^\circ$. Then $0 \notin \mathcal{C}(t)^\circ$ for all $t > 0$. Since $\mathcal{C}(t)$ is convex, there exists a supporting hyperplane to $\mathcal{C}(t)$ through $0$. (Such a hyperplane keeps the set on one side; it exists because $0$ is not an interior point.) This means there exists $b \ne 0$ s.t. $b \cdot x^0 \le 0$ for all $x^0 \in \mathcal{C}(t)$.

(An equation for a hyperplane through the origin is $b \cdot x = 0$.)

Choose any $x^0 \in \mathcal{C}(t)$. Then

$$x^0 = -\int_0^t \mathbf{X}^{-1}(s)N\alpha(s)\,ds$$

for some control $\alpha$, and therefore

$$0 \ge b \cdot x^0 = -\int_0^t b^T\mathbf{X}^{-1}(s)N\alpha(s)\,ds$$

Thus

$$\int_0^t b^T\mathbf{X}^{-1}(s)N\alpha(s)\,ds \ge 0 \quad \text{for all controls } \alpha(\cdot)$$

We assert that therefore

$$b^T\mathbf{X}^{-1}(s)N \equiv 0,$$

a proof of which follows as a lemma below. We rewrite this as

$$b^T e^{-sM}N \equiv 0$$

Let $s = 0$ to see that $b^T N = 0$. Next differentiate with respect to $s$, to find that

$$b^T(-M)e^{-sM}N \equiv 0$$

For $s = 0$ this says

$$b^T MN = 0$$

We repeatedly differentiate, to deduce

$$b^T M^k N = 0, \quad k = 0, 1, \dots,$$

and so $b^T G = 0$. This implies $\operatorname{rank} G < n$, since $b \ne 0$.

Lemma 2.4: Integral inequalities

Assume that

$$\int_0^t b^T\mathbf{X}^{-1}(s)N\alpha(s)\,ds \ge 0$$

for all controls α(). Then

$$b^T\mathbf{X}^{-1}(s)N \equiv 0$$

Proof: Replacing $\alpha$ with $-\alpha$, we see that in fact

$$\int_0^t b^T\mathbf{X}^{-1}(s)N\alpha(s)\,ds = 0$$

for all controls $\alpha(\cdot)$.

Define

$$v(s) := b^T\mathbf{X}^{-1}(s)N$$

If $v \not\equiv 0$, then $v(s_0) \ne 0$ for some $s_0$. Then there exists an interval $I$ s.t. $s_0 \in I$ and $v \ne 0$ on $I$. Now define $\alpha(\cdot) \in \mathcal{A}$ this way:

$$\begin{cases} \alpha(s) = 0 & (s \notin I)\\ \alpha(s) = \dfrac{v^T(s)}{|v(s)|}\dfrac{1}{\sqrt{n}} & (s \in I) \end{cases}$$

Then

$$0 = \int_0^t v(s)\alpha(s)\,ds = \int_I v(s)\,\frac{v^T(s)}{\sqrt{n}\,|v(s)|}\,ds = \frac{1}{\sqrt{n}}\int_I |v(s)|\,ds$$

This implies the contradiction that $v \equiv 0$ on $I$.

Definition: controllable

We say the linear system (ODE) is controllable if $\mathcal{C} = \mathbb{R}^n$.

Theorem 2.5: Criterion for controllability

Let $A$ be the cube $[-1,1]^m$ in $\mathbb{R}^m$. Suppose as well that $\operatorname{rank} G = n$, and $\operatorname{Re}\lambda < 0$ for each eigenvalue $\lambda$ of the matrix $M$. Then the system (ODE) is controllable.

Proof: Since $\operatorname{rank} G = n$, Theorem 2.3 tells us that $\mathcal{C}$ contains some ball $B$ centered at $0$. Now take any $x^0 \in \mathbb{R}^n$ and consider the evolution

$$\begin{cases} \dot x(t) = Mx(t)\\ x(0) = x^0 \end{cases}$$

in other words, take the control $\alpha(\cdot) \equiv 0$. Since $\operatorname{Re}\lambda < 0$ for each eigenvalue $\lambda$ of the matrix $M$, the origin is asymptotically stable. So there exists a time $T$ s.t. $x(T) \in B$. Thus $x(T) \in B \subseteq \mathcal{C}$; and hence there exists a control $\alpha(\cdot) \in \mathcal{A}$ steering $x(T)$ to $0$ in finite time.

Example. We once again consider the rocket railroad car from §1.2, for which $n = 2$, $m = 1$, $A = [-1,1]$, and

$$\dot x = \begin{bmatrix} 0 & 1\\ 0 & 0 \end{bmatrix}x + \begin{bmatrix} 0\\ 1 \end{bmatrix}\alpha$$

Then

$$G = [N, MN] = \begin{bmatrix} 0 & 1\\ 1 & 0 \end{bmatrix}$$

Therefore $\operatorname{rank} G = 2 = n$.

Also, the characteristic polynomial of the matrix $M$ is

$$p(\lambda) = \det(\lambda I - M) = \det\begin{pmatrix} \lambda & -1\\ 0 & \lambda \end{pmatrix} = \lambda^2$$

Since the eigenvalues are both $0$, we fail to satisfy the hypotheses of Theorem 2.5.
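The two computations in this example reduce to $2 \times 2$ arithmetic: for a $2 \times 2$ matrix, rank $2$ is equivalent to a nonzero determinant, and $p(\lambda) = \lambda^2 - (\operatorname{tr} M)\lambda + \det M$. A few lines of Python confirm both:

```python
M = [[0.0, 1.0], [0.0, 0.0]]
N = [0.0, 1.0]

# MN as a column vector
MN = [M[0][0] * N[0] + M[0][1] * N[1],
      M[1][0] * N[0] + M[1][1] * N[1]]
G = [[N[0], MN[0]],
     [N[1], MN[1]]]                              # columns N and MN

det_G = G[0][0] * G[1][1] - G[0][1] * G[1][0]    # nonzero, so rank G = 2 = n

# p(lambda) = lambda^2 - tr(M) lambda + det(M); here tr M = det M = 0, so p = lambda^2
tr_M = M[0][0] + M[1][1]
det_M = M[0][0] * M[1][1] - M[0][1] * M[1][0]
```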

This example motivates the following extension of the previous theorem:

Theorem 2.6: Improved criterion for controllability

Assume $\operatorname{rank} G = n$ and $\operatorname{Re}\lambda \le 0$ for each eigenvalue $\lambda$ of $M$. Then the system (ODE) is controllable.

Proof:

  1. If $\mathcal{C} \ne \mathbb{R}^n$, then the convexity of $\mathcal{C}$ implies that there exist a vector $b \ne 0$ and a real number $\mu$ s.t.

$$b \cdot x^0 \le \mu \quad \text{for all } x^0 \in \mathcal{C}$$

(A convex set that is not all of $\mathbb{R}^n$ must admit a supporting hyperplane.)

Indeed, in the picture we see that $b \cdot (x^0 - z^0) \le 0$; and this implies $b \cdot x^0 \le b \cdot z^0 =: \mu$.

(Figure: a supporting hyperplane to $\mathcal{C}$ through a boundary point $z^0$.)

We will derive a contradiction.

  2. Given $b \ne 0$ and $\mu \in \mathbb{R}$, our intention is to find $x^0 \in \mathcal{C}$ s.t. $b \cdot x^0 > \mu$, so that the inequality of step 1 fails. Recall $x^0 \in \mathcal{C}$ iff there exist a time $t > 0$ and a control $\alpha(\cdot) \in \mathcal{A}$ s.t.

$$x^0 = -\int_0^t \mathbf{X}^{-1}(s)N\alpha(s)\,ds$$

Then

$$b \cdot x^0 = -\int_0^t b^T\mathbf{X}^{-1}(s)N\alpha(s)\,ds$$

Define

$$v(s) := b^T\mathbf{X}^{-1}(s)N$$
  3. We assert that $v \not\equiv 0$.

To see this, suppose instead that $v \equiv 0$. Then $k$ times differentiate the expression $b^T\mathbf{X}^{-1}(s)N$ with respect to $s$ and set $s = 0$, to discover

$$b^T M^k N = 0, \quad k = 0, 1, 2, \dots$$

This implies $b$ is orthogonal to the columns of $G$, and so $\operatorname{rank} G < n$. This contradicts our hypothesis, and therefore $v \not\equiv 0$.

  4. Next, define $\alpha(\cdot)$ this way:

$$\alpha(s) := \begin{cases} -\dfrac{v^T(s)}{|v(s)|} & \text{if } v(s) \ne 0,\\ 0 & \text{if } v(s) = 0. \end{cases}$$

Then

$$b \cdot x^0 = -\int_0^t v(s)\alpha(s)\,ds = \int_0^t |v(s)|\,ds$$

We want to find a time $t > 0$ s.t. $\int_0^t |v(s)|\,ds > \mu$. In fact, we assert that

$$\int_0^\infty |v(s)|\,ds = +\infty$$

To prove this assertion, introduce the function

$$\phi(t) := \int_t^\infty v(s)\,ds$$

We will find an ODE that $\phi$ satisfies. Take $p(\cdot)$ to be the characteristic polynomial of $M$. Then

$$p\!\left(-\frac{d}{dt}\right)v(t) = p\!\left(-\frac{d}{dt}\right)\left[b^T e^{-tM}N\right] = b^T\left(p(M)e^{-tM}\right)N \equiv 0,$$

since $p(M) = 0$ according to the Cayley–Hamilton Theorem. But $\phi'(t) = -v(t)$, and since $p\!\left(-\frac{d}{dt}\right)v(t) \equiv 0$, it follows that

$$\frac{d}{dt}\,p\!\left(-\frac{d}{dt}\right)\phi(t) = -p\!\left(-\frac{d}{dt}\right)v(t) \equiv 0$$

Hence $\phi$ solves the $(n+1)$th order ODE

$$\frac{d}{dt}\,p\!\left(-\frac{d}{dt}\right)\phi(t) = 0$$

We also know that $\phi(\cdot) \not\equiv 0$. Let $\mu_1, \dots, \mu_{n+1}$ be the solutions of $\mu\, p(-\mu) = 0$. According to ODE theory, we can write

$$\phi(t) = \text{sum of terms of the form } p_i(t)e^{\mu_i t}$$

for appropriate polynomials $p_i(\cdot)$.

Furthermore, we see that $\mu_{n+1} = 0$ and $\mu_k = -\lambda_k$, where $\lambda_1, \dots, \lambda_n$ are the eigenvalues of $M$. By assumption, $\operatorname{Re}\mu_k \ge 0$ for $k = 1, \dots, n+1$. If $\int_0^\infty |v(s)|\,ds < \infty$, then

$$|\phi(t)| \le \int_t^\infty |v(s)|\,ds \to 0 \quad \text{as } t \to \infty,$$

that is, $\phi(t) \to 0$ as $t \to \infty$. This contradicts the representation formula $\phi(t) = \sum_i p_i(t)e^{\mu_i t}$ with $\operatorname{Re}\mu_i \ge 0$ and $\phi \not\equiv 0$. The assertion is proved.

  5. Consequently, given any $\mu$, there exists $t > 0$ s.t.

$$b \cdot x^0 = \int_0^t |v(s)|\,ds > \mu,$$

a contradiction to the inequality $b \cdot x^0 \le \mu$ from step 1. Therefore $\mathcal{C} = \mathbb{R}^n$.

2.4 Observability

Consider the linear system of ODE

$$\begin{cases} \dot x(t) = Mx(t)\\ x(0) = x^0, \end{cases}$$

where $M \in \mathbb{M}^{n \times n}$.

In this section we address the observability problem, modeled as follows. We suppose that we can observe

$$y(t) := Nx(t) \quad (t \ge 0)$$

for a given matrix $N \in \mathbb{M}^{m \times n}$. Consequently, $y(t) \in \mathbb{R}^m$. The interesting situation is when $m \ll n$, and we interpret $y(\cdot)$ as low-dimensional “observations” or “measurements” of the high-dimensional dynamics $x(\cdot)$.

Observability question: Given the observation $y(\cdot)$, can we in principle reconstruct $x(\cdot)$? In particular, do observations of $y(\cdot)$ provide enough information for us to deduce the initial value $x^0$ for (ODE)?

Definition: observable

The pair (ODE, Observation) is called observable if the knowledge of $y(\cdot)$ on any time interval $[0, t]$ allows us to compute $x^0$.

More precisely, (ODE, Observation) is observable if for all solutions $x^1(\cdot), x^2(\cdot)$, $Nx^1(\cdot) \equiv Nx^2(\cdot)$ on a time interval $[0, t]$ implies $x^1(0) = x^2(0)$.

Two simple examples

  1. If $N \equiv 0$, then clearly the system is not observable.
  2. On the other hand, if $m = n$ and $N$ is invertible, then $x(t) = N^{-1}y(t)$, so the system is clearly observable.

The interesting cases lie between these extremes.

Theorem 2.7: Observability and controllability

The system 1

$$\begin{cases} \dot x(t) = Mx(t)\\ y(t) = Nx(t) \end{cases}$$

is observable iff the system 2

$$\dot z(t) = M^T z(t) + N^T\alpha(t), \quad A = \mathbb{R}^m,$$

is controllable, meaning that $\mathcal{C} = \mathbb{R}^n$.

INTERPRETATION. This theorem asserts that somehow “observability and controllability are dual concepts” for linear systems.

Proof:

  1. ($\Leftarrow$) Suppose system 1 is not observable. Then there exist $x^1 \ne x^2 \in \mathbb{R}^n$ s.t.

    $$\begin{cases} \dot x^1(t) = Mx^1(t), & x^1(0) = x^1\\ \dot x^2(t) = Mx^2(t), & x^2(0) = x^2 \end{cases}$$

but $y(t) := Nx^1(t) \equiv Nx^2(t)$ for all $t \ge 0$. Let

$$x(t) := x^1(t) - x^2(t), \qquad x^0 := x^1 - x^2$$

Then

$$\dot x(t) = Mx(t), \qquad x(0) = x^0 \ne 0,$$

but

$$Nx(t) = 0 \quad (t \ge 0)$$

Now

$$x(t) = \mathbf{X}(t)x^0 = e^{tM}x^0$$

Thus

$$Ne^{tM}x^0 = 0 \quad (t \ge 0)$$

Let $t = 0$, to find $Nx^0 = 0$. Then differentiate this expression $k$ times in $t$ and let $t = 0$, to discover as well that

$$NM^k x^0 = 0$$

for $k = 0, 1, 2, \dots$. Hence $(x^0)^T (M^k)^T N^T = 0$, and hence $(x^0)^T (M^T)^k N^T = 0$. This implies

$$(x^0)^T [N^T, M^T N^T, \dots, (M^T)^{n-1}N^T] = 0$$

Since $x^0 \ne 0$, $\operatorname{rank}[N^T, \dots, (M^T)^{n-1}N^T] < n$. Thus system 2 is not controllable. Consequently, if system 2 is controllable, then system 1 is observable.

  2. ($\Rightarrow$) Assume now system 2 is not controllable. Then $\operatorname{rank}[N^T, \dots, (M^T)^{n-1}N^T] < n$, and consequently, according to Theorem 2.3, there exists $x^0 \ne 0$ s.t.

$$(x^0)^T [N^T, \dots, (M^T)^{n-1}N^T] = 0$$

That is, $NM^k x^0 = 0$ for all $k = 0, 1, \dots, n-1$.

We want to show that $y(t) = Nx(t) \equiv 0$, where

$$\begin{cases} \dot x(t) = Mx(t)\\ x(0) = x^0. \end{cases}$$

According to the Cayley–Hamilton Theorem, we can write

$$M^n = -\beta_{n-1}M^{n-1} - \dots - \beta_0 I$$

for appropriate constants. Consequently $NM^n x^0 = 0$. Likewise,

$$M^{n+1} = M(-\beta_{n-1}M^{n-1} - \dots - \beta_0 I) = -\beta_{n-1}M^n - \dots - \beta_0 M,$$

and so $NM^{n+1}x^0 = 0$. Similarly, $NM^k x^0 = 0$ for all $k$.

Now

$$x(t) = \mathbf{X}(t)x^0 = e^{tM}x^0 = \sum_{k=0}^{\infty} \frac{t^k M^k}{k!} x^0,$$

and therefore

$$Nx(t) = N\sum_{k=0}^{\infty} \frac{t^k M^k}{k!} x^0 = 0.$$

We have shown that if system 2 is not controllable, then system 1 is not observable.
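The duality can be checked directly on a small example: for a particle with dynamics $\dot x_1 = x_2$, $\dot x_2 = 0$, observing the position determines the state, while observing only the velocity does not. A sketch in plain Python (the example matrices are illustrative, not from the text); in each case the dual controllability matrix is just the transpose of the observability matrix $\begin{pmatrix}N\\ NM\end{pmatrix}$, so the two ranks agree:

```python
def rank2(A):
    """Rank of a 2x2 matrix."""
    if A[0][0] * A[1][1] - A[0][1] * A[1][0] != 0:
        return 2
    return 1 if any(x != 0 for row in A for x in row) else 0

M = [[0, 1], [0, 0]]                               # x1' = x2, x2' = 0

results = {}
for name, N in [("position", [1, 0]), ("velocity", [0, 1])]:
    NM = [N[0] * M[0][0] + N[1] * M[1][0],
          N[0] * M[0][1] + N[1] * M[1][1]]         # row vector N M
    obs = [N, NM]                                  # observability matrix, rows N and NM
    # dual controllability matrix [N^T, M^T N^T] is exactly obs transposed
    dual = [[obs[0][0], obs[1][0]], [obs[0][1], obs[1][1]]]
    results[name] = (rank2(obs), rank2(dual))
```

Observing position gives rank $2$ (observable, dual system controllable); observing velocity gives rank $1$ (the initial position can never be recovered).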

2.5 Bang-bang control

Again take $A$ to be the cube $[-1,1]^m \subseteq \mathbb{R}^m$.

Definition: bang-bang

A control $\alpha(\cdot) \in \mathcal{A}$ is called bang-bang if for each time $t \ge 0$ and each index $i = 1, \dots, m$, we have $|\alpha^i(t)| = 1$, where

$$\alpha(t) = \begin{bmatrix} \alpha^1(t)\\ \vdots\\ \alpha^m(t) \end{bmatrix}$$

Theorem 2.8: bang-bang principle

Let $t > 0$ and suppose $x^0 \in \mathcal{C}(t)$, for the system

$$\dot x(t) = Mx(t) + N\alpha(t).$$

Then there exists a bang-bang control $\alpha(\cdot)$ which steers $x^0$ to $0$ at time $t$.

To prove the theorem we need some tools from functional analysis, among them the Krein–Milman Theorem, expressing the geometric fact that a nonempty compact convex set has an extreme point.

2.5.1 Some functional analysis

We will study the “geometry” of certain infinite dimensional spaces of functions.

Notation

$$L^\infty = L^\infty(0, t; \mathbb{R}^m) = \Big\{\alpha(\cdot) : (0,t) \to \mathbb{R}^m \,\Big|\, \operatorname*{ess\,sup}_{0 \le s \le t} |\alpha(s)| < \infty\Big\}, \qquad \|\alpha\|_{L^\infty} = \operatorname*{ess\,sup}_{0 \le s \le t} |\alpha(s)|.$$

Definition: converge in the weak* sense

Let $\alpha_n \in L^\infty$ for $n = 1, \dots$, and $\alpha \in L^\infty$. We say $\alpha_n$ converges to $\alpha$ in the weak* sense, written

$$\alpha_n \overset{*}{\rightharpoonup} \alpha,$$

provided

$$\int_0^t \alpha_n(s) \cdot v(s)\,ds \to \int_0^t \alpha(s) \cdot v(s)\,ds$$

as $n \to \infty$, for all $v(\cdot) : [0, t] \to \mathbb{R}^m$ satisfying $\int_0^t |v(s)|\,ds < \infty$.
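Rapidly oscillating bang-bang functions give the canonical example of weak* convergence: $\alpha_n(s) = \operatorname{sign}(\sin(2\pi n s))$ satisfies $\|\alpha_n\|_{L^\infty} = 1$ for every $n$, yet $\alpha_n \overset{*}{\rightharpoonup} 0$. A numerical sketch in plain Python; the test function $v(s) = s$ and the quadrature resolution are arbitrary choices:

```python
import math

def pairing(n, v, steps=20000):
    """Midpoint-rule approximation of int_0^1 alpha_n(s) v(s) ds,
    where alpha_n(s) = sign(sin(2 pi n s))."""
    h = 1.0 / steps
    total = 0.0
    for k in range(steps):
        s = (k + 0.5) * h
        a = 1.0 if math.sin(2.0 * math.pi * n * s) >= 0.0 else -1.0
        total += a * v(s) * h
    return total

v = lambda s: s
small_n, large_n = pairing(1, v), pairing(50, v)
# the pairings shrink toward 0 as n grows, even though |alpha_n| = 1 everywhere
```

For $v(s) = s$ the exact pairing is $-1/(4n)$, so the computed values should be roughly $-0.25$ and $-0.005$.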

We will need the following useful weak* compactness theorem for $L^\infty$.

Alaoglu's Theorem

Let $\alpha_n \in \mathcal{A}$, $n = 1, \dots$. Then there exists a subsequence $\alpha_{n_k}$ and $\alpha \in \mathcal{A}$ s.t.

$$\alpha_{n_k} \overset{*}{\rightharpoonup} \alpha.$$

Definition: convex; extreme point

  • The set $K$ is convex if for all $x, \hat x \in K$ and all real numbers $0 \le \lambda \le 1$,

    $$\lambda x + (1-\lambda)\hat x \in K.$$
  • A point $z \in K$ is called extreme provided there do not exist points $x, \hat x \in K$ with $x \ne \hat x$ and $0 < \lambda < 1$ s.t.

    $$z = \lambda x + (1-\lambda)\hat x.$$

Krein-Milman Theorem

Let K be a convex, nonempty subset of L, which is compact in the weak* topology.

Then K has at least one extreme point.

2.5.2 Application to bang-bang control

The foregoing abstract theory will be useful for us in the following setting. We will take $K$ to be the set of controls which steer $x^0$ to $0$ at time $t$, prove that it satisfies the hypotheses of the Krein–Milman Theorem, and finally show that an extreme point of $K$ is a bang-bang control.

So consider again the linear dynamics

$$\begin{cases} \dot x(t) = Mx(t) + N\alpha(t)\\ x(0) = x^0, \end{cases}$$

take $x^0 \in \mathcal{C}(t)$, and write

$$K = \{\alpha(\cdot) \in \mathcal{A} \mid \alpha(\cdot) \text{ steers } x^0 \text{ to } 0 \text{ at time } t\}.$$

Lemma 2.9: Geometry of set of controls

The collection K of admissible controls satisfies the hypotheses of the Krein-Milman Theorem.

Proof: Since $x^0 \in \mathcal{C}(t)$, we see that $K \ne \emptyset$.

Next we show that $K$ is convex. For this, recall that $\alpha(\cdot) \in K$ iff

$$x^0 = -\int_0^t \mathbf{X}^{-1}(s)N\alpha(s)\,ds.$$

Now take also $\hat\alpha \in K$ and $0 \le \lambda \le 1$. Then

$$x^0 = -\int_0^t \mathbf{X}^{-1}(s)N\hat\alpha(s)\,ds,$$

and so

$$x^0 = -\int_0^t \mathbf{X}^{-1}(s)N\big(\lambda\alpha(s) + (1-\lambda)\hat\alpha(s)\big)\,ds.$$

Hence $\lambda\alpha + (1-\lambda)\hat\alpha \in K$.

Lastly, we confirm the compactness. Let $\alpha_n \in K$ for $n = 1, \dots$. According to Alaoglu's Theorem, there exist a subsequence $n_k$ and $\alpha \in \mathcal{A}$ s.t. $\alpha_{n_k} \overset{*}{\rightharpoonup} \alpha$. We need to show that $\alpha \in K$.

Now $\alpha_{n_k} \in K$ implies

$$x^0 = -\int_0^t \mathbf{X}^{-1}(s)N\alpha_{n_k}(s)\,ds \to -\int_0^t \mathbf{X}^{-1}(s)N\alpha(s)\,ds$$

by the definition of weak* convergence. Hence $\alpha \in K$.

We can now apply the Krein–Milman Theorem to deduce that there exists an extreme point $\alpha \in K$. What is interesting is that such an extreme point corresponds to a bang-bang control.

Theorem 2.10: Extremality and bang-bang principle

The extreme point $\alpha(\cdot)$ of $K$ is bang-bang.

Proof:

  1. We must show that for almost all times $0 \le s \le t$ and for each $i = 1, \dots, m$, we have

$$|\alpha^i(s)| = 1.$$

Suppose not. Then there exist an index $i \in \{1, \dots, m\}$ and a subset $E \subseteq [0, t]$ of positive measure s.t. $|\alpha^i(s)| < 1$ for $s \in E$. In fact, there exist $\varepsilon > 0$ and a subset $F \subseteq E$ s.t.

$$|F| > 0 \quad \text{and} \quad |\alpha^i(s)| \le 1 - \varepsilon \ \text{ for } s \in F.$$

Define

$$I_F(\beta(\cdot)) := \int_F \mathbf{X}^{-1}(s)N\beta(s)\,ds$$

for $\beta(\cdot) = (0, \dots, \beta(\cdot), \dots, 0)^T$, the real-valued function $\beta$ placed in the $i$th slot. Choose any real-valued function $\beta(\cdot) \not\equiv 0$ s.t.

$$I_F(\beta(\cdot)) = 0$$

and $|\beta(\cdot)| \le 1$. (Such a $\beta$ exists because $\beta \mapsto I_F(\beta)$ is a linear map from the infinite-dimensional space of bounded functions on $F$ into $\mathbb{R}^n$, and so has a nontrivial kernel.) Define

$$\alpha_1(\cdot) := \alpha(\cdot) + \varepsilon\beta(\cdot), \qquad \alpha_2(\cdot) := \alpha(\cdot) - \varepsilon\beta(\cdot),$$

where we redefine $\beta$ to be zero off the set $F$.

  2. We claim that $\alpha_1(\cdot), \alpha_2(\cdot) \in K$.

To see this, observe that

$$\int_0^t \mathbf{X}^{-1}(s)N\alpha_1(s)\,ds = \int_0^t \mathbf{X}^{-1}(s)N\alpha(s)\,ds + \varepsilon\underbrace{\int_F \mathbf{X}^{-1}(s)N\beta(s)\,ds}_{I_F(\beta(\cdot)) \,=\, 0} = -x^0,$$

so $\alpha_1(\cdot)$ steers $x^0$ to $0$ at time $t$. Note also $\alpha_1(\cdot) \in \mathcal{A}$. Indeed,

$$\begin{cases} \alpha_1(s) = \alpha(s) & (s \notin F)\\ \alpha_1(s) = \alpha(s) + \varepsilon\beta(s) & (s \in F). \end{cases}$$

But on the set $F$ we have $|\alpha^i(s)| \le 1 - \varepsilon$, and therefore the $i$th component satisfies

$$|\alpha_1^i(s)| \le |\alpha^i(s)| + \varepsilon|\beta(s)| \le 1 - \varepsilon + \varepsilon = 1,$$

while the other components are unchanged. Similar considerations apply to $\alpha_2$. Hence $\alpha_1, \alpha_2 \in K$, as claimed above.

  3. Finally, observe that

$$\alpha_1(\cdot) = \alpha + \varepsilon\beta, \quad \alpha_1 \ne \alpha; \qquad \alpha_2(\cdot) = \alpha - \varepsilon\beta, \quad \alpha_2 \ne \alpha.$$

But

$$\frac{1}{2}\alpha_1 + \frac{1}{2}\alpha_2 = \alpha,$$

and this is a contradiction, since $\alpha$ is an extreme point of $K$.
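As a concrete instance of the bang-bang principle, consider the rocket railroad car $\dot x_1 = x_2$, $\dot x_2 = \alpha$ from the example after Theorem 2.5. Starting at rest at position $x_1 > 0$, the well-known time-optimal control is bang-bang with a single switch: $\alpha = -1$ until the trajectory meets the switching curve $x_1 = x_2^2/2$, then $\alpha = +1$. The closed-form check below is a sketch of that particular solution (this specific switching rule is a standard fact about the example, not derived in the text above):

```python
import math

def bang_bang_arcs(x1_start):
    """Steer (x1_start, 0), x1_start > 0, to the origin with alpha = -1 then alpha = +1.

    First arc (alpha = -1): x2(t) = -t, x1(t) = x1_start - t^2/2; it meets the
    switching curve x1 = x2^2/2 at t = sqrt(x1_start).
    Second arc (alpha = +1) lasts the same time and ends at the origin.
    """
    ts = math.sqrt(x1_start)                       # switching time
    switch = (x1_start - ts**2 / 2.0, -ts)         # state where the control flips
    # integrate the second arc exactly: x2(s) = switch[1] + s, x1(s) = switch[0] + ...
    end = (switch[0] + switch[1] * ts + ts**2 / 2.0,
           switch[1] + ts)
    return ts, switch, end                         # total steering time is 2 * ts

ts, switch, end = bang_bang_arcs(1.0)              # end should be (0, 0)
```

For `x1_start = 1.0` the control switches at $t = 1$ in the state $(1/2, -1)$ (which indeed lies on the switching curve) and reaches the origin at $t = 2$, with $|\alpha| = 1$ throughout, exactly as the bang-bang principle promises.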
