
3 Linear Time-Optimal Control

3.1 Existence of Time-Optimal Control

Consider the linear system of ODEs:

$$\begin{cases} \dot{x}(t) = M x(t) + N \alpha(t) \\ x(0) = x^0, \end{cases}$$

for given matrices $M \in \mathbb{M}^{n \times n}$ and $N \in \mathbb{M}^{n \times m}$. We'll again take $A$ to be the cube $[-1,1]^m \subset \mathbb{R}^m$.

Define next

$$P[\alpha(\cdot)] := -\int_0^\tau 1 \, ds = -\tau,$$

where $\tau = \tau(\alpha(\cdot))$ denotes the first time the solution of our system hits the origin $0$. (If the trajectory never hits $0$, we set $\tau = +\infty$.)

Optimal Control Problem

We are given the starting point $x^0 \in \mathbb{R}^n$, and want to find an optimal control $\alpha^*(\cdot)$ s.t.

$$P[\alpha^*(\cdot)] = \max_{\alpha(\cdot) \in \mathcal{A}} P[\alpha(\cdot)].$$

Then $\tau^* = -P[\alpha^*(\cdot)]$ is the minimum time to steer to the origin.
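To make the definitions concrete, here is a minimal numerical sketch (the scalar system $\dot{x} = \alpha$, i.e. $n = m = 1$ with $M = 0$, $N = 1$, and the starting point are illustrative choices, not from the text): from $x^0 > 0$ the extreme control $\alpha \equiv -1$ reaches the origin at time $x^0$, while a weaker admissible control takes longer, so the minimum time $\tau^*$ equals $x^0$.

```python
# Scalar illustration: xdot = alpha, |alpha| <= 1, steer x0 > 0 to the origin.
# For alpha = -1 the hitting time is tau = x0; since |xdot| <= 1, no admissible
# control can reach 0 from x0 sooner, so tau* = x0.

def hitting_time(x0, alpha, dt=1e-4, t_max=10.0):
    """First time the Euler-integrated trajectory crosses 0 (None if never)."""
    x, t = x0, 0.0
    while t < t_max:
        x_new = x + dt * alpha(t)
        if x > 0 >= x_new:                        # crossed the origin this step
            return t + dt * x / (x - x_new)       # linear interpolation
        x, t = x_new, t + dt

x0 = 1.5
tau_bang = hitting_time(x0, lambda t: -1.0)       # full negative thrust
tau_weak = hitting_time(x0, lambda t: -0.5)       # a slower admissible control
print(tau_bang, tau_weak)   # approx 1.5 and 3.0: the extreme control is faster
```

The extreme (bang-bang) control is the fastest, consistent with Theorem 3.1 below.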

Theorem 3.1: Existence of Time-Optimal Control

Let $x^0 \in \mathbb{R}^n$. Then there exists an optimal bang-bang control $\alpha^*(\cdot)$.

Proof: Let $\tau^* := \inf \{ t \mid x^0 \in C(t) \}$. We want to show that $x^0 \in C(\tau^*)$; that is, that there exists an optimal control $\alpha^*(\cdot)$ steering $x^0$ to $0$ at time $\tau^*$.

Choose $t_1 \ge t_2 \ge t_3 \ge \cdots$ s.t. $x^0 \in C(t_n)$ and $t_n \to \tau^*$. Since $x^0 \in C(t_n)$, there exists a control $\alpha_n(\cdot) \in \mathcal{A}$ s.t.

$$x^0 = -\int_0^{t_n} X^{-1}(s) N \alpha_n(s) \, ds.$$

If necessary, redefine $\alpha_n(s)$ to be $0$ for $s \ge t_n$. By Alaoglu's Theorem, there exist a subsequence $n_k$ and a control $\alpha^*(\cdot)$ s.t.

$$\alpha_{n_k} \rightharpoonup \alpha^*.$$

We assert that $\alpha^*(\cdot)$ is an optimal control. It is easy to check that $\alpha^*(s) = 0$ for $s \ge \tau^*$. Also

$$x^0 = -\int_0^{t_{n_k}} X^{-1}(s) N \alpha_{n_k}(s) \, ds = -\int_0^{t_1} X^{-1}(s) N \alpha_{n_k}(s) \, ds,$$

since $\alpha_{n_k} = 0$ for $s \ge t_{n_k}$. Let $n_k \to \infty$:

$$x^0 = -\int_0^{t_1} X^{-1}(s) N \alpha^*(s) \, ds = -\int_0^{\tau^*} X^{-1}(s) N \alpha^*(s) \, ds,$$

because $\alpha^*(s) = 0$ for $s \ge \tau^*$. Hence $x^0 \in C(\tau^*)$, and therefore $\alpha^*(\cdot)$ is optimal.

According to Theorem 2.10, there in fact exists an optimal bang-bang control.

3.2 The Maximum Principle for Linear Time-Optimal Control

The really interesting practical issue now is understanding how to compute an optimal control $\alpha^*(\cdot)$.

Definition

Define $K(t, x^0)$ to be the reachable set for time $t$. That is,

$$K(t, x^0) = \{ x^1 \mid \text{there exists } \alpha(\cdot) \in \mathcal{A} \text{ which steers from } x^0 \text{ to } x^1 \text{ at time } t \}.$$

Since $x(\cdot)$ solves (ODE), we have $x^1 \in K(t, x^0)$ iff

$$x^1 = X(t) x^0 + X(t) \int_0^t X^{-1}(s) N \alpha(s) \, ds = x(t)$$

for some control $\alpha(\cdot) \in \mathcal{A}$.
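The reachability formula can be tested numerically. A small sketch (the matrices, the starting point, and the control are hypothetical choices; $M$ is taken nilpotent so that $X(t) = e^{tM} = I + tM$ is available in closed form): the endpoint produced by the variation-of-parameters formula matches direct numerical integration of $\dot{x} = M x + N \alpha$.

```python
import numpy as np

# Hypothetical system (not from the text): M nilpotent, so X(t) = e^{tM} = I + tM.
M = np.array([[0.0, 1.0], [0.0, 0.0]])
N = np.array([[0.0], [1.0]])
x0 = np.array([1.0, -0.5])
alpha = lambda s: np.array([np.cos(s)])      # an admissible control, |alpha| <= 1

X = lambda s: np.eye(2) + s * M              # fundamental matrix
Xinv = lambda s: np.eye(2) - s * M           # its inverse: (I+sM)(I-sM) = I when M^2 = 0

t, n = 2.0, 50_000
ds = t / n

# x1 = X(t) x0 + X(t) * integral_0^t X^{-1}(s) N alpha(s) ds   (midpoint rule)
integral = sum(Xinv((k + 0.5) * ds) @ N @ alpha((k + 0.5) * ds) for k in range(n)) * ds
x_formula = X(t) @ x0 + X(t) @ integral

# Direct Euler integration of xdot = M x + N alpha
x = x0.copy()
for k in range(n):
    x = x + ds * (M @ x + (N @ alpha(k * ds)).ravel())

print(np.allclose(x_formula, x, atol=1e-3))   # True: the two computations agree
```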

Theorem 3.2: Geometry of the set K

The set $K(t, x^0)$ is convex and closed.

Proof:

  1. (Convexity) Let $x^1, x^2 \in K(t, x^0)$. Then there exist $\alpha_1, \alpha_2 \in \mathcal{A}$ s.t.

$$x^1 = X(t) x^0 + X(t) \int_0^t X^{-1}(s) N \alpha_1(s) \, ds,$$

$$x^2 = X(t) x^0 + X(t) \int_0^t X^{-1}(s) N \alpha_2(s) \, ds.$$

Let $0 \le \lambda \le 1$. Then

$$\lambda x^1 + (1 - \lambda) x^2 = X(t) x^0 + X(t) \int_0^t X^{-1}(s) N \underbrace{\left( \lambda \alpha_1(s) + (1 - \lambda) \alpha_2(s) \right)}_{\in A} \, ds,$$

and hence $\lambda x^1 + (1 - \lambda) x^2 \in K(t, x^0)$.

  2. (Closedness) Assume $x^k \in K(t, x^0)$ for $k = 1, 2, \dots$ and $x^k \to y$. We must show $y \in K(t, x^0)$. Since $x^k \in K(t, x^0)$, there exists $\alpha_k \in \mathcal{A}$ s.t.

$$x^k = X(t) x^0 + X(t) \int_0^t X^{-1}(s) N \alpha_k(s) \, ds.$$

According to Alaoglu's Theorem, there exist a subsequence $k_j$ and $\alpha \in \mathcal{A}$ s.t. $\alpha_{k_j} \rightharpoonup \alpha$. Let $k = k_j \to \infty$ in the expression above, to find

$$y = X(t) x^0 + X(t) \int_0^t X^{-1}(s) N \alpha(s) \, ds.$$

Thus $y \in K(t, x^0)$, and hence $K(t, x^0)$ is closed.

Notation: boundary

If $S$ is a set, we write $\partial S$ to denote the boundary of $S$.

Recall that $\tau^*$ denotes the minimum time it takes to steer to $0$, using the optimal control $\alpha^*$. Note that then $0 \in \partial K(\tau^*, x^0)$.

Theorem 3.3: Pontryagin Maximum Principle for Linear Time-Optimal Control

There exists a nonzero vector $h$ s.t.

$$h^T X^{-1}(t) N \alpha^*(t) = \max_{a \in A} \{ h^T X^{-1}(t) N a \} \tag{M}$$

for each time $0 \le t \le \tau^*$.

Interpretation: The significance of this assertion is that if we know $h$, then the maximization principle (M) provides us with a formula for computing $\alpha^*(\cdot)$, or at least for extracting useful information.

We will see in the next chapter that assertion (M) is a special case of the general Pontryagin Maximum Principle.

Proof:

  1. We know $0 \in \partial K(\tau^*, x^0)$. Since $K(\tau^*, x^0)$ is convex, there exists a supporting plane to $K(\tau^*, x^0)$ at $0$; this means that for some $g \ne 0$, we have

$$g \cdot x^1 \le 0 \quad \text{for all } x^1 \in K(\tau^*, x^0).$$
  2. Now $x^1 \in K(\tau^*, x^0)$ iff there exists $\alpha(\cdot) \in \mathcal{A}$ s.t.

$$x^1 = X(\tau^*) x^0 + X(\tau^*) \int_0^{\tau^*} X^{-1}(s) N \alpha(s) \, ds.$$

Also

$$0 = X(\tau^*) x^0 + X(\tau^*) \int_0^{\tau^*} X^{-1}(s) N \alpha^*(s) \, ds.$$

Since $g \cdot x^1 \le 0$, we deduce that

$$g \cdot \left( X(\tau^*) x^0 + X(\tau^*) \int_0^{\tau^*} X^{-1}(s) N \alpha(s) \, ds \right) \le 0 = g \cdot \left( X(\tau^*) x^0 + X(\tau^*) \int_0^{\tau^*} X^{-1}(s) N \alpha^*(s) \, ds \right).$$

Define $h^T := g^T X(\tau^*)$. Then

$$\int_0^{\tau^*} h^T X^{-1}(s) N \alpha(s) \, ds \le \int_0^{\tau^*} h^T X^{-1}(s) N \alpha^*(s) \, ds;$$

and therefore

$$\int_0^{\tau^*} h^T X^{-1}(s) N \left( \alpha^*(s) - \alpha(s) \right) \, ds \ge 0$$

for all controls $\alpha(\cdot) \in \mathcal{A}$.

  3. We claim now that the foregoing implies

$$h^T X^{-1}(s) N \alpha^*(s) = \max_{a \in A} \{ h^T X^{-1}(s) N a \}$$

for almost every time $s$. For suppose not; then there would exist a subset $E \subset [0, \tau^*]$ of positive measure s.t.

$$h^T X^{-1}(s) N \alpha^*(s) < \max_{a \in A} \{ h^T X^{-1}(s) N a \}$$

for $s \in E$. Design a new control $\hat{\alpha}(\cdot)$ as follows:

$$\hat{\alpha}(s) = \begin{cases} \alpha(s) & (s \in E) \\ \alpha^*(s) & (s \notin E), \end{cases}$$

where $\alpha(s)$ is selected s.t.

$$\max_{a \in A} \{ h^T X^{-1}(s) N a \} = h^T X^{-1}(s) N \alpha(s).$$

Then

$$\int_0^{\tau^*} h^T X^{-1}(s) N \left( \alpha^*(s) - \hat{\alpha}(s) \right) \, ds = \int_E \underbrace{h^T X^{-1}(s) N \left( \alpha^*(s) - \alpha(s) \right)}_{< 0} \, ds < 0.$$

This contradicts Step 2 above.

For later reference, we pause here to rewrite the foregoing into different notation; this will turn out to be a special case of the general theory developed later in Chapter 4. First of all, define the Hamiltonian:

Definition: Hamiltonian

$$H(x, p, a) := (Mx + Na) \cdot p \qquad (x, p \in \mathbb{R}^n, \ a \in A).$$

Theorem 3.4: Another way to write Pontryagin Maximum Principle for Linear Time-Optimal Control

Let $\alpha^*(\cdot)$ be a time-optimal control and $x^*(\cdot)$ the corresponding response.

Then there exists a function $p^*(\cdot) : [0, \tau^*] \to \mathbb{R}^n$ s.t.

$$\dot{x}^*(t) = \nabla_p H(x^*(t), p^*(t), \alpha^*(t)), \tag{ODE}$$

$$\dot{p}^*(t) = -\nabla_x H(x^*(t), p^*(t), \alpha^*(t)), \tag{ADJ}$$

$$H(x^*(t), p^*(t), \alpha^*(t)) = \max_{a \in A} H(x^*(t), p^*(t), a). \tag{M}$$

We call (ADJ) the adjoint equations and (M) the maximization principle. The function $p^*(\cdot)$ is the costate.

Proof:

  1. Select the vector $h$ as in Theorem 3.3, and consider the system

$$\begin{cases} \dot{p}^*(t) = -M^T p^*(t) \\ p^*(0) = h. \end{cases}$$

The solution is $p^*(t) = e^{-t M^T} h$; and hence

$$p^*(t)^T = h^T X^{-1}(t),$$

since $\left( e^{-t M^T} \right)^T = e^{-t M} = X^{-1}(t)$.

  2. We know from condition (M) in Theorem 3.3 that

$$h^T X^{-1}(t) N \alpha^*(t) = \max_{a \in A} \{ h^T X^{-1}(t) N a \}.$$

Since $p^*(t)^T = h^T X^{-1}(t)$, this means that

$$p^*(t)^T \left( M x^*(t) + N \alpha^*(t) \right) = \max_{a \in A} \{ p^*(t)^T (M x^*(t) + N a) \}.$$
  3. Finally, we observe that according to the definition of the Hamiltonian $H$, the dynamical equations for $x^*(\cdot), p^*(\cdot)$ take the form (ODE) and (ADJ), as stated in the theorem.
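The gradient identities invoked here are easy to check numerically. A minimal sketch (the matrices $M$, $N$ and the sample vectors are hypothetical choices, not from the text): for $H(x, p, a) = (Mx + Na) \cdot p$, central finite differences confirm that $\nabla_p H = Mx + Na$, the right-hand side of (ODE), and that $-\nabla_x H = -M^T p$, the right-hand side of (ADJ).

```python
import numpy as np

# Hypothetical data (not from the text), just to check the gradient identities.
M = np.array([[0.0, 1.0], [-1.0, 0.0]])
N = np.array([[0.0], [1.0]])
x = np.array([0.3, -0.7])
p = np.array([1.2, 0.4])
a = np.array([0.5])

H = lambda x, p, a: (M @ x + N @ a) @ p    # H(x, p, a) = (Mx + Na) . p

def grad(f, v, eps=1e-6):
    """Central finite-difference gradient of f at v."""
    g = np.zeros_like(v)
    for i in range(len(v)):
        e = np.zeros_like(v)
        e[i] = eps
        g[i] = (f(v + e) - f(v - e)) / (2 * eps)
    return g

grad_p = grad(lambda q: H(x, q, a), p)     # should equal Mx + Na  -> (ODE)
grad_x = grad(lambda y: H(y, p, a), x)     # should equal M^T p    -> (ADJ) up to sign

print(np.allclose(grad_p, M @ x + N @ a))  # True
print(np.allclose(-grad_x, -M.T @ p))      # True
```

Because $H$ is linear in each of $x$ and $p$ separately, the difference quotients here are exact up to rounding.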

3.3 Examples

Example 1: Rocket Railroad Car

We recall this example, introduced in §1.2. We have

$$\dot{x}(t) = \underbrace{\begin{bmatrix} 0 & 1 \\ 0 & 0 \end{bmatrix}}_{M} x(t) + \underbrace{\begin{bmatrix} 0 \\ 1 \end{bmatrix}}_{N} \alpha(t) \tag{ODE}$$

for

$$x(t) = (x_1(t), x_2(t))^T, \qquad A = [-1, 1].$$

According to the Pontryagin Maximum Principle, there exists $h \ne 0$ s.t.

$$h^T X^{-1}(t) N \alpha^*(t) = \max_{|a| \le 1} \{ h^T X^{-1}(t) N a \}.$$

We will extract the interesting fact that an optimal control $\alpha^*$ switches at most once.

We must compute $e^{tM}$. To do so, we observe:

$$M^0 = I, \qquad M = \begin{bmatrix} 0 & 1 \\ 0 & 0 \end{bmatrix}, \qquad M^2 = \begin{bmatrix} 0 & 1 \\ 0 & 0 \end{bmatrix} \begin{bmatrix} 0 & 1 \\ 0 & 0 \end{bmatrix} = 0;$$

and therefore $M^k = 0$ for all $k \ge 2$. Consequently,

$$e^{tM} = I + tM = \begin{bmatrix} 1 & t \\ 0 & 1 \end{bmatrix}.$$

Then

$$X^{-1}(t) = \begin{bmatrix} 1 & -t \\ 0 & 1 \end{bmatrix}, \qquad X^{-1}(t) N = \begin{bmatrix} 1 & -t \\ 0 & 1 \end{bmatrix} \begin{bmatrix} 0 \\ 1 \end{bmatrix} = \begin{bmatrix} -t \\ 1 \end{bmatrix}, \qquad h^T X^{-1}(t) N = (h_1, h_2) \begin{bmatrix} -t \\ 1 \end{bmatrix} = -t h_1 + h_2.$$

The Maximum Principle asserts

$$(-t h_1 + h_2) \, \alpha^*(t) = \max_{|a| \le 1} \{ (-t h_1 + h_2) \, a \};$$

and this implies that

$$\alpha^*(t) = \operatorname{sgn}(-t h_1 + h_2)$$

for the sign function

$$\operatorname{sgn}(x) = \begin{cases} 1 & x > 0 \\ 0 & x = 0 \\ -1 & x < 0. \end{cases}$$

Therefore the optimal control $\alpha^*$ switches at most once, since $-t h_1 + h_2$ is an affine function of $t$; and if $h_1 = 0$, then $\alpha^*$ is constant.

Since the optimal control switches at most once, the control we constructed by a geometric method in §1.3 must have been optimal.
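The one-switch structure can be checked against the explicit solution formulas for these dynamics. A small sketch (the initial point $x^0 = (1, 0)$ and the switch time are illustrative choices, not from the text): applying $\alpha \equiv -1$ for one unit of time and then $\alpha \equiv +1$ for one more steers $(1, 0)$ exactly to the origin, with a single switch.

```python
# Rocket railroad car: x1dot = x2, x2dot = alpha, with |alpha| <= 1.
# Closed-form flow for constant alpha = a over a time interval of length t:
def flow(x1, x2, a, t):
    return (x1 + t * x2 + 0.5 * a * t * t, x2 + a * t)

# Illustrative initial point (not from the text): x0 = (1, 0).
# Decelerate (alpha = -1) for 1 time unit, then accelerate (alpha = +1) for 1:
x1, x2 = flow(1.0, 0.0, -1.0, 1.0)   # -> (0.5, -1.0)
x1, x2 = flow(x1, x2, +1.0, 1.0)     # -> (0.0, 0.0): at rest at the origin
print(x1, x2)  # 0.0 0.0

# This single switch at t = 1 is consistent with alpha*(t) = sgn(-t*h1 + h2)
# for, e.g., h = (-1, -1): sgn(t - 1) is -1 for t < 1 and +1 for t > 1.
```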

Example 2: Control of a Vibrating Spring

Consider next the simple dynamics

$$\ddot{x} + x = \alpha,$$

where we interpret the control as an exterior force acting on an oscillating weight (of unit mass) hanging from a spring. Our goal is to design an optimal exterior forcing α() that brings the motion to a stop in minimum time.

(Figure: the spring model)

We have $n = 2$, $m = 1$. The individual dynamical equations read:

$$\begin{cases} \dot{x}_1(t) = x_2(t) \\ \dot{x}_2(t) = -x_1(t) + \alpha(t); \end{cases}$$

which in vector notation become

$$\dot{x}(t) = \underbrace{\begin{bmatrix} 0 & 1 \\ -1 & 0 \end{bmatrix}}_{M} x(t) + \underbrace{\begin{bmatrix} 0 \\ 1 \end{bmatrix}}_{N} \alpha(t)$$

for $|\alpha(t)| \le 1$. That is, $A = [-1, 1]$.

Using the maximum principle

We employ the Pontryagin Maximum Principle, which asserts that there exists $h \ne 0$ s.t.

$$h^T X^{-1}(t) N \alpha^*(t) = \max_{|a| \le 1} \{ h^T X^{-1}(t) N a \}. \tag{M}$$

To extract useful information from (M) we must compute $X(\cdot)$. To do so, we observe that the matrix $M$ is skew-symmetric, and thus

$$M^0 = I, \qquad M = \begin{bmatrix} 0 & 1 \\ -1 & 0 \end{bmatrix}, \qquad M^2 = \begin{bmatrix} 0 & 1 \\ -1 & 0 \end{bmatrix} \begin{bmatrix} 0 & 1 \\ -1 & 0 \end{bmatrix} = -I.$$

Therefore

$$M^k = \begin{cases} I & k = 0, 4, 8, \dots \\ M & k = 1, 5, 9, \dots \\ -I & k = 2, 6, 10, \dots \\ -M & k = 3, 7, 11, \dots \end{cases}$$

and consequently

$$\begin{aligned} e^{tM} &= I + tM + \frac{t^2}{2!} M^2 + \cdots \\ &= I + tM - \frac{t^2}{2!} I - \frac{t^3}{3!} M + \frac{t^4}{4!} I + \cdots \\ &= \left( 1 - \frac{t^2}{2!} + \frac{t^4}{4!} - \cdots \right) I + \left( t - \frac{t^3}{3!} + \frac{t^5}{5!} - \cdots \right) M \\ &= \cos t \, I + \sin t \, M = \begin{bmatrix} \cos t & \sin t \\ -\sin t & \cos t \end{bmatrix}. \end{aligned}$$

So we have

$$X^{-1}(t) = e^{-tM} = \begin{bmatrix} \cos t & -\sin t \\ \sin t & \cos t \end{bmatrix},$$

and

$$X^{-1}(t) N = \begin{bmatrix} \cos t & -\sin t \\ \sin t & \cos t \end{bmatrix} \begin{bmatrix} 0 \\ 1 \end{bmatrix} = \begin{bmatrix} -\sin t \\ \cos t \end{bmatrix},$$

whence

$$h^T X^{-1}(t) N = (h_1, h_2) \begin{bmatrix} -\sin t \\ \cos t \end{bmatrix} = -h_1 \sin t + h_2 \cos t.$$

According to condition (M), for each time $t$ we have

$$(-h_1 \sin t + h_2 \cos t) \, \alpha^*(t) = \max_{|a| \le 1} \{ (-h_1 \sin t + h_2 \cos t) \, a \}.$$

Therefore

$$\alpha^*(t) = \operatorname{sgn}(-h_1 \sin t + h_2 \cos t).$$
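The matrix-exponential computation behind this formula is easy to verify numerically. A minimal sketch (truncating the exponential series at a fixed order, at an arbitrary sample time): the partial sums of $\sum_k t^k M^k / k!$ do converge to $\cos t \, I + \sin t \, M$.

```python
import math

# M = [[0, 1], [-1, 0]]; check e^{tM} = [[cos t, sin t], [-sin t, cos t]].
def mat_mul(A, B):
    return [[sum(A[i][k] * B[k][j] for k in range(2)) for j in range(2)] for i in range(2)]

def mat_exp(M, t, terms=30):
    """Truncated series I + tM + (tM)^2/2! + ... (enough terms for moderate t)."""
    S = [[1.0, 0.0], [0.0, 1.0]]       # running sum, starts at I
    P = [[1.0, 0.0], [0.0, 1.0]]       # running term (tM)^k / k!
    for k in range(1, terms):
        P = mat_mul(P, M)
        P = [[P[i][j] * t / k for j in range(2)] for i in range(2)]
        S = [[S[i][j] + P[i][j] for j in range(2)] for i in range(2)]
    return S

M = [[0.0, 1.0], [-1.0, 0.0]]
t = 1.7                                 # arbitrary sample time
E = mat_exp(M, t)
R = [[math.cos(t), math.sin(t)], [-math.sin(t), math.cos(t)]]
ok = all(abs(E[i][j] - R[i][j]) < 1e-12 for i in range(2) for j in range(2))
print(ok)  # True
```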

Finding the optimal control

To simplify further, we may assume $h_1^2 + h_2^2 = 1$ (a unit vector). Recall the trig identity $\sin(x + y) = \sin x \cos y + \cos x \sin y$, and choose $\delta$ s.t. $-h_1 = \cos \delta$, $h_2 = \sin \delta$. Then

$$\alpha^*(t) = \operatorname{sgn}(\cos \delta \sin t + \sin \delta \cos t) = \operatorname{sgn}(\sin(t + \delta)).$$

We deduce therefore that $\alpha^*$ switches from $+1$ to $-1$, and vice versa, every $\pi$ units of time.

Geometric interpretation

Next, we figure out the geometric consequences.

  • When $\alpha \equiv 1$, our (ODE) becomes

    $$\begin{cases} \dot{x}_1 = x_2 \\ \dot{x}_2 = -x_1 + 1. \end{cases}$$

    In this case, we can calculate that

    $$\frac{d}{dt} \left[ (x_1(t) - 1)^2 + (x_2(t))^2 \right] = 2 (x_1(t) - 1) \dot{x}_1(t) + 2 x_2(t) \dot{x}_2(t) = 2 (x_1(t) - 1) x_2(t) + 2 x_2(t) (-x_1(t) + 1) = 0.$$

    Consequently, the motion satisfies $(x_1(t) - 1)^2 + (x_2(t))^2 \equiv r_1^2$ for some radius $r_1$, and therefore the trajectory lies on a circle with center $(1, 0)$, as illustrated.

    (Figure: trajectory when $\alpha \equiv 1$)

  • If $\alpha \equiv -1$, our (ODE) instead becomes

    $$\begin{cases} \dot{x}_1 = x_2 \\ \dot{x}_2 = -x_1 - 1, \end{cases}$$

    in which case

    $$\frac{d}{dt} \left[ (x_1(t) + 1)^2 + (x_2(t))^2 \right] = 0.$$

    Thus $(x_1(t) + 1)^2 + (x_2(t))^2 \equiv r_2^2$ for some radius $r_2$, and the motion lies on a circle with center $(-1, 0)$, as illustrated.

    (Figure: trajectory when $\alpha \equiv -1$)

In summary, to get to the origin we must switch our control $\alpha^*(\cdot)$ back and forth between the values $\pm 1$, causing the trajectory to switch between lying on circles centered at $(\pm 1, 0)$. The switches occur every $\pi$ units of time.
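The circle-arc picture can be confirmed numerically. A short sketch (classical RK4 integration, with an arbitrary starting point and the illustrative schedule $\alpha = +1$ for $\pi$ units of time followed by $\alpha = -1$ for $\pi$ more): along each arc the corresponding squared distance, $(x_1 - 1)^2 + x_2^2$ or $(x_1 + 1)^2 + x_2^2$, stays constant to within integration error.

```python
import math

# Spring dynamics x1dot = x2, x2dot = -x1 + a, integrated with classical RK4.
def rk4_step(x1, x2, a, h):
    def f(x1, x2):
        return (x2, -x1 + a)
    k1 = f(x1, x2)
    k2 = f(x1 + h/2 * k1[0], x2 + h/2 * k1[1])
    k3 = f(x1 + h/2 * k2[0], x2 + h/2 * k2[1])
    k4 = f(x1 + h * k3[0], x2 + h * k3[1])
    return (x1 + h/6 * (k1[0] + 2*k2[0] + 2*k3[0] + k4[0]),
            x2 + h/6 * (k1[1] + 2*k2[1] + 2*k3[1] + k4[1]))

# Illustrative run (start point and switching schedule are arbitrary choices):
x1, x2 = 3.0, 0.5
h, n = math.pi / 2000, 2000

r_plus = (x1 - 1)**2 + x2**2             # invariant for alpha = +1 arcs
for _ in range(n):                        # alpha = +1 for pi units of time
    x1, x2 = rk4_step(x1, x2, +1.0, h)
drift_plus = abs((x1 - 1)**2 + x2**2 - r_plus)

r_minus = (x1 + 1)**2 + x2**2            # invariant for alpha = -1 arcs
for _ in range(n):                        # alpha = -1 for the next pi units
    x1, x2 = rk4_step(x1, x2, -1.0, h)
drift_minus = abs((x1 + 1)**2 + x2**2 - r_minus)

print(drift_plus < 1e-8, drift_minus < 1e-8)  # True True: each arc stays on its circle
```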

(Figure: the controlled dynamics)
