Formation Control

1 Introduction [1]

Consider a system of $n$ mobile agents where $q_{i} \in R^{m}$ is the position of the $i^{th}$ agent relative to an Earth-fixed coordinate frame, and $u_{i} \in R^{m}$ is the corresponding control input. In subsequent parts, $u_{i}$ will be a velocity-, acceleration-, or actuator-level input depending on the mathematical model used to describe the agent motion.

Let the desired formation for the agents be represented by an infinitesimally and minimally rigid framework $F^{*} = (G^{*}, q^{*})$ where $G^{*} = (V^{*}, E^{*})$ is the formation graph, $\dim (V^{*}) = n, \dim (E^{*}) = l$ , and $q^{*} = [q_{1}^{*}, \dots, q_{n}^{*}]$ . The constant desired distance between agents $i$ and $j$ is given by

\begin{matrix} (1.1) & d_{i j} = ∥ q_{i}^{*} - q_{j}^{*} ∥ > 0, i, j \in V^{*} . \end{matrix}

In practice, the geometric shape/structure of the desired formation is dictated by the mission to be accomplished by the agents. When translating the desired shape into a framework, one needs to include enough edges to ensure that $F^{*}$ is indeed infinitesimally and minimally rigid.

The actual formation of the agents is represented by the framework $F (t) = (G_{s}, q (t))$ where $G_{s}$ represents the sensor graph and $q = [q_{1}, \dots, q_{n}]$ . It is important to clarify the difference between the formation graph $G^{*}$ and the sensor graph $G_{s}$ , which in general need not be the same.

$G^{*}$ indicates the minimum number of inter-agent distances that need to be controlled for the desired formation to be successfully reached.
$G_{s}$ indicates the agent pairs that can sense and/or communicate with each other.

We make the following assumptions regarding the desired and actual formations:

Assumption 1

The set where the agents achieve the desired formation is nonempty, i.e., there exists $q^{*}$ such that $r_{G} (q^{*}) = d$ where $d = [\dots, d_{i j}^{2}, \dots] \in R^{l}$ .

Assumption 2

The formation and sensor graphs are the same, i.e., $G_{s} = G^{*}$ . Furthermore, inter-agent connectivity is always maintained in the sense that agent $i$ is always within the sensing/communication range of agent $j$ , $\forall j \in N_{i} (E^{*})$ . In other words, $G^{*}$ is fixed.

Connectivity maintenance prevents the occurrence of flex ambiguities since temporary loss of edges cannot happen.

Assumption 3

At $t = 0$ , the agents do not satisfy the desired inter-agent distance constraints, i.e., $∥ q_{i} (0) - q_{j} (0) ∥ \neq d_{i j}, i, j \in V^{*}$ .

Assumption 4

The only position information being measured is the relative position of agent pairs in $E^{*}$ , i.e., $q_{i} - q_{j}, (i, j) \in E^{*}$ ^[1]. That is, the global positions of the agents, $q_{i}, i = 1, \dots, n$ , are not available to the control law.

We will deal with three types of control problems:

Formation Acquisition
Formation Maneuvering
Target Interception

Problem 1: Formation Acquisition

The goal is for the agents to acquire and maintain a pre-defined geometric shape in space. The control objective for formation acquisition, which serves as the common, primary objective for the other two problems, is to design $u_{i}$ such that

\begin{matrix} (1.2) & F (t) \to Iso (F^{*}) as t \to \infty . \end{matrix}

which is equivalent to

\begin{matrix} (1.3) & ∥ q_{i} (t) - q_{j} (t) ∥ \to d_{i j} as t \to \infty, i, j \in V^{*} . \end{matrix}

Since only the inter-agent distances are to be directly controlled, the actual formation can converge to any isometry of $F^{*}$ . That is, the formation will converge to one framework in the set $Iso (F^{*})$ , with the specific one being determined by the initial positions of the agents, $q_{i} (0), i = 1, \dots, n$ .

Problem 2: Formation Maneuvering

The agents are required to simultaneously acquire a formation (i.e., satisfy $(1.2)$ ) and maneuver cohesively according to some pre-defined trajectory. Thus, the secondary objective is

\begin{matrix} (1.4) & {\dot{q}}_{i} (t) - v_{d i} (t) \to 0 as t \to \infty, i = 1, \dots, n . \end{matrix}

where $v_{d i} \in R^{3}$ represents the desired rigid body velocity for the swarm of agents. That is, the fixed-shape, desired formation evolves in space as a virtual rigid body undergoing translation and/or rotation.

In practice, the selection of $v_{d i}$ is mission-dependent. For example, it could be related to a path planning algorithm that provides an optimal solution to the coverage problem where agents cooperatively maximize the coverage area of a given mission under certain time and/or fuel consumption constraints.

When $v_{d i}$ only includes a translation velocity, the formation maneuvering problem is also called flocking. For the case where $v_{d i}$ has a rotational component, we assign the $n^{th}$ agent (without loss of generality) to be the “leader” while the remaining agents are “followers”. This assignment is for the sole purpose of one agent serving as a reference point for the axis of rotation of the virtual rigid body. Therefore, $F^{*}$ should be constructed with the following additional conditions:

$q_{n}^{*} \in conv {q_{1}^{*}, \dots, q_{n - 1}^{*}}$ .
$(i, n) \in E^{*}, i = 1, \dots, n - 1$ , i.e., there is an edge between each follower and the leader.

Figure 1.1: Example of the construction of $F^{*}$ : a tetrahedron formation where L stands for leader and F for follower.

An example of $F^{*}$ is illustrated by the 3D formation in Figure 1.1 where the leader is located in the interior of the tetrahedron. The axis of rotation passes through the leader, which is inside the tetrahedron. Since $n = 5$ , we need $3 n - 6 = 9$ edges for the framework to be minimally rigid. The solid lines indicate edges that form the faces of the tetrahedron while the dashed lines are edges in its interior. Notice that edge $(1, 4)$ is not necessary.
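The edge count in this example can be checked numerically. The sketch below is only illustrative: the coordinates (a regular tetrahedron with the leader at its centroid) are hypothetical, since the text does not give them, and the rigidity matrix is assumed to have one row per edge with $(q_{i} - q_{j})^{⊤}$ in the block of agent $i$ and its negative in the block of agent $j$. A rank of $3 n - 6$ indicates that the framework is infinitesimally rigid.

```python
import numpy as np

def rigidity_matrix(q, edges):
    """One row per edge (i, j): (q_i - q_j)^T in block i, -(q_i - q_j)^T in block j."""
    n, m = q.shape
    R = np.zeros((len(edges), n * m))
    for k, (i, j) in enumerate(edges):
        diff = q[i] - q[j]
        R[k, m * i:m * (i + 1)] = diff
        R[k, m * j:m * (j + 1)] = -diff
    return R

# Hypothetical coordinates: followers 1-4 at the vertices of a regular
# tetrahedron, leader (agent 5) at the centroid. Indices below are 0-based.
q_star = np.array([[1, 1, 1], [1, -1, -1], [-1, 1, -1], [-1, -1, 1], [0, 0, 0]],
                  dtype=float)

# Tetrahedron edges without (1, 4), plus one edge from each follower to the leader.
edges = [(0, 1), (0, 2), (1, 2), (1, 3), (2, 3),
         (0, 4), (1, 4), (2, 4), (3, 4)]

n, m = q_star.shape
required = m * n - m * (m + 1) // 2          # 3n - 6 in three dimensions
rank = np.linalg.matrix_rank(rigidity_matrix(q_star, edges))
print(len(edges), rank, required)
```

Because the number of edges equals the rank, removing any edge would drop the rank below $3 n - 6$, which is what minimal rigidity requires.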

The association of a leader agent (instead of a virtual leader) with the axis of rotation is done for convenience (not necessity) since the leader’s relative position to the followers can be measured and it will not have to undergo any rotation. Note that if one uses a virtual leader, its location would have to be known in order to calculate its position relative to the agents (see $(3)$ in Distance). This in turn would require extra measurements and/or calculations.

Problem 3: Target Interception

The agents should intercept and surround a (possibly evading) moving target with a pre-defined formation. Here, we will also use the leader–follower approach by taking the $n^{th}$ agent to be the leader while the remaining agents are followers. The control protocol will consist of:

Selecting $F^{*}$ such that $q_{n}^{*} \in conv {q_{1}^{*}, \dots, q_{n - 1}^{*}}$ (Unlike formation maneuvering with rotation, we do not need the second condition for target interception)
The leader chasing the target
The followers tracking the leader while maintaining the desired formation. Thus, if $q_{T} \in R^{m}$ denotes the target position, the secondary objective for this problem is that $q_{T} (t)$ approach $conv {q_{1} (t), q_{2} (t), \dots, q_{n - 1} (t)}$ as time evolves, which (with abuse of notation) we express as

\begin{matrix} (1.5) & q_{T} (t) \in conv {q_{1} (t), q_{2} (t), \dots, q_{n - 1} (t)} as t \to \infty . \end{matrix}

Before beginning with the control design, some theorem and corollary statements will be made without proof.

Theorem 1.1 (Originally from [1] of Theorem C.1)

Consider the SISO LTI system

\begin{aligned} \dot{x} = A x + B u \\ y = C x \end{aligned}

where $A \in R^{n \times n}$ is a Hurwitz matrix. Then, the following results hold:

If $u (t) \in L_{2}$ , then $y (t) \in L_{2} \cap L_{\infty}, \dot{y} (t) \in L_{2}, y (t)$ is continuous, and $y (t) \to 0$ as $t \to \infty$ .
If $u (t) \in L_{\infty}$ , then $y (t) \in L_{\infty}, \dot{y} (t) \in L_{\infty}$ , and $y (t)$ is uniformly continuous. If, in addition, $u (t) \to 0$ as $t \to \infty$ , then $y (t) \to 0$ as $t \to \infty$ .

Theorem 1.2 (Originally from [1] of Theorem C.2)

Let $V : D \times R_{⩾ 0} \to R$ be a continuously differentiable function such that

\begin{aligned} U_{1} (x) ⩽ V (x, t) ⩽ U_{2} (x) \\ \dot{V} = \frac{\partial V}{\partial t} + \frac{\partial V}{\partial x} f (x, t) ⩽ - U_{3} (x) \end{aligned}

for all $t ⩾ 0$ and for all $x \in D$ , where $U_{i} (x), i = 1, 2, 3$ are continuous positive definite functions on $D$ . Then, $x_{e} = 0$ is uniformly asymptotically stable.

Corollary 1.1 (Originally from [1] of Corollary C.1)

If $U_{i} (x) = c_{i} ∥ x ∥^{p}, i = 1, 2, 3$ where $c_{i}, p > 0$ in Theorem 1.2, then $x_{e} = 0$ is exponentially stable.

Input-to-state stability bridges the gap between the notions of Lyapunov stability and input–output stability by quantifying the effects of both initial conditions and external (control or disturbance) inputs on the system state.

Definition: Input-to-State Stability

A dynamical system $\dot{x} = f (x, u), x (0) = x_{0}$ with $f : R^{n} \times R^{m} \to R^{n}$ is said to be input-to-state stable if there exist a class $KL$ function $β$ and a class $K$ function $α$ such that, for any $x_{0}$ and any $u (t) \in L_{\infty}$ , the solution $x (t)$ exists for all $t ⩾ 0$ and satisfies

∥ x (t) ∥ ⩽ β (∥ x_{0} ∥, t) + α (sup_{0 ⩽ τ ⩽ t} ∥ u (τ) ∥) .

The above inequality has several implications.

For any bounded input, the state is bounded.
As $t \to \infty$ , the state is ultimately bounded by function $α$ .
If $u (t) \to 0$ as $t \to \infty$ , so does $x (t)$ .
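As a concrete illustration of the inequality, consider the scalar system $\dot{x} = - x + u$ , which is not taken from the text but is a standard example: its solution satisfies the bound with $β (r, t) = r e^{- t}$ and $α (r) = r$ . The sketch below integrates this system with a forward Euler step (an assumed discretization) under a bounded sinusoidal input and checks the bound at every sample.

```python
import numpy as np

dt, T = 1e-3, 10.0
t = np.arange(0.0, T, dt)
u = 0.5 * np.sin(3.0 * t)                 # bounded input, sup|u| <= 0.5
x = np.empty_like(t)
x[0] = 2.0                                # initial condition x_0

for k in range(len(t) - 1):
    x[k + 1] = x[k] + dt * (-x[k] + u[k])  # forward Euler step of x' = -x + u

beta = abs(x[0]) * np.exp(-t)              # beta(|x_0|, t) = |x_0| e^{-t}
alpha = np.maximum.accumulate(np.abs(u))   # alpha(r) = r at the running sup of |u|
iss_holds = bool(np.all(np.abs(x) <= beta + alpha + 1e-12))
print(iss_holds)
```

The first term captures the decaying effect of the initial condition, while the second bounds the steady-state effect of the input, which matches the three implications listed above.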

Theorem 1.3 (A corollary to Barbalat's Lemma, originally from [1] of Theorem C.3)

Consider the function $f : R_{⩾ 0} \to R$ . If $f (t) \in L_{\infty}, \dot{f} (t) \in L_{\infty}$ , and $f (t) \in L_{2}$ , then

f (t) \to 0 as t \to \infty .

Theorem 1.4 (Originally from [1] of Theorem C.4)

Consider that $f (x, u)$ in $\dot{x} = f (x, u), x (0) = x_{0}$ is locally Lipschitz in $(x, u)$ in some neighborhood of $(x = 0, u = 0)$ . Then, the system is locally input-to-state stable if and only if the unforced system $\dot{x} = f (x, 0)$ has a locally asymptotically stable equilibrium point at the origin.

Theorem 1.5 (Originally from [1] of Theorem C.5)

Consider the interconnected system

\begin{matrix} (1.6) & \begin{aligned} Σ_{1} : \dot{x} = f (x, y) \\ Σ_{2} : \dot{y} = g (y) . \end{aligned} \end{matrix}

If subsystem $Σ_{1}$ with input $y$ is locally input-to-state stable and $y = 0$ is a locally asymptotically stable equilibrium point of subsystem $Σ_{2}$ , then $[x, y] = 0$ is a locally asymptotically stable equilibrium point of the interconnected system.

Theorem 1.6 (Originally from [1] of Theorem C.6)

If $0 \in K [f] (0, t)$ in a region $Q \supset B (0, δ) \times [t_{0}, \infty)$ and $V$ : $D \times R_{⩾ 0} \to R$ is a regular function satisfying $V (0, t) = 0$ ,

α_{1} (∥ x ∥) ⩽ V (x, t) ⩽ α_{2} (∥ x ∥) \forall x \neq 0

and

\dot{V} \overset{a.e.}{\in} \underset{ξ \in \partial V (x, t)}{\cap} ξ^{⊤} [\begin{matrix} K [f] (x, t) \\ 1 \end{matrix}] ⩽ - α_{3} (∥ x ∥)

in $Q$ where $α_{i} (\cdot), i = 1, 2, 3$ are class $K$ functions, $K [f] (x, t)$ is an upper semi-continuous, nonempty, compact, convex-valued map on $D$ defined as

K [f] (x, t) := \underset{δ > 0}{\cap} \underset{μ N = 0}{\cap} \overset{―}{co} f (B (x, δ) ∖ N, t),

where $\underset{μ N = 0}{\cap}$ denotes the intersection over all sets $N$ of Lebesgue measure zero, $\overset{―}{co}$ is the convex closure, and $B$ , defined as

B (\bar{x}, r) = {x \in R^{n} : ∥ x - \bar{x} ∥ < r}

represents the "ball" of radius $r$ centered at $\bar{x}$ .

Then $x = 0$ is a uniformly asymptotically stable equilibrium point of system $\dot{x} = f (x, t), x (t_{0}) = x_{0}$ where $f : D \times R_{⩾ 0} \to R^{n}$ is discontinuous in $x$ and piecewise continuous in $t$ on $D \times R_{⩾ 0}$ .

2 Single-Integrator Model [1]

This section will set the foundation for the formation control designs. We use here a very simple model for the motion of the agents known as the single-integrator model, which only includes two variables: position and velocity. This is a simplified kinematic model for omnidirectional robots (e.g., mobile robots with Swedish wheels). Specifically, we consider a system of $n$ agents governed by the first-order differential equation

\begin{matrix} (2.1) & {\dot{q}}_{i} = u_{i}, i = 1, \dots, n . \end{matrix}

where $q_{i} \in R^{m}$ is the position and $u_{i} \in R^{m}$ is the velocity-level control input of the $i^{th}$ agent with respect to an Earth-fixed coordinate frame. The name “single integrator” originates from the fact that the transfer function matrix of $(2.1)$ is

\begin{matrix} (2.2) & G_{i} (s) = \frac{1}{s} I_{m} \end{matrix}

where $s$ is the Laplace variable, i.e., the inputs and outputs are separated by one integrator.

Formation controllers based on $(2.1)$ are called high-level control laws because they are often embedded in controllers designed for more refined agent models. Therefore, the control laws introduced in this section will form the basis for all subsequent designs.

2.1 Formation Acquisition

We begin with the formation acquisition problem defined in Section 1. Given $(2.1)$ , we seek to design $u_{i} = u_{i} (q_{i} - q_{j}, d_{i j}), i = 1, \dots, n$ and $j \in N_{i} (E^{*})$ , where $N_{i} (\cdot)$ was defined in Preliminary of Graph Theory, to achieve the control objective described by $(1.2)$ (or equivalently $(1.3)$ ).

It is appropriate at this point to elaborate on an issue mentioned at the end of the section on framework ambiguities. The inputs $u_{i}, i = 1, \dots, n$ will directly control the distances $∥ q_{i} - q_{j} ∥, (i, j) \in E^{*}$ . Therefore, they can only directly ensure that

\begin{matrix} (2.3) & ∥ q_{i} (t) - q_{j} (t) ∥ \to d_{i j} as t \to \infty, (i, j) \in E^{*}, \end{matrix}

which is equivalent to

\begin{matrix} (2.4) & r_{G} (q (t)) \to r_{G} (q^{*}) = d as t \to \infty . \end{matrix}

Note that $(2.3)$ is different than $(1.3)$ since it is only defined for $(i, j) \in E^{*}$ while $(1.3)$ is defined for all $i, j \in V^{*}$ . This is potentially problematic since (with abuse of notation) $r_{G} (Iso (F^{*})) = r_{G} (Amb (F^{*}))$ . Therefore, the control scheme will need to avoid the possibility that $F (t) \to Amb (F^{*})$ as $t \to \infty$ . This will be accomplished by initializing the agents sufficiently close to $Iso (F^{*})$ in the sense that $dist (q (0), Iso (F^{*})) < dist (q (0), Amb (F^{*}))$ .

To simplify the notation in the following derivations, we define the relative position of two agents as

\begin{matrix} (2.5) & {\tilde{q}}_{i j} = q_{i} - q_{j} . \end{matrix}

and let $\tilde{q} = [\dots, {\tilde{q}}_{i j}, \dots] \in R^{m l}, (i, j) \in E^{*}$ with the same ordering of terms as the edge function $r_{G} (\cdot)$ . The distance error is given by

\begin{matrix} (2.6) & e_{i j} = ∥ {\tilde{q}}_{i j} ∥ - d_{i j} . \end{matrix}

Note that $(1.3)$ is equivalent to $e_{i j} (t) \to 0$ as $t \to \infty, i, j \in V^{*}$ . The distance error dynamics can be derived from $(2.6)$ and $(2.1)$ as

\begin{aligned} {\dot{e}}_{i j} & = \frac{d}{d t} (\sqrt{{\tilde{q}}_{i j}^{⊤} {\tilde{q}}_{i j}}) \\ = ({\tilde{q}}_{i j}^{⊤} {\tilde{q}}_{i j})^{- \frac{1}{2}} {\tilde{q}}_{i j}^{⊤} (u_{i} - u_{j}) \\ (2.7) & = \frac{{\tilde{q}}_{i j}^{⊤} (u_{i} - u_{j})}{e_{i j} + d_{i j}} . \end{aligned}

Let

\begin{matrix} (2.8) & z_{i j} = ∥ {\tilde{q}}_{i j} ∥^{2} - d_{i j}^{2}, \end{matrix}

which can be rewritten as

\begin{matrix} (2.9) & z_{i j} = e_{i j} (e_{i j} + 2 d_{i j}) \end{matrix}

using $(2.6)$ . Given that $∥ {\tilde{q}}_{i j} ∥ ⩾ 0$ (or equivalently, $e_{i j} ⩾ - d_{i j}$ ), it is not difficult to see that $z_{i j} = 0$ if and only if $e_{i j} = 0$ . We now introduce the following Lyapunov function candidate

\begin{matrix} (2.10) & W (e) = \frac{1}{4} \sum_{(i, j) \in E^{*}} z_{i j}^{2} = \frac{1}{4} z^{⊤} z \end{matrix}

where $e = [\dots, e_{i j}, \dots] \in R^{l}$ and $z = [\dots, z_{i j}, \dots] \in R^{l}, (i, j) \in E^{*}$ are ordered as $r_{G}$ . This function is positive definite in $e$ and its level surfaces, $W (e) = c$ for some $c > 0$ , are closed since $e_{i j} ⩾ - d_{i j}$ . The time derivative of $(2.10)$ along $(2.7)$ is given by

\begin{matrix} (2.11) & \dot{W} = \sum_{(i, j) \in E^{*}} e_{i j} (e_{i j} + 2 d_{i j}) {\tilde{q}}_{i j}^{⊤} (u_{i} - u_{j}) . \end{matrix}

Using the definition of the rigidity matrix $R_{D}$ , i.e., $(6)$ in Infinitesimal Rigidity, and $(2.9)$ , $(2.11)$ can be conveniently written as^[2]

\begin{matrix} (2.12) & \dot{W} = z^{⊤} R_{D} (\tilde{q}) u \end{matrix}

where $u = [u_{1}, \dots, u_{n}] \in R^{m n}$ is the stacked vector of control inputs. Before presenting the main result, we introduce a lemma that establishes the relationship between Corollary of Theorem 2 in Graph Rigidity and the level surfaces of the Lyapunov function candidate.

Lemma 2.1

For nonnegative constants $c$ and $δ$ , the level set $W (e) ⩽ c$ is equivalent to $Ψ (F, F^{*}) ⩽ δ$ where $Ψ$ and $W$ were defined in Corollary of Theorem 2 in Graph Rigidity and $(2.10)$ , respectively.

Proof:

First, from the definition of $Ψ (\cdot, \cdot)$ in Corollary of Theorem 2 in Graph Rigidity, $(1.1)$ , $(2.5)$ , $(2.6)$ , we have that

\begin{aligned} Ψ (F, F^{*}) & = \sum_{(i, j) \in E^{*}} (∥ q_{i} - q_{j} ∥ - ∥ q_{i}^{*} - q_{j}^{*} ∥)^{2} \\ = \sum_{(i, j) \in E^{*}} (∥ q_{i} - q_{j} ∥ - d_{i j})^{2} \\ (2.13) & = \sum_{(i, j) \in E^{*}} e_{i j}^{2} \end{aligned}

From $(2.10)$ , we know $W (e) ⩽ c$ implies that $e_{i j}, (i, j) \in E^{*}$ is bounded. This boundedness along with $(2.13)$ implies $Ψ (F, F^{*}) ⩽ δ$ where $δ$ is some nonnegative constant. Now, given $Ψ (F, F^{*}) ⩽ δ$ , it follows from $(2.13)$ that $e_{i j}$ is bounded for $(i, j) \in E^{*}$ . This implies $z_{i j}, (i, j) \in E^{*}$ is bounded, and $W (e) ⩽ c$ where $c$ is some nonnegative constant. Q.E.D.

◻
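The identities $(2.9)$ , $(2.11)$ , and $(2.12)$ used above can be sanity-checked numerically. The following sketch uses a triangle in the plane with randomly chosen positions and inputs (all values are arbitrary test data), and assumes the rigidity matrix has one row per edge with ${\tilde{q}}_{i j}^{⊤}$ in the block of agent $i$ and $- {\tilde{q}}_{i j}^{⊤}$ in the block of agent $j$ .

```python
import numpy as np

rng = np.random.default_rng(0)
edges = [(0, 1), (0, 2), (1, 2)]             # triangle, l = 3 edges
d = np.array([1.0, 1.0, 1.0])                # desired distances d_ij
q = rng.normal(size=(3, 2))                  # arbitrary positions, m = 2
u = rng.normal(size=(3, 2))                  # arbitrary control inputs

q_t = np.array([q[i] - q[j] for i, j in edges])   # relative positions (2.5)
e = np.linalg.norm(q_t, axis=1) - d               # distance errors (2.6)
z = np.sum(q_t ** 2, axis=1) - d ** 2             # squared-distance errors (2.8)

# Rigidity matrix: row k for edge (i, j) holds q_ij^T in block i, -q_ij^T in block j.
R_D = np.zeros((len(edges), q.size))
for k, (i, j) in enumerate(edges):
    R_D[k, 2 * i:2 * i + 2] = q_t[k]
    R_D[k, 2 * j:2 * j + 2] = -q_t[k]

# Edge-wise sum (2.11) versus the compact matrix form (2.12).
W_dot_sum = sum(z[k] * (q_t[k] @ (u[i] - u[j])) for k, (i, j) in enumerate(edges))
W_dot_mat = z @ R_D @ u.ravel()
print(np.allclose(z, e * (e + 2 * d)), np.isclose(W_dot_sum, W_dot_mat))
```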

The control law for solving the formation acquisition problem is given in the following theorem. Its structure is based on $(2.12)$ and Lyapunov stability theory. Specifically, the goal is to make the time derivative of the Lyapunov function candidate negative definite.

Theorem 2.1

Consider the formation $F (t) = (G^{*}, q (t))$ , and let the initial conditions of the error dynamics be such that $e (0) \in Ω_{1} \cap Ω_{2}$ where

\begin{aligned} Ω_{1} = {e \in R^{l} ∣ Ψ (F, F^{*}) ⩽ δ}, \\ (2.14) & Ω_{2} = {e \in R^{l} ∣ dist (q, Iso (F^{*})) < dist (q, Amb (F^{*}))}, \end{aligned}

and $δ$ is a sufficiently small positive constant. The control law^[3]

\begin{matrix} (2.15) & u = u_{a} := - k_{v} R_{D}^{⊤} (\tilde{q}) z, \end{matrix}

where $k_{v} > 0$ is a user-defined control gain, renders $e = 0$ exponentially stable and ensures $(1.2)$ is satisfied.

Proof:

Given that $F^{*}$ and $F (t)$ have the same number of edges and that $F^{*}$ is minimally rigid by design, then $F (t)$ is minimally rigid for all $t ⩾ 0$ . Substituting $(2.15)$ into $(2.12)$ yields

\begin{matrix} (2.16) & \dot{W} = - k_{v} z^{⊤} R_{D} (\tilde{q}) R_{D}^{⊤} (\tilde{q}) z . \end{matrix}

Since $F^{*}$ is infinitesimally rigid, we know from Corollary of Theorem 2 in Graph Rigidity that $F (t)$ is infinitesimally rigid for $e (t) \in Ω_{1}$ . Therefore, we know $F (t)$ is infinitesimally and minimally rigid for $e (t) \in Ω_{1}$ , so we can invoke Corollary of Theorem 3 in Minimal Rigidity to state

\begin{matrix} (2.17) & \dot{W} ⩽ - k_{v} λ_{min} (R_{D} R_{D}^{⊤}) z^{⊤} z = - 4 k_{v} λ_{min} (R_{D} R_{D}^{⊤}) W for e (t) \in Ω_{1} \end{matrix}

where $(2.10)$ was used. From $(2.17)$ , we know that $\dot{W} (t) ⩽ 0$ for all $t ⩾ 0$ ; hence, $W (t)$ is nonincreasing for all $t ⩾ 0$ . Then, since $e (t) \in Ω_{1}$ is equivalent to $e (t) \in {e \in R^{l} ∣ W (e) ⩽ c}$ from Lemma 2.1, a sufficient condition for $(2.17)$ is given by

\begin{matrix} (2.18) & \dot{W} ⩽ - 4 k_{v} λ_{min} (R_{D} R_{D}^{⊤}) W for e (0) \in Ω_{1} . \end{matrix}

From the form of $(2.18)$ and the fact that $W$ is positive definite in $e$ , we can invoke Corollary 1.1 to conclude that $e = 0$ is exponentially stable for $e (0) \in Ω_{1}$ . Given that $e$ is only defined for $(i, j) \in E^{*}$ , the exponential stability of $e = 0$ implies that $F (t) \to Iso (F^{*})$ or $F (t) \to Amb (F^{*})$ as $t \to \infty$ . If we choose $e (0) \in Ω_{1} \cap Ω_{2}$ , we have from $(2.14)$ that

\begin{matrix} (2.19) & dist (q (0), Iso (F^{*} (0))) < dist (q (0), Amb (F^{*} (0))) . \end{matrix}

Due to $(2.19)$ , the energy-like function $W (t)$ would need to increase for a period of time for $F (t) \to Amb (F^{*})$ as $t \to \infty$ , which is a contradiction since $(2.18)$ establishes that $W (t)$ is nonincreasing for all $t ⩾ 0$ . Therefore, we know $F (t) \to Iso (F^{*})$ as $t \to \infty$ for $e (0) \in Ω_{1} \cap Ω_{2}$ . This argument is conceptually illustrated by Figure 2.1, where the ball, representing $F (t)$ , would have to overcome the energy barrier to reach $Amb (F^{*})$ . Q.E.D.

◻

Figure 2.1: Energy landscape showing the two equilibrium points, $Iso (F^{*})$ and $Amb (F^{*})$ , at the bottom of each well.

The initial condition $e (0) \in Ω_{1} \cap Ω_{2}$ in Theorem 2.1 is a sufficient condition for the actual formation $F (t)$ to

Remain infinitesimally rigid for all time and
Be closer to a framework in $Iso (F^{*})$ at $t = 0$ than to one in $Amb (F^{*})$ in order to avoid converging to an ambiguous framework.

The former constraint is satisfied by $e (0) \in Ω_{1}$ while the latter is satisfied by $e (0) \in Ω_{2}$ . The set $Ω_{1} \cap Ω_{2}$ exists because it is always possible to select $F (0)$ sufficiently close to a framework in $Iso (F^{*})$ .

The control $(2.15)$ can be expressed element-wise as

\begin{matrix} (2.20) & u_{i} = - k_{v} \sum_{j \in N_{i} (E^{*})} {\tilde{q}}_{i j} z_{i j}, i = 1, \dots n, \end{matrix}

which is only a function of ${\tilde{q}}_{i j}$ and $d_{i j}$ for $(i, j) \in E^{*}$ . Thus, the control law is decentralized since it only requires the $i^{th}$ agent to measure its relative position to neighboring agents.
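A minimal simulation sketch of the element-wise law $(2.20)$ applied to the single-integrator model $(2.1)$ is given below. The three-agent triangle, the initial positions, the gain, and the forward Euler integration are all illustrative assumptions rather than values from the text. Each agent's input is built only from relative positions to its neighbors.

```python
import numpy as np

def acquisition_control(q, edges, d, k_v=1.0):
    """Element-wise law (2.20): u_i = -k_v * sum over neighbors j of q_ij * z_ij."""
    u = np.zeros_like(q)
    for (i, j), d_ij in zip(edges, d):
        q_ij = q[i] - q[j]                  # relative position (2.5)
        z_ij = q_ij @ q_ij - d_ij ** 2      # squared-distance error (2.8)
        u[i] -= k_v * z_ij * q_ij           # contribution of edge (i, j) to agent i
        u[j] += k_v * z_ij * q_ij           # agent j uses q_ji = -q_ij and z_ji = z_ij
    return u

edges = [(0, 1), (0, 2), (1, 2)]            # triangle: minimally rigid in the plane
d = np.array([1.0, 1.0, 1.0])               # desired inter-agent distances
q = np.array([[0.0, 0.0], [1.2, 0.1], [0.4, 0.9]])   # initial positions near the shape
centroid0 = q.mean(axis=0)

dt = 0.01
for _ in range(3000):                       # forward Euler integration of q' = u
    q = q + dt * acquisition_control(q, edges, d)

dist = np.array([np.linalg.norm(q[i] - q[j]) for i, j in edges])
max_err = np.max(np.abs(dist - d))
print(max_err)
```

In this sketch the edge contributions cancel in pairs, so the centroid of the agents stays where it started. This reflects the fact that the law regulates only the shape and not the global location of the formation.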

Notice that each individual term of the summation in $(2.20)$ is a vector whose direction is along ${\tilde{q}}_{i j}$ . If all $n$ agents are positioned collinearly at $t = 0$ , the control input of each one will necessarily be directed along the line. As a result, the agents will be stuck in a collinear formation and will never converge to the desired formation. In other words, the collinear formation is an invariant set. However, if at least one agent is not initially collinear with the others, the agents will not necessarily remain collinear because the edges between these agents and the noncollinear ones will create control components whose directions are not parallel to the line.
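The invariance of collinear configurations can be observed in a short experiment. In the sketch below, which uses hypothetical initial positions on the horizontal axis and a forward Euler step, the vertical coordinates never leave zero and the agents cannot reach the desired equilateral triangle.

```python
import numpy as np

def acquisition_control(q, edges, d, k_v=1.0):
    """Element-wise law (2.20) accumulated edge by edge."""
    u = np.zeros_like(q)
    for (i, j), d_ij in zip(edges, d):
        q_ij = q[i] - q[j]
        z_ij = q_ij @ q_ij - d_ij ** 2
        u[i] -= k_v * z_ij * q_ij
        u[j] += k_v * z_ij * q_ij
    return u

edges = [(0, 1), (0, 2), (1, 2)]
d = np.array([1.0, 1.0, 1.0])
q = np.array([[0.0, 0.0], [1.5, 0.0], [0.7, 0.0]])   # all agents on the x-axis

dt = 0.01
for _ in range(3000):
    q = q + dt * acquisition_control(q, edges, d)

stays_collinear = bool(np.all(q[:, 1] == 0.0))       # no motion off the line
dist = np.array([np.linalg.norm(q[i] - q[j]) for i, j in edges])
max_err = np.max(np.abs(dist - d))
print(stays_collinear, max_err)
```

Three collinear points cannot be pairwise one unit apart, so at least one distance error remains large no matter how long the simulation runs.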

The stability result of Theorem 2.1 guarantees that the desired formation is acquired up to rotation and translation. In other words, the formation acquisition controller does not regulate the formation to a pre-defined global location in space. This is a reflection of the facts that $u_{i}$ is not a function of $q_{i}$ but only of the relative positions ${\tilde{q}}_{i j}, (i, j) \in E^{*}$ and that the control objective is to regulate $‖ {\tilde{q}}_{i j} ‖$ .

Since we are only concerned with the inter-agent distances, any coordinate frame can be used to implement $u_{i}$ . That is, although the above analysis was done with the variables defined with respect to a common, fixed coordinate frame for convenience, $(2.20)$ can be implemented in practice with respect to the $i^{th}$ agent's own local coordinate frame. This means that the agents do not need to have a common sense of orientation and $(2.20)$ is rotationally invariant. To see this, let $F_{0}$ and $F_{i}$ denote the Earth-fixed coordinate frame and the local coordinate frame of the $i^{th}$ agent, respectively (see Figure 2.2). If $R_{i}^{0} \in SO (m)$ denotes the rotation matrix representing the orientation of $F_{i}$ with respect to $F_{0}$ , we have that

\begin{aligned} {\tilde{q}}_{i j} := {\tilde{q}}_{i j}^{0} = R_{i}^{0} {\tilde{q}}_{i j}^{i} \\ u_{i} := u_{i}^{0} = R_{i}^{0} u_{i}^{i} \end{aligned}

where the superscript denotes the coordinate frame in which the vector is specified. From $(2.20)$ , we can then write

\begin{aligned} u_{i}^{i} & = - k_{v} \sum_{j \in N_{i} (E^{*})} {(R_{i}^{0})}^{⊤} {\tilde{q}}_{i j} z_{i j} \\ & = - k_{v} \sum_{j \in N_{i} (E^{*})} {\tilde{q}}_{i j}^{i} z_{i j} \end{aligned}

since $z_{i j}$ is independent of the coordinate frame.

Figure 2.2: Fixed and local coordinate frames.
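The frame independence of $(2.20)$ can be verified with a small numerical check. In the sketch below, the heading angle, positions, and desired distances are hypothetical. The control is evaluated once from relative positions expressed in the fixed frame and once from the same vectors expressed in the agent's local frame, and the two results are compared after rotating the local one back.

```python
import numpy as np

k_v = 1.0
theta = 0.7                                   # hypothetical heading of the local frame
R_i0 = np.array([[np.cos(theta), -np.sin(theta)],
                 [np.sin(theta),  np.cos(theta)]])   # orientation of F_i w.r.t. F_0

q = np.array([[0.0, 0.0], [1.2, 0.1], [0.4, 0.9]])  # positions in the fixed frame
d = {1: 1.0, 2: 1.0}                          # desired distances from agent i = 0

def control(rel):
    """Law (2.20) for one agent; rel maps neighbor j to the relative position q_ij."""
    return -k_v * sum(r * (r @ r - d[j] ** 2) for j, r in rel.items())

rel_fixed = {j: q[0] - q[j] for j in d}                    # q_ij in F_0
rel_local = {j: R_i0.T @ r for j, r in rel_fixed.items()}  # q_ij in F_i

u_fixed = control(rel_fixed)
u_local = control(rel_local)
print(np.allclose(R_i0 @ u_local, u_fixed))
```

The check succeeds because $z_{i j}$ depends only on the norm of ${\tilde{q}}_{i j}$ , which a rotation does not change.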

Finally, the control $(2.15)$ is in fact the standard gradient descent law that often appears in the literature. If we rewrite $z$ as

\begin{matrix} (2.21) & z = r_{G} (q) - r_{G} (q^{*}) \end{matrix}

where $r_{G}$ and $(2.8)$ were used, it follows from $(2.10)$ that

\begin{matrix} (2.22) & W = \frac{1}{4} {‖ r_{G} (q) - r_{G} (q^{*}) ‖}^{2} . \end{matrix}

The derivative of $(2.22)$ with respect to $q$ is given by

\frac{\partial W}{\partial q} = \frac{1}{2} {(r_{G} (q) - r_{G} (q^{*}))}^{⊤} \frac{\partial r_{G} (q)}{\partial q} = {(r_{G} (q) - r_{G} (q^{*}))}^{⊤} R_{D} (\tilde{q})

where the definition of the rigidity matrix $R_{D}$ was used. Therefore,

u = - \nabla_{q} W = - {(\frac{\partial W}{\partial q})}^{⊤} = - R_{D}^{⊤} (\tilde{q}) z,

which is the same as $(2.15)$ without the control gain. That is, since $(2.22)$ (also called a potential function) has a minimum when $r_{G} (q) = r_{G} (q^{*})$ , it is well known from optimization theory that the negative gradient causes the system trajectory to approach the local minimum.
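The gradient interpretation can be confirmed by comparing $R_{D}^{⊤} (\tilde{q}) z$ with a finite-difference gradient of the potential function $(2.22)$ . The sketch below uses a hypothetical four-agent planar graph with five edges and random positions, and the rigidity matrix is assumed to follow the row convention ${\tilde{q}}_{i j}^{⊤}$ in block $i$ and $- {\tilde{q}}_{i j}^{⊤}$ in block $j$ .

```python
import numpy as np

edges = [(0, 1), (0, 2), (1, 2), (1, 3), (2, 3)]   # hypothetical planar graph
d = np.ones(len(edges))                            # desired distances (arbitrary)
n, m = 4, 2

def z_vec(q):
    """Squared-distance errors (2.8) stacked in edge order."""
    return np.array([(q[i] - q[j]) @ (q[i] - q[j]) for i, j in edges]) - d ** 2

def W(q_flat):
    """Potential function (2.22): W = (1/4) * ||r_G(q) - r_G(q*)||^2."""
    z = z_vec(q_flat.reshape(n, m))
    return 0.25 * z @ z

def rigidity_matrix(q):
    R = np.zeros((len(edges), n * m))
    for k, (i, j) in enumerate(edges):
        R[k, m * i:m * (i + 1)] = q[i] - q[j]
        R[k, m * j:m * (j + 1)] = -(q[i] - q[j])
    return R

rng = np.random.default_rng(1)
q = rng.normal(size=(n, m))

grad_analytic = rigidity_matrix(q).T @ z_vec(q)    # R_D^T z

# Central finite differences of W with respect to each coordinate of q.
h = 1e-6
grad_fd = np.zeros(n * m)
for k in range(n * m):
    step = np.zeros(n * m)
    step[k] = h
    grad_fd[k] = (W(q.ravel() + step) - W(q.ravel() - step)) / (2 * h)

print(np.max(np.abs(grad_fd - grad_analytic)))
```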

2.2 Formation Maneuvering

In this section, we solve the formation maneuvering problem defined in Section 1 using model $(2.1)$ . Since formation acquisition is embedded in the formation maneuvering problem, we use $(2.12)$ as the starting point. The control law here will take the form $u_{i} = u_{i} ({\tilde{q}}_{i j}, d_{i j}, v_{d i}), i = 1, \dots, n$ and $j \in N_{i} (E^{*})$ where $v_{d i} (t)$ , which was defined in $(1.4)$ , is a bounded continuous function.

Theorem 2.2

Consider the formation $F (t) = (G^{*}, q (t))$ with the initial conditions on $e (0)$ given in Theorem 2.1. Then, the control

\begin{matrix} (2.23) & u = u_{a} + v_{d}, \end{matrix}

where $u_{a}$ was defined in $(2.15)$ and $v_{d} = [v_{d 1}, \dots, v_{d n}] \in R^{3 n}$ is the desired rigid body velocity specified by^[4]

\begin{matrix} (2.24) & v_{d i} = v_{0} + ω_{0} \times {\tilde{q}}_{i n}, i = 1, \dots, n \end{matrix}

with $v_{0} (t) \in R^{3}$ denoting the desired translation velocity of the formation and $ω_{0} (t) \in R^{3}$ the desired angular velocity, renders $e = 0$ exponentially stable and ensures that $(1.2)$ and $(1.4)$ are satisfied.

Proof:

Substituting $(2.23)$ into $(2.12)$ yields

\begin{matrix} (2.25) & \dot{W} = - k_{v} z^{⊤} R_{D} (\tilde{q}) R_{D}^{⊤} (\tilde{q}) z + z^{⊤} R_{D} (\tilde{q}) v_{d} . \end{matrix}

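The row of $R_{D} (\tilde{q}) v_{d}$ associated with edge $(i, j)$ has the form ${\tilde{q}}_{i j}^{⊤} (v_{d i} - v_{d j}) = {\tilde{q}}_{i j}^{⊤} (ω_{0} \times {\tilde{q}}_{i j}) = 0$ , since $ω_{0} \times {\tilde{q}}_{i j}$ is orthogonal to ${\tilde{q}}_{i j}$ . Hence, it follows from (1.20) and $(2.24)$ that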

\begin{matrix} (2.26) & R_{D} (\tilde{q}) v_{d} = 0 . \end{matrix}

Therefore, the proof of Theorem 2.1 can be directly followed to show that $e = 0$ is exponentially stable for $e (0) \in Ω_{1} \cap Ω_{2}$ and $(1.2)$ holds.

From $(2.9)$ it is clear that $z \to 0$ as $e \to 0$ . The exponential stability of $e = 0$ implies that $\tilde{q}$ is bounded from $(2.6)$ . Therefore, $R_{D} (\tilde{q})$ is bounded and we know from $(2.23)$ and $(2.15)$ that

\begin{matrix} (2.27) & u \to v_{d} as e \to 0 \end{matrix}

Since we proved that $e (t) \to 0$ as $t \to \infty$ , it follows from $(2.27)$ and $(2.1)$ that $(1.4)$ holds. Q.E.D.

◻

The control $(2.23)$ has two independent components: the term $u_{a}$ is responsible for formation acquisition while $v_{d}$ is responsible for rigid body maneuvers of the whole formation. We can see from $(2.26)$ that the control exploits the special structure of the rigidity matrix to disassociate the formation acquisition stability analysis from the formation maneuvering analysis.
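The decoupling expressed by $(2.26)$ can be checked numerically for any configuration. In the sketch below, the positions, the nine-edge graph, and the values of $v_{0}$ and $ω_{0}$ are hypothetical test data. The desired velocity $(2.24)$ is formed with the last agent as the leader, and the rigidity matrix is assumed to follow the row convention ${\tilde{q}}_{i j}^{⊤}$ in block $i$ and $- {\tilde{q}}_{i j}^{⊤}$ in block $j$ .

```python
import numpy as np

rng = np.random.default_rng(2)
n, m = 5, 3
q = rng.normal(size=(n, m))                   # arbitrary positions; leader is q[-1]
edges = [(0, 1), (0, 2), (1, 2), (1, 3), (2, 3),
         (0, 4), (1, 4), (2, 4), (3, 4)]      # every follower is linked to the leader

v_0 = np.array([0.3, -0.1, 0.2])              # desired translation velocity
w_0 = np.array([0.0, 0.0, 0.5])               # desired angular velocity

# Desired rigid body velocity (2.24): v_di = v_0 + w_0 x (q_i - q_n) for each agent.
v_d = v_0 + np.cross(w_0, q - q[-1])

R_D = np.zeros((len(edges), n * m))
for k, (i, j) in enumerate(edges):
    R_D[k, m * i:m * (i + 1)] = q[i] - q[j]
    R_D[k, m * j:m * (j + 1)] = -(q[i] - q[j])

residual = R_D @ v_d.ravel()                  # should vanish, as in (2.26)
print(np.max(np.abs(residual)))
```

Because the residual vanishes for every configuration, adding $v_{d}$ to the control does not alter $\dot{W}$ , which is why the acquisition analysis carries over unchanged.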

Another interesting point is that, despite being based on the single-integrator model, $(2.24)$ is generally not open-loop in nature since it depends on feedback of ${\tilde{q}}_{i n}$ . That is, $(2.24)$ has an open-loop form only when the maneuver is purely translational.

The control law can be written element-wise as

u_{i} = - k_{v} \sum_{j \in N_{i} (E^{*})} {\tilde{q}}_{i j} z_{i j} + v_{0} + ω_{0} \times {\tilde{q}}_{i n}, i = 1, \dots n,

which shows that it is decentralized. Note that in many applications the signals $v_{0}$ and $ω_{0}$ are known a priori and therefore can be stored on each agent's onboard computer. Also, since ${\tilde{q}}_{n n} = 0$ , the formation maneuvering term of the leader only has the translation component $v_{0}$ . This is expected since the leader by design lies on the axis of rotation of the virtual rigid body.

2.3 Flocking

Here, we consider the special case of formation maneuvering where the desired velocity only includes the translation component. Recall from Section 1 that this is commonly referred to as flocking. Unlike in the previous section, we consider that the desired flocking velocity is only available to a subset of agents. We will overcome this constraint by employing a distributed observer that estimates this velocity by exploiting the connectedness of the formation graph.

Constant Flocking Velocity

We first consider the case where the flocking velocity is constant. Let $v_{0} \in R^{m}$ be the constant flocking velocity and $V_{0} \subset V^{*}$ be the nonempty subset of agents that have direct access to $v_{0}$ . To solve this flocking problem, we use the continuous controller-observer scheme

\begin{aligned} (2.28a) & u = u_{a} + \hat{v} \\ (2.28b) & {\dot{\hat{v}}}_{i} = - α \sum_{j \in N_{i} (E^{*})} ({\hat{v}}_{i} - {\hat{v}}_{j}) - α b_{i} ({\hat{v}}_{i} - v_{0}), i = 1, \dots n \end{aligned}

where

\begin{matrix} (2.29) & b_{i} = \begin{cases} 1, & if i \in V_{0} \\ 0, & otherwise \end{cases} \end{matrix}

$u_{a}$ was defined in $(2.15)$ , $\hat{v} = [{\hat{v}}_{1}, \dots, {\hat{v}}_{n}] \in R^{m n}$ contains the velocity estimates for each agent, and $α > 0$ is a user-defined observer gain.

Theorem 2.3

Consider the formation $F (t) = (G^{*}, q (t))$ with the initial conditions in Theorem 2.1. Then, the controller-observer scheme $(2.28)$ with any $\hat{v} (0)$ renders $e = 0$ asymptotically stable and ensures that $(1.2)$ and $(1.4)$ are satisfied with $v_{d i} = v_{0}, i = 1, \dots, n$ .

Proof:

Let

\begin{matrix} (2.30) & {\tilde{v}}_{i} = {\hat{v}}_{i} - v_{0} \end{matrix}

denote the flocking velocity estimation error for agent $i$ . If $\tilde{v} = [{\tilde{v}}_{1}, \dots, {\tilde{v}}_{n}] \in R^{m n}$ , then

\begin{matrix} (2.31) & \tilde{v} = \hat{v} - 1_{n} \otimes v_{0} . \end{matrix}

As part of this proof, we will show that $(2.28 b)$ guarantees $\tilde{v} (t) \to 0$ as $t \to \infty$ . From the time derivative of $(2.8)$ , we have that

\begin{matrix} (2.32) & \dot{z} = 2 R_{D} (\tilde{q}) u . \end{matrix}

After substituting $(2.28 a)$ into $(2.32)$ , we get the closed-loop system

\begin{matrix} (2.33) & \dot{z} = - 2 k_{v} R_{D} (\tilde{q}) R_{D}^{⊤} (\tilde{q}) z + 2 R_{D} (\tilde{q}) \hat{v} . \end{matrix}

Using $(2.31)$ in $(2.33)$ yields

\begin{matrix} (2.34) & \dot{z} = - 2 k_{v} R_{D} (\tilde{q}) R_{D}^{⊤} (\tilde{q}) z + 2 R_{D} (\tilde{q}) \tilde{v} \end{matrix}

upon application of Property $R_{D} (p) (1_{n} \otimes x) = 0$ in Infinitesimal Rigidity.

Now, we turn our attention to deriving the dynamics of the estimation error. First, notice that

\sum_{j \in N_{i} (E^{*})} ({\hat{v}}_{i} - {\hat{v}}_{j}) = \sum_{j = 1}^{n} a_{i j} ({\hat{v}}_{i} - {\hat{v}}_{j})

where $a_{i j}$ are the elements of the adjacency matrix. Taking the time derivative of $(2.31)$ and substituting $(2.28 b)$ gives

\begin{aligned} \dot{\tilde{v}} & = - α (L \otimes I_{m}) \tilde{v} - α (B \otimes I_{m}) \tilde{v} \\ (2.35) & = - α (M \otimes I_{m}) \tilde{v} \end{aligned}

where we used the fact that ${\hat{v}}_{i} - {\hat{v}}_{j} = {\tilde{v}}_{i} - {\tilde{v}}_{j}$ , $B := diag (b_{1}, \dots, b_{n})$ , $L$ is the Laplacian matrix defined in $(1.4)$ , and $M := L + B$ is symmetric. Our overall closed-loop system is composed of two interconnected subsystems, $(2.34)$ and $(2.35)$ , which are in the form of $(1.6)$ . Notice that $(2.34)$ is input-to-state stable with respect to the input $\tilde{v}$ by Theorem 1.4 since, with $\tilde{v} = 0$ , it reduces to the closed-loop system analyzed in Theorem 2.1. Since the graph of a rigid framework is always connected, we know that $G^{*}$ is connected. Therefore, $M$ is positive definite by Lemma 1.1, and so is $M \otimes I_{m}$ because its eigenvalues are those of $M$ , each repeated $m$ times. It then follows from $(2.35)$ that $\tilde{v} = 0$ is exponentially stable. We can now invoke Theorem 1.5 to claim that $(z, \tilde{v}) = 0$ is an asymptotically stable equilibrium point of the interconnected system. Since $z = 0$ if and only if $e = 0$ , we know $e = 0$ is asymptotically stable. Moreover, by virtue of the initial conditions, we know that $F (t) \to Iso (F^{*})$ as $t \to \infty$ as argued in the proof of Theorem 2.1.

Finally, due to the asymptotic stability of $e = 0$ , we know $u_{a} (t) \to 0$ as $t \to \infty$ and therefore from $(2.28 a)$ that $u (t) - \hat{v} (t) \to 0$ as $t \to \infty$ . Since ${\tilde{v}}_{i} (t) = {\hat{v}}_{i} (t) - v_{0} \to 0$ as $t \to \infty$ , then we know from $(2.1)$ that $(1.4)$ holds. Q.E.D.

The form of $(2.28 b)$ is inspired by multi-agent consensus algorithms. The premise behind the observer is that agents that do not have direct access to $v_{0}$ can acquire this information from their neighbors since the graph modeling the communication network is connected. Note that the observer $(2.28 b)$ can accommodate a leader-follower strategy (only one agent has access to $v_{0}$ ) as well as the general case where the velocity information exchange happens between any two agents.
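The leader-follower case can be illustrated with a small simulation. The sketch below assumes that $(2.28 b)$ has the linear consensus form implied by $(2.35)$ , namely ${\dot{\hat{v}}}_{i} = - α [\sum_{j \in N_{i} (E^{*})} ({\hat{v}}_{i} - {\hat{v}}_{j}) + b_{i} ({\hat{v}}_{i} - v_{0})]$ . The path graph, the gains, the scalar velocity ( $m = 1$ ), and the function name are hypothetical choices made only for illustration.

```python
def simulate_consensus_observer(neighbors, b, v0, v_hat0,
                                alpha=2.0, dt=0.01, steps=6000):
    """Forward-Euler integration of the assumed linear observer
        d/dt v_hat_i = -alpha * (sum_j (v_hat_i - v_hat_j) + b_i * (v_hat_i - v0))
    for scalar velocity estimates (m = 1)."""
    v_hat = list(v_hat0)
    for _ in range(steps):
        rates = []
        for i, nbrs in enumerate(neighbors):
            # Disagreement with the neighbors' estimates.
            disagreement = sum(v_hat[i] - v_hat[j] for j in nbrs)
            rates.append(-alpha * (disagreement + b[i] * (v_hat[i] - v0)))
        v_hat = [v + dt * r for v, r in zip(v_hat, rates)]
    return v_hat


# Hypothetical path graph 0 - 1 - 2 - 3; only agent 0 measures v0,
# so b = [1, 0, 0, 0].
neighbors = [[1], [0, 2], [1, 3], [2]]
b = [1.0, 0.0, 0.0, 0.0]
estimates = simulate_consensus_observer(neighbors, b, v0=1.5,
                                        v_hat0=[0.0, -2.0, 4.0, 1.0])
print(estimates)  # every entry should end up close to v0 = 1.5
```

Agents 1 to 3 never see $v_{0}$ directly, yet their estimates converge because the information propagates along the connected graph. The convergence rate is governed by $α$ times the smallest eigenvalue of $M$ , which is small for long paths, hence the long simulation horizon.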

Time-Varying Flocking Velocity

The observer scheme in $(2.28 b)$ cannot be proven to ensure $\tilde{v} (t) \to 0$ as $t \to \infty$ for the case where the flocking velocity varies with time. In this situation, one can use the variable structure-type observer

\begin{matrix} (2.36) & {\dot{\hat{v}}}_{i} = - α sgn (\sum_{j \in N_{i} (E^{*})} ({\hat{v}}_{i} - {\hat{v}}_{j}) + b_{i} ({\hat{v}}_{i} - v_{0})), i = 1, \dots, n \end{matrix}

where $v_{0} (t) \in L_{\infty}$ is the time-varying flocking velocity, which is assumed to be differentiable with ${‖ {\dot{v}}_{0} (t) ‖}_{L_{\infty}} ⩽ γ$ for all time, $b_{i}$ was defined in $(2.29)$ , and $sgn (\cdot)$ is the standard signum function:

\begin{matrix} (2.37) & sgn (x) = \begin{cases} 1 & \text{for } x > 0 \\ 0 & \text{for } x = 0 \\ - 1 & \text{for } x < 0 \end{cases} \end{matrix}

The dynamics of the estimation error now become

\begin{matrix} (2.38) & \dot{\tilde{v}} = - α sgn ((M \otimes I_{m}) \tilde{v}) - 1_{n} \otimes {\dot{v}}_{0} \end{matrix}

where $sgn (x) = [sgn (x_{1}), \dots, sgn (x_{n})], \forall x \in R^{n}$ . Notice that $(2.38)$ has a discontinuous right-hand side; thus, its solution needs to be studied using nonsmooth analysis. Since $sgn (\cdot)$ is Lebesgue measurable and essentially locally bounded, one can show the existence of generalized solutions by embedding the differential equation into the differential inclusion

\begin{matrix} (2.39) & \dot{\tilde{v}} \in K [f] (\tilde{v}, t) \end{matrix}

where $K [\cdot]$ is a nonempty, compact, convex, upper semicontinuous set-valued map and $f (\tilde{v}, t) = - α sgn ((M \otimes I_{m}) \tilde{v}) - 1_{n} \otimes {\dot{v}}_{0}$ .

If we define the Lyapunov function candidate

\begin{matrix} (2.40) & W_{f} = \frac{1}{2} {\tilde{v}}^{⊤} (M \otimes I_{m}) \tilde{v} \end{matrix}

we get that

\begin{aligned} {\dot{W}}_{f} & \overset{a.e.}{\in} \frac{\partial W_{f}}{\partial \tilde{v}} K [f] (\tilde{v}, t) \\ (2.41) & \subset - α {\tilde{v}}^{⊤} (M \otimes I_{m}) sgn ((M \otimes I_{m}) \tilde{v}) - {\tilde{v}}^{⊤} (M \otimes I_{m}) (1_{n} \otimes {\dot{v}}_{0}) \end{aligned}

where a.e. is the abbreviation for the term "almost everywhere". If we define $SGN (x) := [SGN (x_{1}), \dots, SGN (x_{n})], \forall x \in R^{n}$ where

\begin{matrix} (2.42) & SGN (x_{i}) = \begin{cases} 1 & \text{for } x_{i} > 0 \\ [- 1, 1] & \text{for } x_{i} = 0 \\ - 1 & \text{for } x_{i} < 0 \end{cases} \end{matrix}

then $(2.41)$ becomes

\begin{aligned} {\dot{W}}_{f} & = - α {\tilde{v}}^{⊤} (M \otimes I_{m}) SGN ((M \otimes I_{m}) \tilde{v}) - {\tilde{v}}^{⊤} (M \otimes I_{m}) (1_{n} \otimes {\dot{v}}_{0}) \\ & = - α {‖ (M \otimes I_{m}) \tilde{v} ‖}_{1} - {(1_{n} \otimes {\dot{v}}_{0})}^{⊤} (M \otimes I_{m}) \tilde{v} \\ & = - α {‖ (M \otimes I_{m}) \tilde{v} ‖}_{1} - \sum_{k = 1}^{n} {\dot{v}}_{0}^{⊤} {[(M \otimes I_{m}) \tilde{v}]}_{k} \\ & ⩽ - α {‖ (M \otimes I_{m}) \tilde{v} ‖}_{1} + {‖ {\dot{v}}_{0} ‖}_{1} {‖ (M \otimes I_{m}) \tilde{v} ‖}_{1} \\ (2.43) & ⩽ - (α - γ) {‖ (M \otimes I_{m}) \tilde{v} ‖}_{1} \end{aligned}

where ${[(M \otimes I_{m}) \tilde{v}]}_{k} \in R^{m}$ denotes the $k$ th block of $(M \otimes I_{m}) \tilde{v}$ . By choosing $α > γ$ , we get that ${\dot{W}}_{f}$ is negative definite. Therefore, from Theorem 1.6, we know that $\tilde{v} = 0$ is asymptotically stable.

Now the proof that $(2.15)$ and $(2.36)$ guarantee that $(1.2)$ and $(1.4)$ are satisfied directly follows from the proof of Theorem 2.3.
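The behavior of $(2.36)$ can be checked numerically. The following is a minimal sketch, assuming scalar estimates ( $m = 1$ ), a hypothetical three-agent path graph in which only agent 0 has access to $v_{0} (t)$ , and the hypothetical signal $v_{0} (t) = 1 + \sin t$ , so that $γ = 1$ and $α = 2$ satisfies $α > γ$ . Forward-Euler integration of a discontinuous right-hand side produces chattering, so the error settles in a small band around zero rather than at exactly zero.

```python
import math


def sgn(x):
    """Signum function (2.37)."""
    return (x > 0) - (x < 0)


def flocking_velocity(t):
    """Hypothetical time-varying v0(t); its derivative is bounded by gamma = 1."""
    return 1.0 + math.sin(t)


def simulate_vs_observer(neighbors, b, alpha=2.0, dt=5e-4, t_end=10.0):
    """Forward-Euler integration of (2.36) for scalar estimates (m = 1).
    Returns the final estimation errors v_hat_i - v0."""
    v_hat = [0.0] * len(b)
    steps = int(round(t_end / dt))
    for k in range(steps):
        v0 = flocking_velocity(k * dt)
        rates = []
        for i, nbrs in enumerate(neighbors):
            # Argument of sgn in (2.36): neighbor disagreement plus leader term.
            arg = sum(v_hat[i] - v_hat[j] for j in nbrs) + b[i] * (v_hat[i] - v0)
            rates.append(-alpha * sgn(arg))
        v_hat = [v + dt * r for v, r in zip(v_hat, rates)]
    v0_final = flocking_velocity(steps * dt)
    return [v - v0_final for v in v_hat]


# Path graph 0 - 1 - 2; only agent 0 has access to v0(t).
errors = simulate_vs_observer([[1], [0, 2], [1]], [1.0, 0.0, 0.0])
print(max(abs(e) for e in errors))  # small, on the order of the chattering band
```

Reducing the step size `dt` narrows the chattering band, which is consistent with the continuous-time result that $\tilde{v} (t) \to 0$ .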

2.4 Target Interception with Unknown Target Velocity

We now turn our attention to the target interception problem defined in Section 1. We assume the target motion is such that $q_{T} (t)$ is three times continuously differentiable and $d^{i} q_{T} / d t^{i} \in L_{\infty}, i = 0, 1, 2, 3$ . Furthermore, we consider the target velocity ${\dot{q}}_{T}$ to be unknown to all agents, but that the leader can measure the target's relative position $q_{T} - q_{n}$ with its onboard sensors and can broadcast this information to the followers.

To simplify the notation, we let $v_{T} := {\dot{q}}_{T}$ and

\begin{matrix} (2.44) & e_{T} = q_{T} - q_{n} \end{matrix}

denote the interception error between the leader and target. The control, which will include a term to "learn" the unknown target velocity, will take the general form $u_{i} = u_{i} ({\tilde{q}}_{i j}, d_{i j}, e_{T}, {\hat{v}}_{T}), i = 1, \dots, n$ and $j \in N_{i} (E^{*})$ where ${\hat{v}}_{T}$ is the target velocity estimate. This term is generated by the following continuous dynamic estimation mechanism

\begin{matrix} (2.45) & {\hat{v}}_{T} (t) = \int_{0}^{t} [k_{1} e_{T} (τ) + k_{2} sgn (e_{T} (τ))] d τ \end{matrix}

where $k_{1}, k_{2} > 0$ are user-defined control gains. This mechanism allows one to learn or compensate for sufficiently smooth, nonperiodic signals.

Before presenting the control law, we introduce a lemma related to $(2.45)$ .

Lemma 2.2

Let

\begin{matrix} (2.46) & L := {(k_{1} e_{T} + {\dot{e}}_{T})}^{⊤} ({\dot{v}}_{T} - k_{2} sgn (e_{T})) . \end{matrix}

If $k_{2}$ in (2.45) is selected to satisfy the following sufficient condition

\begin{matrix} (2.47) & k_{2} > {‖ {\dot{v}}_{T} ‖}_{L_{\infty}} + \frac{1}{k_{1}} {‖ {\ddot{v}}_{T} ‖}_{L_{\infty}}, \end{matrix}

then

\begin{matrix} (2.48) & \int_{0}^{t} L (τ) d τ ⩽ ζ_{b} \end{matrix}

where the positive constant $ζ_{b}$ is defined as

\begin{matrix} (2.49) & ζ_{b} = k_{2} {‖ e_{T} (0) ‖}_{1} - e_{T}^{⊤} (0) {\dot{v}}_{T} (0) \end{matrix}

Proof:

Integrating $(2.46)$ over time yields

\begin{aligned} \int_{0}^{t} L (τ) d τ = & \int_{0}^{t} {(k_{1} e_{T} (τ) + {\dot{e}}_{T} (τ))}^{⊤} [{\dot{v}}_{T} (τ) - k_{2} sgn (e_{T} (τ))] d τ \\ = & \int_{0}^{t} k_{1} e_{T}^{⊤} (τ) [{\dot{v}}_{T} (τ) - k_{2} sgn (e_{T} (τ))] d τ + \int_{0}^{t} {\dot{e}}_{T}^{⊤} (τ) {\dot{v}}_{T} (τ) d τ \\ (2.50) & - \int_{0}^{t} k_{2} {\dot{e}}_{T}^{⊤} (τ) sgn (e_{T} (τ)) d τ \end{aligned}

After integrating by parts the second integral on the right-hand side of (2.50) and applying Lemma 1 of [44] to the third integral, we obtain

\begin{aligned} \int_{0}^{t} L (τ) d τ = & \int_{0}^{t} k_{1} e_{T}^{⊤} (τ) [{\dot{v}}_{T} (τ) - k_{2} sgn (e_{T} (τ))] d τ \\ + {e_{T}^{⊤} (τ) {\dot{v}}_{T} (τ) |}_{0}^{t} - \int_{0}^{t} e_{T}^{⊤} (τ) {\ddot{v}}_{T} (τ) d τ - {k_{2} {‖ e_{T} (τ) ‖}_{1} |}_{0}^{t} \\ = & \int_{0}^{t} k_{1} e_{T}^{⊤} (τ) [{\dot{v}}_{T} (τ) - \frac{1}{k_{1}} {\ddot{v}}_{T} (τ) - k_{2} sgn (e_{T} (τ))] d τ \\ (2.51) & + e_{T}^{⊤} (t) {\dot{v}}_{T} (t) - e_{T}^{⊤} (0) {\dot{v}}_{T} (0) - k_{2} {‖ e_{T} (t) ‖}_{1} + k_{2} {‖ e_{T} (0) ‖}_{1} \end{aligned}

Using the fact that $∥ x ∥_{1} ⩾ ∥ x ∥$ for any $x \in R^{n}$ , we can upper bound the right-hand side of $(2.51)$ by

\begin{aligned} \int_{0}^{t} L (τ) d τ ⩽ & \int_{0}^{t} k_{1} ‖ e_{T} (τ) ‖ (‖ {\dot{v}}_{T} (τ) ‖ + \frac{1}{k_{1}} ‖ {\ddot{v}}_{T} (τ) ‖ - k_{2}) d τ \\ (2.52) & + ‖ e_{T} (t) ‖ (‖ {\dot{v}}_{T} (t) ‖ - k_{2}) + k_{2} {‖ e_{T} (0) ‖}_{1} - e_{T}^{⊤} (0) {\dot{v}}_{T} (0) \end{aligned}

Applying $(2.47)$ to $(2.52)$ gives $(2.48)$ . Finally, the positiveness of (2.49) follows from the fact that

k_{2} {‖ e_{T} (0) ‖}_{1} - e_{T}^{⊤} (0) {\dot{v}}_{T} (0) ⩾ ‖ e_{T} (0) ‖ (k_{2} - ‖ {\dot{v}}_{T} (0) ‖) > 0

when $k_{2}$ is selected according to $(2.47)$ . Q.E.D.

Theorem 2.4

Consider the formation $F (t) = (G^{*}, q (t))$ with the initial conditions on $e (0)$ given in Theorem 2.1. Then, the control

\begin{matrix} (2.53) & u = u_{a} + 1_{n} \otimes h, \end{matrix}

where $u_{a} = [u_{a 1}, \dots, u_{a n}]$ was defined in $(2.15)$ and

\begin{matrix} (2.54) & h = (k_{1} + 1) e_{T} + {\hat{v}}_{T} - u_{a n}, \end{matrix}

renders $e = 0$ exponentially stable and ensures that $(1.2)$ and $(1.5)$ are satisfied. Further, the target velocity can be identified in the sense that $v_{T} (t) - {\hat{v}}_{T} (t) \to 0$ as $t \to \infty$ .

Proof:

After substituting $(2.53)$ into $(2.12)$ , we obtain

\begin{matrix} (2.55) & \dot{W} = - k_{v} z^{⊤} R_{D} (\tilde{q}) R_{D}^{⊤} (\tilde{q}) z + z^{⊤} R_{D} (\tilde{q}) (1_{n} \otimes h) . \end{matrix}

Due to the property $R_{D} (p) (1_{n} \otimes x) = 0$ from Infinitesimal Rigidity, the second term on the right-hand side of $(2.55)$ vanishes and the proof of Theorem 2.1 can again be followed to prove the exponential stability of $e = 0$ and $(1.2)$ .

We now proceed to prove $(1.5)$ . From $(2.53)$ and $(2.54)$ , we have that the leader control input is^[5]

\begin{matrix} (2.56) & u_{n} = (k_{1} + 1) e_{T} + {\hat{v}}_{T} . \end{matrix}

Differentiating $(2.44)$ and using $(2.56)$ yields

\begin{aligned} (2.57) & {\dot{e}}_{T} & = v_{T} - u_{n} \\ (2.58) & & = v_{T} - (k_{1} + 1) e_{T} - {\hat{v}}_{T} \\ (2.59) & & = - k_{1} e_{T} + w \end{aligned}

where

\begin{matrix} (2.60) & w = v_{T} - e_{T} - {\hat{v}}_{T} \end{matrix}

The derivative of $(2.60)$ is given by

\begin{matrix} (2.61) & \dot{w} = {\dot{v}}_{T} - {\dot{e}}_{T} - k_{1} e_{T} - k_{2} sgn (e_{T}) = - w + {\dot{v}}_{T} - k_{2} sgn (e_{T}) \end{matrix}

where $(2.45)$ and $(2.59)$ were used. Next, define the auxiliary function

\begin{matrix} (2.62) & P = \frac{1}{2} w^{⊤} w, \end{matrix}

whose derivative along $(2.61)$ is given by

\begin{matrix} (2.63) & \dot{P} = w^{⊤} (- w + {\dot{v}}_{T} - k_{2} sgn (e_{T})) = - w^{⊤} w + L \end{matrix}

where $(2.46)$ was used. After integrating both sides of $(2.63)$ with respect to time and applying Lemma 2.2, we obtain

\begin{aligned} \int_{0}^{t} \dot{P} (τ) d τ = P (t) - P (0) & = - \int_{0}^{t} w^{⊤} (τ) w (τ) d τ + \int_{0}^{t} L (τ) d τ \\ (2.64) & ⩽ - \int_{0}^{t} w^{⊤} (τ) w (τ) d τ + ζ_{b} ⩽ ζ_{b} \end{aligned}

Since $P (0)$ is finite, it follows from (2.64) that $P (t) \in L_{\infty}$ , which implies that $w (t) \in L_{\infty}$ from (2.62). From (2.64), we also have that

\int_{0}^{t} w^{⊤} (τ) w (τ) d τ ⩽ ζ_{b} + P (0) - P (t) < \infty

which means that $w (t) \in L_{2}$ . Therefore, we know from $(2.59)$ and Theorem 1.1 that $e_{T} (t) \to 0$ as $t \to \infty$ . We can also use $(2.59)$ to claim that ${\dot{e}}_{T} \in L_{\infty}$ , which implies from $(2.57)$ (together with the boundedness of $v_{T} (t)$ ) that $u_{n} (t) \in L_{\infty}$ . From $(2.56)$ , we then know that ${\hat{v}}_{T} (t) \in L_{\infty}$ . Since (1.26) holds and $F^{*}$ is constructed such that $q_{n}^{*} \in conv \{q_{1}^{*}, \dots, q_{n - 1}^{*}\}$ , we know that $q_{n} (t) \in conv \{q_{1} (t), q_{2} (t), \dots, q_{n - 1} (t)\}$ as $t \to \infty$ . Therefore, from the fact that $e_{T} (t) \to 0$ as $t \to \infty$ , we conclude that $(1.5)$ holds.

Finally, we know $\dot{w} (t) \in L_{\infty}$ from $(2.61)$ since ${\dot{v}}_{T}$ is assumed bounded. It then follows from Theorem 1.3 that $w (t) \to 0$ as $t \to \infty$ . Therefore, we can use (2.59) to show that ${\dot{e}}_{T} (t) \to 0$ as $t \to \infty$ , and then (2.58) to conclude that $v_{T} (t) - {\hat{v}}_{T} (t) \to 0$ as $t \to \infty$ . Q.E.D.

Similar to the formation maneuvering control, the target interception controller $(2.53)$ and $(2.54)$ has two components with well-defined roles:

$u_{a}$ ensures formation acquisition
$h$ guarantees target interception

The controller for the followers can be written element-wise as

u_{i} = - k_{v} \sum_{j \in N_{i} (E^{*})} {\tilde{q}}_{i j} z_{i j} + (k_{1} + 1) e_{T} + \int_{0}^{t} [k_{1} e_{T} (τ) + k_{2} sgn (e_{T} (τ))] d τ - u_{a n}

for $i = 1, \dots, n - 1$ where

u_{a n} = - k_{v} \sum_{j \in N_{n} (E^{*})} {\tilde{q}}_{n j} z_{n j}

whereas the control for the leader is given by $(2.56)$ . As one can see, each follower control input depends on its relative position to neighboring agents, the target interception error, and the formation acquisition control term of the leader. Therefore, it is less decentralized than the formation acquisition and maneuvering controllers since now information needs to be wirelessly broadcast from the leader to the followers.
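The interception part of the design can be illustrated in isolation. The sketch below simulates only the leader, using $(2.44)$ , $(2.45)$ , and $(2.56)$ with the single-integrator model; the formation term $u_{a}$ and the followers are omitted. The scalar target trajectory, the gains, and the function names are hypothetical. For this trajectory, $| {\dot{v}}_{T} | ⩽ 0.5$ and $| {\ddot{v}}_{T} | ⩽ 0.5$ , so with $k_{1} = 2$ condition $(2.47)$ requires $k_{2} > 0.75$ .

```python
import math


def sgn(x):
    """Signum function (2.37)."""
    return (x > 0) - (x < 0)


def target_position(t):
    """Hypothetical scalar target trajectory q_T(t)."""
    return t - 0.5 * math.cos(t)


def target_velocity(t):
    """v_T(t) = dq_T/dt; used only to score the estimate, never by the controller."""
    return 1.0 + 0.5 * math.sin(t)


def simulate_leader_interception(k1=2.0, k2=1.5, dt=2e-4, t_end=15.0):
    """Forward-Euler simulation of the leader under (2.56) with estimator (2.45).
    Returns the final interception error and velocity estimation error."""
    q_n = 2.0      # leader position
    v_hat = 0.0    # the integral in (2.45) starts at zero
    steps = int(round(t_end / dt))
    for k in range(steps):
        e_T = target_position(k * dt) - q_n        # interception error (2.44)
        u_n = (k1 + 1.0) * e_T + v_hat             # leader control (2.56)
        v_hat += dt * (k1 * e_T + k2 * sgn(e_T))   # integrand of (2.45)
        q_n += dt * u_n                            # single-integrator motion
    t = steps * dt
    return target_position(t) - q_n, target_velocity(t) - v_hat


e_final, v_err_final = simulate_leader_interception()
print(e_final, v_err_final)  # both should be small after the transient
```

The controller only ever uses the measured relative position $e_{T}$ . The residual velocity error is due to chattering of the discretized signum term and shrinks with the step size.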

Finally, note that the target interception error $(2.44)$ could be redefined to include a constant offset so that the leader does not collide with the target, i.e., $e_{T} = q_{T} - q_{n} - c$ where $c \in R^{m}$ is a constant vector.

2.5 Dynamic Formation Acquisition

So far, we have only considered formation acquisition when the desired formation $F^{*}$ is static. In certain applications it may be necessary that the formation size and/or geometric shape change in time, such as to avoid obstacles, dynamically adapt to a change of mission, or adapt to limits in communication range and bandwidth. Thus, we consider now the problem of dynamic formation acquisition in the sense that the desired formation is a function of time, $F^{*} (t)$ . In control systems jargon, we will deal here with the more general tracking problem instead of the simpler setpoint problem.

Note that dynamic formation acquisition is independent of what we call formation maneuvering. In the former, the time-varying nature is related to the formation itself, whereas in the latter, the formation (whether static or dynamic) maneuvers as a virtual rigid body. The formal statement of the dynamic formation acquisition problem is as follows.

Problem 4 (Dynamic Formation Acquisition)

Let the desired formation be represented by a dynamic, infinitesimally and minimally rigid framework $F^{*} (t) = (G^{*}, q^{*} (t))$ ^[6] where the time-varying desired distance between agents $i$ and $j$ is given by

\begin{matrix} (2.65) & d_{i j} (t) = ‖ q_{i}^{*} (t) - q_{j}^{*} (t) ‖ > 0, i, j \in V^{*} . \end{matrix}

We assume the desired distances are sufficiently smooth functions of time^[7]. The control objective is to design $u_{i}$ such that

\begin{matrix} (2.66) & F (t) - Iso (F^{*} (t)) \to 0 as t \to \infty, \end{matrix}

or equivalently

\begin{matrix} (2.67) & e_{i j} (t) \to 0 as t \to \infty, i, j \in V^{*} . \end{matrix}

Because of the time-varying nature of $(2.65)$ , the distance error dynamics are now given by

\begin{matrix} (2.68) & {\dot{e}}_{i j} = \frac{{\tilde{q}}_{i j}^{⊤} (u_{i} - u_{j})}{e_{i j} + d_{i j}} - {\dot{d}}_{i j}, \end{matrix}

where $(2.6)$ and $(2.1)$ were used. As a result, the derivative of $(2.10)$ along $(2.68)$ becomes

\begin{matrix} (2.69) & \dot{W} = \sum_{(i, j) \in E^{*}} e_{i j} (e_{i j} + 2 d_{i j}) [{\tilde{q}}_{i j}^{⊤} (u_{i} - u_{j}) - d_{i j} {\dot{d}}_{i j}] = z^{⊤} (R_{D} (\tilde{q}) u - d_{v}) \end{matrix}

where

\begin{matrix} (2.70) & d_{v} = [\dots, d_{i j} {\dot{d}}_{i j}, \dots] \in R^{l}, (i, j) \in E^{*} \end{matrix}

with elements ordered as $r_{G}$ . We assume $d_{i j}$ is a continuously differentiable function of time and $d_{i j} (t), {\dot{d}}_{i j} (t) \in L_{\infty}$ . The presence of the extra term, $d_{v}$ , in the derivative of the Lyapunov function candidate will dictate a different control structure.

Theorem 2.5

Consider the formation $F (t) = (G^{*}, q (t))$ with the initial conditions given in Theorem 2.1. The control law

\begin{matrix} (2.71) & u = R_{D}^{†} (\tilde{q}) (- k_{v} z + d_{v}) \end{matrix}

where $R_{D}^{†} (\tilde{q}) = R_{D}^{⊤} (\tilde{q}) {[R_{D} (\tilde{q}) R_{D}^{⊤} (\tilde{q})]}^{- 1}$ is the Moore-Penrose pseudoinverse, yields $e = 0$ exponentially stable and guarantees that $(2.66)$ is satisfied.

The proof of this theorem is nearly identical to the proof of Theorem 2.1 so the details are omitted. The main difference is that, since $R_{D} (\tilde{q})$ has full row rank for $e (t) \in Ω_{1}$ , then $R_{D} (\tilde{q}) R_{D}^{†} (\tilde{q}) = I$ for $e (t) \in Ω_{1}$ . Therefore, substituting $(2.71)$ into $(2.69)$ yields

\begin{matrix} (2.72) & \dot{W} = - k_{v} z^{⊤} z = - 4 k_{v} W for e (t) \in Ω_{1} . \end{matrix}

From this point on, the proof of Theorem 2.1 can be directly followed to show that $(2.66)$ holds for $e (0) \in Ω_{1} \cap Ω_{2}$ .

A fundamental difference exists in the implementation of (2.71) in comparison to the previous controllers of this chapter. Namely, the matrix $R_{D}^{†} (\tilde{q})$ couples the variables such that $u_{i} = u_{i} ({\tilde{q}}_{i j}, d_{i j}, {\dot{d}}_{i j}), i = 1, \dots, n$ and $(i, j) \in E^{*}$ . That is, unlike in the previous cases where $j \in N_{i} (E^{*})$ for the $i^{th}$ input, here each input is dependent on all $(i, j) \in E^{*}$ variables.
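For the smallest possible formation, the pseudoinverse in $(2.71)$ can be written out by hand. The sketch below assumes two agents in the plane connected by a single edge, so that $R_{D} (\tilde{q}) = [{\tilde{q}}_{12}^{⊤}, - {\tilde{q}}_{12}^{⊤}]$ and $R_{D} R_{D}^{⊤} = 2 {‖ {\tilde{q}}_{12} ‖}^{2}$ is a scalar. The desired distance $d_{12} (t) = 2 + 0.5 \sin t$ , the gain, and the initial positions are hypothetical.

```python
import math


def desired_distance(t):
    """Hypothetical time-varying desired distance d_12(t)."""
    return 2.0 + 0.5 * math.sin(t)


def desired_distance_rate(t):
    """Time derivative of desired_distance."""
    return 0.5 * math.cos(t)


def simulate_dynamic_pair(k_v=1.0, dt=1e-3, t_end=10.0):
    """Forward-Euler simulation of (2.71) for two planar agents and one edge.
    Returns the final distance error ||q_1 - q_2|| - d_12."""
    q1, q2 = [0.0, 0.0], [1.0, 0.5]
    steps = int(round(t_end / dt))
    for k in range(steps):
        t = k * dt
        qt = [q1[0] - q2[0], q1[1] - q2[1]]          # q~_12 = q_1 - q_2
        norm_sq = qt[0] ** 2 + qt[1] ** 2
        d = desired_distance(t)
        z = norm_sq - d ** 2
        d_v = d * desired_distance_rate(t)           # single entry of (2.70)
        # R_D^+ = R_D^T / (2 ||q~||^2), so u = R_D^+ (-k_v z + d_v).
        gain = (-k_v * z + d_v) / (2.0 * norm_sq)
        u1 = [gain * qt[0], gain * qt[1]]
        u2 = [-gain * qt[0], -gain * qt[1]]
        q1 = [q1[0] + dt * u1[0], q1[1] + dt * u1[1]]
        q2 = [q2[0] + dt * u2[0], q2[1] + dt * u2[1]]
    dist = math.hypot(q1[0] - q2[0], q1[1] - q2[1])
    return dist - desired_distance(steps * dt)


print(simulate_dynamic_pair())  # close to zero while d_12(t) keeps changing
```

In this special case each input depends only on the single shared edge. For larger formations the inverse of $R_{D} R_{D}^{⊤}$ couples all edges, which is the loss of decentralization noted above.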

Formation maneuvering can be performed on top of dynamic formation acquisition by modifying $(2.71)$ to

\begin{matrix} (2.73) & u = R_{D}^{†} (\tilde{q}) (- k_{v} z + d_{v}) + v_{d} \end{matrix}

where $v_{d}$ was defined in $(2.24)$ . It is straightforward to show that $(2.73)$ ensures that $(1.4)$ holds by following the proof of Theorem 2.2.

Notes

The directionality of the information exchange among agents is an important design factor. This issue is of practical importance since it relates to the number of communication, sensing, and/or control channels of the multi-agent system.

In the case of bidirectional information exchange, a pair of agents concurrently controls the distance between them, whereas only one agent in the pair is responsible for this task in the unidirectional case. In terms of graph theory, bidirectional (resp., unidirectional) formation controllers are based on undirected (resp., directed) graphs. Undirected formation controllers have built-in redundancy, providing robustness. However, this redundancy can also lead to instability in the formation acquisition if agent pairs use slightly different values for the distance between them due to measurement errors. It was shown that this measurement mismatch causes a distortion of the formation from its desired shape and a circular (resp., helical) orbit of the distorted formation in 2D (resp., 3D).

One possible remedy for this problem is to have the agents communicate their respective measurements to one another and then use a common value for control (e.g., the average of the two measurements).

Yet another solution is to use a directed graph-based controller since it reduces the overall number of communication/sensing/control channels while avoiding the potential conflict between a pair of agents trying to achieve the same objective. However, in directed graphs it is possible to have cycles in the pathways, which are more challenging to control and can lead to formation instability. Therefore, the issue of cyclic versus acyclic graphs is an important consideration for directed formation control.

3 Double-Integrator Model [1]

In this section, we revisit the class of formation controllers presented in Chapter 2 in the context of a slightly more refined model, viz., the double-integrator model. We will follow the same format as the previous section for ease of correlation.

The double-integrator model accounts for the agent acceleration by treating the agent as a point mass. Therefore, it can be considered a very simple dynamic model for omnidirectional robots. Given a system of $n$ agents, the equations of motion for the double-integrator model are

\begin{aligned} (3.1a) & {\dot{q}}_{i} = v_{i} \\ (3.1b) & {\dot{v}}_{i} = u_{i}, i = 1, \dots, n \end{aligned}

where $v_{i} \in R^{m}$ represents the velocity of the $i$ th agent with respect to an Earth-fixed coordinate frame, $u_{i} \in R^{m}$ is the acceleration-level control input, and $q_{i}$ is defined as in $(2.1)$ . Since the agent velocity is now a system state rather than the control input, the formation control laws in this section will be a function of the agent velocities in addition to the positions.

Note that the system transfer function matrix is now $G_{i} (s) = (1 / s^{2}) I_{m}$ , which gives rise to the model name. Since the only difference between this transfer function and $(2.2)$ is an additional integrator, the extension of the single-integrator-based control laws to $(3.1)$ is rather seamless if one exploits the integrator backstepping methodology (see Appendix A).

Double-Integrator Model for Formation Control

As in Section 2.1, we begin by deriving the distance error dynamics. To this end, we use $(2.6)$ and $(3.1 a)$ to obtain

\begin{matrix} (3.2) & {\dot{e}}_{i j} = \frac{{\tilde{q}}_{i j}^{⊤} (v_{i} - v_{j})}{e_{i j} + d_{i j}} \end{matrix}

Differentiating $(2.10)$ along $(3.2)$ gives

\begin{matrix} (3.3) & \dot{W} = \frac{1}{2} z^{⊤} \dot{z} = z^{⊤} R_{D} (\tilde{q}) v \end{matrix}

where $v = [v_{1}, \dots, v_{n}] \in R^{m n}$ .

Given that $v$ in $(3.3)$ cannot be directly prescribed since it is a system state, we follow the backstepping technique and introduce the following variable

\begin{matrix} (3.4) & s = v - v_{f} \end{matrix}

where $v_{f} \in R^{m n}$ denotes the fictitious (or desired) velocity input, which will be specified later. The variable $s$ quantifies the error between the actual agent velocity and the desired velocity-level input. The design of $v_{f}$ will be problem-specific, and will come from the velocity-level control laws of Chapter 2. That is, generally speaking, $v_{f} = u^{S I}$ where the superscript $S I$ stands for one of the control input designs for the single-integrator model. The block diagrams in Figure 3.1 illustrate the relationship between the control designs for the single- and double-integrator models. As one can see, the velocity-level, position control algorithms from Chapter 2 will be embedded in the acceleration-level, velocity control loop to be designed in this chapter.

Figure 3.1: Relationship between the (a) single- and (b) double-integrator control designs.

Due to the new error variable $(3.4)$ , we introduce the augmented Lyapunov function candidate

\begin{matrix} (3.5) & W_{d} (e, s) = W (e) + \frac{1}{2} s^{⊤} s \end{matrix}

where $W$ was defined in $(2.10)$ . Notice that $W$ is a potential energy-like term since it is only position dependent, whereas $\frac{1}{2} s^{⊤} s$ is a kinetic energy-like term due to its dependence on velocity. Therefore, $W_{d}$ captures the total energy of the double-integrator model formation.

After taking the time derivative of $(3.5)$ , we obtain

\begin{aligned} {\dot{W}}_{d} & = z^{⊤} R_{D} (\tilde{q}) v + s^{⊤} \dot{s} \\ & = z^{⊤} R_{D} (\tilde{q}) (s + v_{f}) + s^{⊤} (u - {\dot{v}}_{f}) \\ (3.6) & = z^{⊤} R_{D} (\tilde{q}) v_{f} + s^{⊤} (u + R_{D}^{⊤} (\tilde{q}) z - {\dot{v}}_{f}) \end{aligned}

where $(3.3)$ , $(3.1 b)$ , and $(3.4)$ were used. Equation $(3.6)$ is the analogue of $(2.12)$ since it will be the starting point for all double-integrator control designs as $(2.12)$ was for the single-integrator designs.

3.1 Cross-Edge Energy

Before presenting the formation controllers, we need to discuss a complication in the stability analysis of the closed-loop system that arises from the double-integrator model. Specifically, this complication is related to the avoidance of flip ambiguities.

Recall that for the single-integrator model, the position of the initial formation needs to be restricted to prevent convergence to a flip ambiguity since the velocity-level control input is designed to promote convergence to $Iso (F^{*})$ or $Amb (F^{*})$ , whichever is closer at $t = 0$ . Unfortunately, this condition is not sufficient for the double-integrator model. In this case, the agents' velocity will also affect the convergence since it is a system state. This idea is conceptually illustrated by Figure 3.2. Note that even if the formation position is closer to $Iso (F^{*})$ , the formation will overcome the energy barrier and converge to $Amb (F^{*})$ if its velocity is large enough. In other words, the total formation energy is now affected by the combination of potential energy and kinetic energy. The implication of this for stability is that a restriction also needs to be imposed on the initial velocity of the formation, which means that we need to limit the initial total energy of the formation.

While the need for an upper bound on the initial energy of the formation is evident, its precise value is difficult to calculate in general. For simple formations, one may be able to calculate a conservative value for the energy upper bound as illustrated next. Consider the desired triangular formation in Figure 3.3 along with one of its flipped versions. Note that a flip may occur whenever an agent has enough energy to cross the edge connecting the two other agents, e.g., agent 1 crossing edge $(2, 3)$ . Once the agent crosses the edge, it is closer to $Amb (F^{*})$ and may be attracted to this undesired equilibrium.

Figure 3.3: Desired formation (solid line) and a flip ambiguity (dashed line).

The question is then: What is the minimum energy needed for this to happen? Hereafter, we refer to this minimum energy as the cross-edge energy, $E_{c}$ .

A conservative estimate for the cross-edge energy can be made by using the following observations:

The cross-edge energy is related to the energy that drives the agents to a collinear formation
The minimum collinearity energy is given by the agent with the smallest distance to its cross-edge, e.g., the dotted line in Figure 3.3.

These rules facilitate the cross-edge energy estimation because they are only position dependent. Furthermore, we have from $(3.5)$ and $(2.10)$ that $W_{d} ⩾ W = \frac{1}{4} z^{⊤} z$ , which is also only position dependent. That is, a sufficient condition for $E_{c}$ can be determined by calculating the minimum value of $W$ when the three agents are collinear. For example, let $d_{12} = d_{13} = \sqrt{2}$ and $d_{23} = 2$ . When agent 1 is collinear with agents 2 and 3 , we have that $‖ {\tilde{q}}_{12} ‖ + ‖ {\tilde{q}}_{13} ‖ = ‖ {\tilde{q}}_{23} ‖$ . For notational convenience, we use $q \in C$ where $q = [q_{1}, q_{2}, q_{3}]$ to denote that the agents are collinear. Therefore,

\begin{aligned} E_{c} & = \min_{q \in C} W = \min_{q \in C} \frac{1}{4} (z_{12}^{2} + z_{13}^{2} + z_{23}^{2}) \\ & = \min_{q \in C} \frac{1}{4} [{({‖ {\tilde{q}}_{12} ‖}^{2} - d_{12}^{2})}^{2} + {({‖ {\tilde{q}}_{13} ‖}^{2} - d_{13}^{2})}^{2} + {({‖ {\tilde{q}}_{23} ‖}^{2} - d_{23}^{2})}^{2}] \\ & = \min \frac{1}{4} [{({‖ {\tilde{q}}_{12} ‖}^{2} - 2)}^{2} + {({(‖ {\tilde{q}}_{23} ‖ - ‖ {\tilde{q}}_{12} ‖)}^{2} - 2)}^{2} + {({‖ {\tilde{q}}_{23} ‖}^{2} - 4)}^{2}] . \end{aligned}

It can be found that the above function reaches a minimum at $‖ {\tilde{q}}_{12} ‖ = ‖ {\tilde{q}}_{23} ‖ / 2 = \sqrt{10} / 3$ and $E_{c} = 0.444$ . This means that if $W_{d} (0) ⩽ E_{c}$ , the agents will not converge to the flip ambiguity.
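The minimization can be reproduced with a brute-force grid search. The sketch below evaluates the last expression for $W$ over $a = ‖ {\tilde{q}}_{12} ‖$ and $b = ‖ {\tilde{q}}_{23} ‖$ with $0 ⩽ a ⩽ b$ , which corresponds to agent 1 lying between agents 2 and 3. The grid resolution, the search range, and the function names are arbitrary illustrative choices.

```python
def collinear_energy(a, b):
    """W when agent 1 lies on edge (2, 3): a = ||q~_12||, b = ||q~_23||,
    and ||q~_13|| = b - a, with d_12 = d_13 = sqrt(2) and d_23 = 2."""
    return 0.25 * ((a ** 2 - 2.0) ** 2
                   + ((b - a) ** 2 - 2.0) ** 2
                   + (b ** 2 - 4.0) ** 2)


def cross_edge_energy(step=0.005, b_max=4.0):
    """Grid search for the minimum of W over 0 <= a <= b <= b_max.
    Returns (E_c, a, b) at the best grid point."""
    best = (float("inf"), 0.0, 0.0)
    n_b = int(round(b_max / step))
    for ib in range(1, n_b + 1):
        b = ib * step
        for ia in range(ib + 1):
            a = ia * step
            w = collinear_energy(a, b)
            if w < best[0]:
                best = (w, a, b)
    return best


E_c, a_min, b_min = cross_edge_energy()
print(E_c, a_min, b_min)
```

The search confirms the closed-form result: substituting $‖ {\tilde{q}}_{12} ‖ = ‖ {\tilde{q}}_{23} ‖ / 2$ and $x = {‖ {\tilde{q}}_{12} ‖}^{2}$ gives $W = \frac{1}{4} [2 {(x - 2)}^{2} + 16 {(x - 1)}^{2}]$ , which is minimized at $x = 10 / 9$ with value $4 / 9 \approx 0.444$ .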

Notice that the condition $W_{d} (0) ⩽ E_{c}$ imposes a trade-off between the initial distance error and the initial velocity error. The larger the initial distance error, the smaller the initial velocity error needs to be, and vice versa. Based on $(3.4)$ , a small $s$ implies that the agents' velocities are close to $v_{f}$ , which is the desired velocity that ensures convergence to $Iso (F^{*})$ .

For formations with $n > 3$ , one may apply the above estimation method by triangulating the framework and comparing the cross-edge energy of each triangle to estimate $E_{c}$ . For example, consider the infinitesimally rigid framework in Figure 3.4. The agents most likely to flip are agents 2 and 6 about cross-edges $(1, 3)$ and $(1, 5)$ , respectively, since they only have two edges (constraints) each. Thus, $E_{c} = \min \{E_{c 2}, E_{c 6}\}$ where $E_{c i}$ denotes the cross-edge energy of agent $i$ . Note that higher order flips are also possible, but they would require more energy than the aforementioned single-agent flips. For example, agents $\{2, 3\}$ or $\{5, 6\}$ could also simultaneously flip about cross-edge $(1, 4)$ , or agents $\{2, 3, 4, 5, 6\}$ could simultaneously flip about agent 1 , leading to a full reflection of the formation.

Figure 3.4: Triangulated hexagon framework.

3.2 Formation Acquisition

The formation acquisition controller for $(3.1)$ will have the general form $u_{i} = u_{i} (q_{i} - q_{j}, v_{i} - v_{j}, v_{i}, d_{i j}), i = 1, \dots, n$ and $j \in N_{i} (E^{*})$ . Based on $(3.6)$ , the following theorem introduces the control law that solves the formation acquisition problem.

Theorem 3.1

Given the formation $F (t) = (G^{*}, q (t))$ , let the initial conditions be such that $(e (0), s (0)) \in Ω_{1} \cap Ω_{2} \cap Ω_{3}$ where $Ω_{1}$ and $Ω_{2}$ were defined in $(2.14)$ ,

\begin{matrix} (3.7) & Ω_{3} = \{e \in R^{l}, s \in R^{m n} ∣ W_{d} ⩽ E_{c}\}, \end{matrix}

and $E_{c}$ is the total cross-edge energy of the formation. Then, the control

\begin{matrix} (3.8) & u = - k_{a} s + {\dot{v}}_{f} - R_{D}^{⊤} (\tilde{q}) z, \end{matrix}

where

\begin{matrix} (3.9) & v_{f} = u_{a}, \end{matrix}

$u_{a}$ was defined in $(2.15)$ , and $k_{a} > 0$ is a user-defined control gain, renders $(e, s) = 0$ exponentially stable and ensures that $(1.2)$ is satisfied.

Proof:

Substituting $(3.8)$ and $(3.9)$ into $(3.6)$ yields

\begin{matrix} (3.10) & {\dot{W}}_{d} = - k_{v} z^{⊤} R_{D} (\tilde{q}) R_{D}^{⊤} (\tilde{q}) z - k_{a} s^{⊤} s . \end{matrix}

Following the arguments used in the proof of Theorem 2.1, we have that

\begin{aligned} {\dot{W}}_{d} & ⩽ - k_{v} λ_{min} (R_{D} R_{D}^{⊤}) z^{⊤} z - k_{a} s^{⊤} s \\ (3.11) & ⩽ - \min \{2 k_{a}, 4 k_{v} λ_{min} (R_{D} R_{D}^{⊤})\} W_{d} \end{aligned}

for $e (0) \in Ω_{1}$ . From $(3.5)$ and $(3.11)$ , we know from Corollary 1.1 that $(e, s) = 0$ is exponentially stable for $e (0) \in Ω_{1}$ , and therefore, $F (t) \to Iso (F^{*})$ or $F (t) \to Amb (F^{*})$ as $t \to \infty$ for $e (0) \in Ω_{1}$ . Now, if the initial conditions are chosen such that $(e (0), s (0)) \in Ω_{1} \cap Ω_{2} \cap Ω_{3}$ , we know that the formation starts closer to $Iso (F^{*})$ than $Amb (F^{*})$ and $W_{d} (e (0), s (0)) ⩽ E_{c}$ . Since $(3.11)$ makes $W_{d}$ strictly decreasing away from the origin, we know that $W_{d} (e (t), s (t)) < E_{c}$ for all $t > 0$ , indicating that the formation energy is always less than the minimum energy required for a flip to occur. Thus, we have that $(1.2)$ holds for $(e (0), s (0)) \in Ω_{1} \cap Ω_{2} \cap Ω_{3}$ . Q.E.D.

The expression for ${\dot{v}}_{f}$ in $(3.8)$ is given by

\begin{matrix} (3.12) & {\dot{v}}_{f} = - k_{v} {\dot{R}}_{D}^{⊤} z - k_{v} R_{D}^{⊤} \dot{z} \end{matrix}

where from $(6)$ in Distance

\begin{matrix} (3.13) & {\dot{R}}_{D} (\tilde{q}) = R_{D} (\tilde{v}), \end{matrix}

$\tilde{v} = [\dots, v_{i} - v_{j}, \dots] \in R^{l}, (i, j) \in E^{*}$ , and from $(3.3)$

\begin{matrix} (3.14) & \dot{z} = 2 R_{D} (\tilde{q}) v . \end{matrix}

The control $(3.8)$ - $(3.9)$ can be written element-wise as

\begin{matrix} (3.15) & u_{i} = - k_{a} v_{i} - \sum_{j \in N_{i} (E^{*})} [(k_{a} k_{v} + 1) {\tilde{q}}_{i j} z_{i j} + k_{v} (z_{i j} I_{2} + 2 {\tilde{q}}_{i j} {\tilde{q}}_{i j}^{⊤}) {\tilde{v}}_{i j}] \end{matrix}

for $i = 1, \dots, n$ and

\begin{matrix} (3.16) & {\tilde{v}}_{i j} = v_{i} - v_{j}, (i, j) \in E^{*} . \end{matrix}

This control is decentralized since its implementation only requires each agent to measure its own velocity and the relative position and relative velocity to neighboring agents. The agent's velocity can be measured using onboard sensors such as an odometer and a compass.
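The element-wise form $(3.15)$ can be checked against the stacked form $(3.8)$ - $(3.9)$ with a few lines of code. The following is a minimal sketch, assuming planar agents, $s = v - v_{f}$ as in $(3.4)$, $u_{a} = - k_{v} R_{D}^{⊤} z$, and $z_{i j} = ∥ {\tilde{q}}_{i j} ∥^{2} - d_{i j}^{2}$; the function names, gains, and the three-agent triangle are hypothetical choices made only for illustration.

```python
import math

# Minimal 2-D vector helpers (tuples) so the sketch needs only the standard library.
def add(a, b): return (a[0] + b[0], a[1] + b[1])
def sub(a, b): return (a[0] - b[0], a[1] - b[1])
def scale(c, a): return (c * a[0], c * a[1])
def dot(a, b): return a[0] * b[0] + a[1] * b[1]

def edge_terms(i, j, q, v, d):
    """Relative position, relative velocity, and distance error z_ij for edge (i, j)."""
    qt = sub(q[i], q[j])
    vt = sub(v[i], v[j])
    z = dot(qt, qt) - d[frozenset((i, j))] ** 2
    return qt, vt, z

def control_elementwise(i, q, v, nbrs, d, ka, kv):
    """u_i computed directly from (3.15)."""
    u = scale(-ka, v[i])
    for j in nbrs[i]:
        qt, vt, z = edge_terms(i, j, q, v, d)
        # (z I_2 + 2 qt qt^T) vt expanded as z*vt + 2*(qt . vt)*qt
        damping = add(scale(z, vt), scale(2.0 * dot(qt, vt), qt))
        u = sub(u, add(scale((ka * kv + 1.0) * z, qt), scale(kv, damping)))
    return u

def control_stacked(i, q, v, nbrs, d, ka, kv):
    """i-th block of (3.8) with v_f = u_a = -kv R_D^T z, using (3.12)-(3.14)."""
    RTz = RdotTz = RTzdot = (0.0, 0.0)
    for j in nbrs[i]:
        qt, vt, z = edge_terms(i, j, q, v, d)
        zdot = 2.0 * dot(qt, vt)               # (3.14), entry for edge (i, j)
        RTz = add(RTz, scale(z, qt))           # block i of R_D^T z
        RdotTz = add(RdotTz, scale(z, vt))     # block i of R_D(v~)^T z, from (3.13)
        RTzdot = add(RTzdot, scale(zdot, qt))  # block i of R_D^T zdot
    vf = scale(-kv, RTz)
    vf_dot = scale(-kv, add(RdotTz, RTzdot))   # (3.12)
    s = sub(v[i], vf)                          # (3.4), assumed s = v - v_f
    return sub(add(scale(-ka, s), vf_dot), RTz)

# Hypothetical triangle: complete graph on three agents, all desired distances 1.
nbrs = {0: [1, 2], 1: [0, 2], 2: [0, 1]}
d = {frozenset(e): 1.0 for e in [(0, 1), (0, 2), (1, 2)]}
q = [(0.0, 0.0), (1.2, 0.1), (0.4, 0.9)]
v = [(0.1, -0.2), (0.0, 0.3), (-0.1, 0.05)]
for i in range(3):
    print(i, control_elementwise(i, q, v, nbrs, d, ka=2.0, kv=1.0))
```

Both functions only loop over the neighbors of agent $i$, which mirrors the decentralized structure of the control: no global positions are needed, only relative positions and velocities along edges of $E^{*}$.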

3.3 Formation Maneuvering

The formation maneuvering control law for the double-integrator model $(3.1 a)$ - $(3.1 b)$ is simply a combination of the designs in Sections 2.2 and 3.2. Specifically, $u$ is given by $(3.8)$ with

\begin{matrix} (3.17) & v_{f} = u_{a} + v_{d} \end{matrix}

where the formation maneuvering velocity $v_{d}$ was specified in $(2.24)$ . Note that $(3.17)$ is exactly the right-hand side of $(2.23)$ .

We will not present the formal statement and proof of this result, but only discuss the aspect in which it differs from the proofs of Theorems 2.2 and 3.1, namely the proof that $(1.4)$ holds. First, after substituting $(3.17)$ into $(3.6)$ , the proofs of the exponential stability of $(e, s) = 0$ and of $(1.2)$ are straightforward given that $R_{D} (\tilde{q}) v_{d} = 0$ (see $(1.20)$ and $(2.24)$ ). Now, since $e (t) \to 0$ as $t \to \infty$ , we know from $(2.9)$ that $z (t) \to 0$ as $t \to \infty$ . Since $R_{D} (\tilde{q})$ is bounded, $u_{a} (t) \to 0$ as $t \to \infty$ from $(2.15)$ . Therefore, we have that $v_{f} (t) - v_{d} (t) \to 0$ as $t \to \infty$ from $(3.17)$ . Since we know $s (t) \to 0$ as $t \to \infty$ , it follows from $(3.4)$ that $v (t) - v_{f} (t) \to 0$ as $t \to \infty$ . Therefore, $v_{i} (t) - v_{d i} (t) \to 0$ as $t \to \infty, i = 1, \dots, n$ , which is the same as $(1.4)$ due to $(3.1 a)$ .

The term ${\dot{v}}_{f}$ in $(3.8)$ will contain additional terms from the derivative of $v_{d}$ . Specifically, from $(2.24)$ , we have that

\begin{matrix} (3.18) & {\dot{v}}_{d i} = {\dot{v}}_{t} + \dot{ω} \times {\tilde{q}}_{i n} + ω \times {\tilde{v}}_{i n}, i = 1, \dots, n \end{matrix}

where ${\dot{v}}_{t} \in R^{3}$ denotes the desired translational acceleration and $\dot{ω} \in R^{3}$ is the desired angular acceleration for the virtual rigid body. Therefore, for the double-integrator model, $v_{t}$ and $ω$ need to be continuously differentiable functions of time with bounded first derivatives for the control input to be continuous and bounded. Note that, element-wise, the formation maneuvering control law consists of the sum of the right-hand sides of $(3.15)$ and $(3.18)$ plus the term $k_{a} v_{d i}$ , which comes from $- k_{a} s$ in $(3.8)$ since $s = v - v_{f}$ and $v_{f}$ now includes $v_{d}$ . Like $v_{t}$ and $ω$ , the signals ${\dot{v}}_{t}$ and $\dot{ω}$ can be stored on each agent's onboard computer since they are typically known a priori.
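As a sanity check on $(3.18)$, the analytic derivative can be compared with a finite-difference derivative of $v_{d i} = v_{t} + ω \times {\tilde{q}}_{i n}$, which is the form of $(2.24)$ implied by $(3.18)$. The sketch below is only illustrative: the signals `v_t`, `omega`, and `q_in` are made-up smooth functions of time, not trajectories from the text.

```python
import math

def cross(a, b):
    """Cross product of two 3-vectors."""
    return (a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0])

def add3(*vectors):
    """Component-wise sum of any number of 3-vectors."""
    return tuple(sum(c) for c in zip(*vectors))

# Hypothetical smooth signals (assumptions for illustration only).
def v_t(t): return (math.cos(t), math.sin(t), 0.0)          # translational velocity
def v_t_dot(t): return (-math.sin(t), math.cos(t), 0.0)     # its derivative
def omega(t): return (0.0, 0.0, 0.5 + 0.1 * t)              # angular velocity
def omega_dot(t): return (0.0, 0.0, 0.1)                    # its derivative
def q_in(t): return (1.0 + 0.2 * t, math.sin(t), 0.0)       # relative position q~_in
def v_in(t): return (0.2, math.cos(t), 0.0)                 # relative velocity v~_in

def v_d(t):
    """Desired maneuvering velocity v_di = v_t + omega x q~_in."""
    return add3(v_t(t), cross(omega(t), q_in(t)))

def v_d_dot(t):
    """Right-hand side of (3.18)."""
    return add3(v_t_dot(t),
                cross(omega_dot(t), q_in(t)),
                cross(omega(t), v_in(t)))

def central_difference(fun, t, h=1e-5):
    """Numerical derivative of a vector-valued function of time."""
    fp, fm = fun(t + h), fun(t - h)
    return tuple((a - b) / (2.0 * h) for a, b in zip(fp, fm))

t0 = 0.7
print(v_d_dot(t0))
print(central_difference(v_d, t0))
```

The two printed vectors should agree up to the finite-difference error, which reflects that $(3.18)$ is just the product rule applied to the cross product.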

3.4 Target Interception with Unknown Target Acceleration

Solving the target interception problem for the double-integrator model requires a more elaborate solution than the one presented in Section 2.4 for the single-integrator model. Here, we consider that the target position $q_{T} (t)$ is twice continuously differentiable and $q_{T} (t), {\dot{q}}_{T} (t), {\ddot{q}}_{T} (t) \in L_{\infty}$ . We also assume the signals $q_{T} - q_{n}, {\dot{q}}_{T} - {\dot{q}}_{n}, {\dot{q}}_{n}$ , and ${\dot{q}}_{T}$ are known and can be broadcast from the leader to the followers; however, the signal ${\ddot{q}}_{T}$ is unknown. A variable structure-type control term will be used to compensate for the unknown target acceleration. As a result, the right-hand side of the resulting error system dynamics will be discontinuous, requiring us to apply some ideas from Lyapunov stability of nonsmooth systems. As in Section 2.4, we let $v_{T} := {\dot{q}}_{T}$ to simplify the notation.

Theorem 3.2

Let the initial conditions be such that $(e (0), s (0)) \in Ω_{1} \cap Ω_{2} \cap Ω_{3}$ where $Ω_{1}$ and $Ω_{2}$ were defined in $(2.14)$ and $Ω_{3}$ was defined in $(3.7)$ . Consider the control

\begin{matrix} (3.19) & u = - k_{a} s + {\dot{u}}_{a} + 1_{n} \otimes k_{T} (v_{T} - v_{n}) - k_{s} sgn (s) - R_{D}^{⊤} (\tilde{q}) z \end{matrix}

where $s$ was defined in $(3.4)$ ,

\begin{aligned} (3.20) & v_{f} & = u_{a} + 1_{n} \otimes h, \\ (3.21) & h & = k_{T} e_{T} + v_{T}, \end{aligned}

$u_{a}$ was defined in $(2.15)$ , $e_{T}$ was defined in $(2.44)$ , $k_{s} ⩾ \sqrt{n} {‖ {\dot{v}}_{T} ‖}_{L_{\infty}}$ , and $k_{T} > 0$ . Then, $(3.19)$ renders $(e, s) = 0$ asymptotically stable and ensures that $(1.2)$ and $(1.5)$ are satisfied.

Proof:

First, notice that the differential equations describing the $(e, s)$ -error system dynamics in closed loop with $(3.19)$ - $(3.21)$ have a discontinuous right-hand side due to the term $sgn (s)$ in $(3.19)$ . That is, if $\dot{ξ} = f (ξ, t)$ denotes the closed-loop system where $ξ = [e, s]$ , then $f (ξ, t)$ is continuous everywhere except in the set $\{ (ξ, t) ∣ s = 0 \}$ . For such a system, a generalized solution exists by embedding the differential equations into the differential inclusion $\dot{ξ} \in K [f] (ξ, t)$ . In this case, the time derivative of $(3.5)$ is given by

\begin{aligned} {\dot{W}}_{d} & \overset{a.e.}{\in} \frac{\partial W_{d}}{\partial ξ} K [f] (ξ, t) \\ (3.22) & \subset z^{⊤} R_{D} (\tilde{q}) v_{f} + s^{⊤} (u + R_{D}^{⊤} (\tilde{q}) z - {\dot{v}}_{f}) \end{aligned}

where $(3.6)$ was used. Substituting $(2.15)$ , $(3.19)$ , $(3.20)$ , and $(3.21)$ into $(3.22)$ and then applying $R_{D} (p) (1_{n} \otimes x) = 0$ gives [42]

\begin{aligned} {\dot{W}}_{d} & \subset - k_{v} z^{⊤} R_{D} R_{D}^{⊤} z - k_{a} s^{⊤} s - s^{⊤} (k_{s} sgn (s) + 1_{n} \otimes {\dot{v}}_{T}) \\ & = - k_{v} z^{⊤} R_{D} R_{D}^{⊤} z - k_{a} s^{⊤} s - s^{⊤} (k_{s} SGN (s) + 1_{n} \otimes {\dot{v}}_{T}) \\ (3.23) & ⩽ - k_{v} z^{⊤} R_{D} R_{D}^{⊤} z - k_{a} s^{⊤} s + ∥ s ∥ (\sqrt{n} {‖ {\dot{v}}_{T} ‖}_{L_{\infty}} - k_{s}) \end{aligned}

where $sgn (\cdot)$ and $SGN (x)$ were defined in $(2.37)$ and $(2.42)$ , respectively.

For $k_{s} ⩾ \sqrt{n} {‖ {\dot{v}}_{T} ‖}_{L_{\infty}}$ , $(3.23)$ reduces to $(3.11)$ so ${\dot{W}}_{d}$ is negative definite for $e (0) \in Ω_{1}$ . Therefore, from Theorem 1.6, we know that $(z, s) = 0$ is asymptotically stable. Since $W_{d}$ is positive definite in $e$ , we know that $(e, s) = 0$ is asymptotically stable for $e (0) \in Ω_{1}$ . The proof of $(1.2)$ for $(e (0), s (0)) \in Ω_{1} \cap Ω_{2} \cap Ω_{3}$ now follows from the same arguments used in the proof of Theorem 3.1.
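The gain condition on $k_{s}$ rests on the bound $s^{⊤} (k_{s} sgn (s) + 1_{n} \otimes {\dot{v}}_{T}) ⩾ ∥ s ∥ (k_{s} - \sqrt{n} ∥ {\dot{v}}_{T} ∥)$, which follows from $s^{⊤} sgn (s) = {∥ s ∥}_{1} ⩾ ∥ s ∥$ and the Cauchy-Schwarz inequality. The following sketch spot-checks that bound on random samples. It assumes that $sgn (\cdot)$ acts element-wise and that the agents are planar ($m = 2$); the function names and the sampling ranges are illustrative choices.

```python
import math
import random

def sgn(x):
    """Scalar signum: -1, 0, or 1."""
    return (x > 0) - (x < 0)

def switching_term(s, vT_dot, ks):
    """s^T (ks*sgn(s) + 1_n (x) vT_dot), with s stacked as n blocks of length m."""
    m = len(vT_dot)
    return sum(si * (ks * sgn(si) + vT_dot[k % m]) for k, si in enumerate(s))

def lower_bound(s, vT_dot, ks):
    """||s|| (ks - sqrt(n) ||vT_dot||), the bound behind the last line of (3.23)."""
    n = len(s) // len(vT_dot)
    norm_s = math.sqrt(sum(si * si for si in s))
    norm_a = math.sqrt(sum(a * a for a in vT_dot))
    return norm_s * (ks - math.sqrt(n) * norm_a)

def worst_margin(n=4, a_max=1.5, trials=2000, seed=0):
    """Smallest value of switching_term - lower_bound over random planar samples."""
    rng = random.Random(seed)
    ks = math.sqrt(n) * a_max          # smallest gain allowed by the theorem
    worst = float("inf")
    for _ in range(trials):
        s = [rng.uniform(-3.0, 3.0) for _ in range(2 * n)]
        angle = rng.uniform(0.0, 2.0 * math.pi)
        mag = rng.uniform(0.0, a_max)  # target acceleration with norm <= a_max
        a = (mag * math.cos(angle), mag * math.sin(angle))
        worst = min(worst, switching_term(s, a, ks) - lower_bound(s, a, ks))
    return worst

print(worst_margin())
```

A nonnegative result (up to rounding) is consistent with the claim that the variable structure term dominates the unknown target acceleration whenever $k_{s} ⩾ \sqrt{n} {‖ {\dot{v}}_{T} ‖}_{L_{\infty}}$.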

Next, from $(3.20)$ , we have

\begin{matrix} (3.24) & v_{f n} = u_{a n} + v_{T} + k_{T} e_{T} \end{matrix}

where the subscript $n$ denotes the $n$ th element of the corresponding vector. Differentiating $(2.44)$ and applying $(3.24)$ yields

\begin{aligned} {\dot{e}}_{T} & = v_{T} - v_{n} = v_{T} - (v_{f n} + s_{n}) \\ (3.25) & = - k_{T} e_{T} + r \end{aligned}

where $r := - s_{n} - u_{a n}$ . Since $(3.25)$ is a stable linear system with input $r$ and output $e_{T}$ , the output will converge to zero if the input converges to zero (see Theorem 1.1). Given that $(z, s) = 0$ is asymptotically stable, we know that $(s (t), z (t)) \to 0$ as $t \to \infty$ and therefore $u_{a} (t) \to 0$ as $t \to \infty$ . As a result, $r (t) \to 0$ as $t \to \infty$ and, from $(3.25)$ , $e_{T} (t) \to 0$ as $t \to \infty$ . Finally, since $(1.2)$ implies that $q_{n} (t) \in conv \{ q_{1} (t), q_{2} (t), \dots, q_{n - 1} (t) \}$ as $t \to \infty$ due to the manner in which $F^{*}$ is constructed for the target interception problem, we conclude from the convergence of $e_{T}$ to zero that $(1.5)$ holds. Q.E.D.

◻
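The last step of the proof uses the fact that a stable linear system driven by a vanishing input has a vanishing output. A scalar instance of $(3.25)$ makes this concrete. In the sketch below, the gain is written `kT` following $(3.21)$, and the input `r_decay` is a hypothetical decaying signal standing in for $r = - s_{n} - u_{a n}$; forward-Euler integration is used only for simplicity.

```python
import math

def simulate_eT(e0, kT, r, t_end, dt=1e-3):
    """Forward-Euler integration of e_T' = -kT*e_T + r(t), one component of (3.25)."""
    e, t = e0, 0.0
    for _ in range(int(round(t_end / dt))):
        e += dt * (-kT * e + r(t))
        t += dt
    return e

def r_decay(t):
    """Hypothetical vanishing input standing in for r = -s_n - u_an."""
    return 0.5 * math.exp(-0.5 * t)

def exact_eT(e0, kT, t):
    """Closed-form solution for the input r_decay (requires kT != 0.5)."""
    return (e0 * math.exp(-kT * t)
            + 0.5 * (math.exp(-0.5 * t) - math.exp(-kT * t)) / (kT - 0.5))

for t_end in (1.0, 5.0, 20.0):
    print(t_end, simulate_eT(1.0, 2.0, r_decay, t_end), exact_eT(1.0, 2.0, t_end))
```

In this example the error decays at the slower of the two rates, the one set by the input, which matches the role of $r$ in the argument above: $e_{T}$ can converge no faster than $s_{n}$ and $u_{a n}$ do.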

A few observations are in order concerning the structure of $(3.19)$ - $(3.21)$ .

${\dot{v}}_{f}$ is not included in $(3.19)$ as it is in $(3.8)$ because the derivative of $(3.20)$ is a function of the unknown signal ${\dot{v}}_{T}$ . Hence, only the measurable terms of ${\dot{v}}_{f}$ appear in $(3.19)$ . Since ${\dot{v}}_{T}$ cannot be directly cancelled by the control, it is instead dominated by the variable structure term $k_{s} sgn (s)$ as shown in $(3.23)$ .
Comparing $(2.54)$ and $(3.21)$ , notice the absence of the term $- u_{a n}$ in the latter. Unlike the control in Theorem 2.4, the presence of this term in $(3.21)$ is not necessary for proving the convergence of $e_{T}$ to zero. If $- u_{a n}$ were included in $(3.21)$ , the above stability analysis would still hold with the exception that the auxiliary variable $r$ in $(3.25)$ would become simply $r = - s_{n}$ .

When expressed element-wise, the control $(3.19)$ - $(3.21)$ takes the form

\begin{aligned} u_{i} = & - k_{a} v_{i} + k_{a} (v_{T} + k_{T} e_{T}) + k_{T} (v_{T} - v_{n}) \\ & - \sum_{j \in N_{i} (E^{*})} [k_{v} (z_{i j} I_{2} + 2 {\tilde{q}}_{i j} {\tilde{q}}_{i j}^{⊤}) {\tilde{v}}_{i j} + (k_{a} k_{v} + 1) {\tilde{q}}_{i j} z_{i j}] \\ & - k_{s} sgn (v_{i} - v_{T} - k_{T} e_{T} + k_{v} \sum_{j \in N_{i} (E^{*})} {\tilde{q}}_{i j} z_{i j}) \end{aligned}

As one can see, the $i^{th}$ agent's control input depends on its own velocity, the relative positions and velocities of its neighboring agents, and the signals $e_{T}$ , $v_{T}$ , and $v_{n}$ .

3.5 Dynamic Formation Acquisition

When solving the dynamic formation acquisition problem (see Problem 4 in Section 2.5) for the double-integrator model, we require that the time-varying distance $d_{i j} (t)$ be twice continuously differentiable and $d_{i j} (t), {\dot{d}}_{i j} (t), {\ddot{d}}_{i j} (t) \in L_{\infty}$ for the control law to be continuous and bounded.

Similar to the formation maneuvering control law of this chapter, the dynamic formation acquisition control input will take the form of $(3.8)$ but with the problem-specific design for $v_{f}$ . That is, $v_{f}$ is set to the right-hand side of $(2.71)$ for dynamic formation acquisition.

The term ${\dot{v}}_{f}$ in $(3.8)$ can be explicitly calculated from $(2.71)$ as follows

\begin{matrix} (3.26) & {\dot{v}}_{f} = {\dot{R}}^{†} (\tilde{q}) (- k_{v} z + d_{v}) + R^{†} (\tilde{q}) (- k_{v} \dot{z} + {\dot{d}}_{v}) \end{matrix}

where $d_{v}$ was defined in $(2.70)$ ,

\begin{aligned} {\dot{d}}_{v} & = [\dots, {\dot{d}}_{i j}^{2} + d_{i j} {\ddot{d}}_{i j}, \dots], (i, j) \in E^{*}, \\ \dot{z} & = 2 (R_{D} (\tilde{q}) v - d_{v}), \\ {\dot{R}}^{†} & = {\dot{R}}_{D}^{⊤} {(R_{D} R_{D}^{⊤})}^{- 1} - R_{D}^{⊤} {(R_{D} R_{D}^{⊤})}^{- 1} \frac{d (R_{D} R_{D}^{⊤})}{d t} {(R_{D} R_{D}^{⊤})}^{- 1}, \end{aligned}

and ${\dot{R}}_{D}$ was defined in $(3.13)$ . It is not difficult to see that $(3.26)$ is a function of ${\tilde{q}}_{i j}, {\tilde{v}}_{i j}, d_{i j}, {\dot{d}}_{i j}$ , and ${\ddot{d}}_{i j}$ for $(i, j) \in E^{*}$ . This control also suffers from the coupling issue discussed in Section 2.5 due to the presence of the pseudoinverse matrix $R^{†}$ in $(2.71)$ and $(3.26)$ .
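The product-rule expression for the pseudoinverse derivative can be verified numerically in the simplest rigid case: two planar agents joined by one edge, for which $R_{D} R_{D}^{⊤}$ is a scalar. The sketch below assumes $R^{†} = R_{D}^{⊤} {(R_{D} R_{D}^{⊤})}^{- 1}$ and a rigidity-matrix row of the form $[{\tilde{q}}_{12}^{⊤}, - {\tilde{q}}_{12}^{⊤}]$; the trajectory `q_rel` is a made-up smooth function chosen only for the check.

```python
import math

def rigidity_row(x):
    """R_D for the single edge (1, 2): the 1 x 4 row [x^T, -x^T]."""
    return [x[0], x[1], -x[0], -x[1]]

def pinv(R):
    """R^+ = R^T (R R^T)^{-1}; R R^T is a scalar for a single edge."""
    g = sum(r * r for r in R)
    return [r / g for r in R]

def pinv_dot(R, R_dot):
    """Analytic derivative: Rdot^T g^{-1} - R^T g^{-1} gdot g^{-1}, with g = R R^T."""
    g = sum(r * r for r in R)
    g_dot = 2.0 * sum(r * rd for r, rd in zip(R, R_dot))  # d(R R^T)/dt
    return [rd / g - r * g_dot / (g * g) for r, rd in zip(R, R_dot)]

# Hypothetical relative trajectory q~_12(t) and its derivative v~_12(t).
def q_rel(t): return (1.0 + 0.3 * t, 0.5 * math.sin(t))
def v_rel(t): return (0.3, 0.5 * math.cos(t))

def pinv_dot_analytic(t):
    # (3.13): the derivative of R_D(q~) is R_D evaluated at the relative velocity.
    return pinv_dot(rigidity_row(q_rel(t)), rigidity_row(v_rel(t)))

def pinv_dot_numeric(t, h=1e-5):
    """Central finite difference of the pseudoinverse along the trajectory."""
    plus = pinv(rigidity_row(q_rel(t + h)))
    minus = pinv(rigidity_row(q_rel(t - h)))
    return [(a - b) / (2.0 * h) for a, b in zip(plus, minus)]

print(pinv_dot_analytic(1.0))
print(pinv_dot_numeric(1.0))
```

For larger frameworks the scalar division becomes a matrix inverse of $R_{D} R_{D}^{⊤}$, which is where the coupling between agents enters.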

The proof of stability uses the same Lyapunov function candidate $(3.5)$ and combines the arguments from the proofs of Theorems 2.5 and 3.1. A sketch of the proof is as follows. Substituting $(3.8)$ and $(2.71)$ into $(3.6)$ yields

\begin{matrix} (3.27) & {\dot{W}}_{d} = - k_{v} z^{⊤} z - k_{a} s^{⊤} s ⩽ - 2 min (k_{v}, k_{a}) W_{d} \end{matrix}

for $e (t) \in Ω_{1}$ from which we conclude that $(e, s) = 0$ is exponentially stable for $e (0) \in Ω_{1}$ in the same vein as Theorem 3.1. The proof of $(2.66)$ for $(e (0), s (0)) \in Ω_{1} \cap Ω_{2} \cap Ω_{3}$ proceeds as in Theorem 3.1.

As in the single-integrator case, formation maneuvering can be performed concurrently with dynamic formation acquisition by setting $v_{f}$ to the right-hand side of $(2.73)$ . The derivative of $v_{f}$ will then be given by $(3.26)$ plus ${\dot{v}}_{d}$ as defined in $(3.18)$ .

Appendix

A. Integrator Backstepping Methodology

Integrator backstepping is a recursive control design methodology for systems in so-called strict-feedback form. It provides a systematic way of designing Lyapunov functions and nonlinear controllers for systems of any order. Unlike the feedback linearization method, backstepping can accommodate model uncertainties and avoid the unnecessary cancellation of "useful" (stabilizing) nonlinearities.

Since the dynamic models of the individual agents here have order at most two, we illustrate the backstepping technique by considering the system

\begin{aligned} (A.1) & \dot{x} = f (x) + η \\ (A.2) & \dot{η} = u \end{aligned}

where $[x, η] \in R^{2}$ is the system state, $u \in R$ is the control input, and $f (x)$ is continuously differentiable with $f (0) = 0$ . Say that our control objective is to stabilize the system at the equilibrium point $[x, η] = 0$ for any initial conditions.

Notice that the above system is a cascaded connection of subsystems $(A.1)$ and $(A.2)$ . The idea behind backstepping is to first consider $η$ as a control input for subsystem $(A.1)$ . Under this assumption, we could design $η = - f (x) - x$ to obtain the exponentially stable closed-loop system $\dot{x} = - x$ . Since in reality $η$ is a system state and thus cannot be directly manipulated, we use the trick of adding and subtracting a fictitious control input $η_{f}$ to the right-hand side of $(A.1)$ and introducing the variable transformation

ξ = η - η_{f}

As a result, our system becomes

\begin{aligned} \dot{x} & = f (x) + η_{f} + ξ \\ \dot{ξ} & = u - {\dot{η}}_{f} . \end{aligned}

Now, if we design

\begin{aligned} & η_{f} & = - f (x) - x \\ (A.3) & u & = {\dot{η}}_{f} - ξ - x \end{aligned}

where

{\dot{η}}_{f} = \frac{\partial η_{f}}{\partial x} (f (x) + η),

we get the closed-loop system

\begin{aligned} & \dot{x} & = - x + ξ \\ (A.4) & \dot{ξ} & = - ξ - x \end{aligned}

whose unique equilibrium point is $[x, ξ] = 0$ . Using the Lyapunov function candidate

V (x, ξ) = \frac{1}{2} (x^{2} + ξ^{2})

and taking its time derivative along $(A.4)$ yields

\dot{V} = - x^{2} - ξ^{2}

From Corollary 1.1, we can conclude that $[x, ξ] = 0$ is exponentially stable. Since $η_{f} (x = 0) = 0$ , we know that $[x, η] = 0$ is an exponentially stable equilibrium point for $(A.1)$ and $(A.2)$ in closed loop with $(A.3)$ .
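The backstepping design can be tried out numerically. The following sketch simulates $(A.1)$ - $(A.2)$ under $(A.3)$ for the example nonlinearity $f (x) = x^{2}$, which is an assumption made here for illustration (any continuously differentiable $f$ with $f (0) = 0$ would do). Since $\dot{V} = - x^{2} - ξ^{2} = - 2 V$, the Lyapunov function should decay as $V (0) e^{- 2 t}$, and the simulation can be compared against that prediction. The fourth-order Runge-Kutta integrator and the initial condition are likewise arbitrary choices.

```python
import math

def f(x):
    """Example nonlinearity (an assumption): continuously differentiable, f(0) = 0."""
    return x * x

def df(x):
    """Derivative of f, needed for the time derivative of eta_f."""
    return 2.0 * x

def closed_loop(state):
    """Right-hand side of (A.1)-(A.2) with the backstepping control (A.3)."""
    x, eta = state
    eta_f = -f(x) - x                            # fictitious control input
    xi = eta - eta_f                             # variable transformation
    eta_f_dot = (-df(x) - 1.0) * (f(x) + eta)    # chain rule along (A.1)
    u = eta_f_dot - xi - x                       # (A.3)
    return (f(x) + eta, u)

def lyapunov(state):
    """V = (x^2 + xi^2) / 2 evaluated from the original state (x, eta)."""
    x, eta = state
    xi = eta + f(x) + x
    return 0.5 * (x * x + xi * xi)

def rk4_step(state, dt):
    """One classical Runge-Kutta step for the closed-loop system."""
    def shifted(k, c):
        return (state[0] + c * k[0], state[1] + c * k[1])
    k1 = closed_loop(state)
    k2 = closed_loop(shifted(k1, 0.5 * dt))
    k3 = closed_loop(shifted(k2, 0.5 * dt))
    k4 = closed_loop(shifted(k3, dt))
    return tuple(state[i] + dt / 6.0 * (k1[i] + 2.0 * k2[i] + 2.0 * k3[i] + k4[i])
                 for i in range(2))

def simulate(state, dt, steps):
    """Integrate the closed-loop system for the given number of steps."""
    for _ in range(steps):
        state = rk4_step(state, dt)
    return state

s0 = (1.0, 0.5)
sT = simulate(s0, 1e-3, 5000)   # final time t = 5
print(sT, lyapunov(sT), lyapunov(s0) * math.exp(-10.0))
```

Note that the control never cancels the dynamics through a direct measurement of $ξ$ alone; it is computed entirely from the original states $x$ and $η$, which is what makes the design implementable.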

Marcio de Queiroz, Xiaoyu Cai, and Matthew Feemster, *Formation Control of Multi-Agent Systems: A Graph Rigidity Approach*. USA: John Wiley & Sons, Ltd, 2019. Accessed: Dec. 31, 2025: Section 2 & 3, Appendix C.

The control could also be a function of other, nonposition-related variables depending on the agent model and formation problem being solved.
Although the argument of the rigidity matrix function is commonly written as $q$ , it is obvious from $r_{G}$ and $R_{D}$ that $R_{D}$ is dependent on $\tilde{q}$ only. Henceforth, we write $R_{D} (\tilde{q})$ so it is clear that the matrix is a function of the relative position.
The variable $u_{a}$ in $(2.15)$ denotes the basic formation acquisition control term that will be embedded in all control algorithms.
Recall from the statement of the formation maneuvering problem in Section 1 that agent $n$ serves as the reference point through which the rotation axis passes. Therefore, ${\tilde{q}}_{in}$ in $(2.24)$ is the relative position between each agent and agent $n$ .
The introduction of the term $- u_{a n}$ in $(2.54)$ is crucial for the following stability analysis of the target interception error since it allows $u_{n}$ to have the simple form in $(2.56)$ .
It is important to point out that the framework $F^{*} (t)$ is required to be infinitesimally and minimally rigid for all time.
Since the precise smoothness properties are agent model-dependent, they will be specified later.

