Skip to content

Formation Control

1 Introduction [1]

Consider a system of n mobile agents where qiRm is the position of the ith agent relative to an Earth-fixed coordinate frame, and uiRm is the corresponding control input. In subsequent parts, ui will be a velocity-, acceleration-, or actuator-level input depending on the mathematical model used to describe the agent motion.

Let the desired formation for the agents be represented by an infinitesimally and minimally rigid framework F=(G,q) where G=(V,E) is the formation graph, dim(V)=n,dim(E)=l, and q=[q1,,qn]. The constant desired distance between agents i and j is given by

(1.1)dij=qiqj>0,i,jV.

In practice, the geometric shape/structure of the desired formation is dictated by the mission to be accomplished by the agents. When translating the desired shape into a framework, one needs to include enough edges to ensure that F is indeed infinitesimally and minimally rigid.

The actual formation of the agents is represented by the framework F(t)=(Gs,q(t)) where Gs represents the sensor graph and q=[q1,,qn]. It is important to clarify the difference between the formation graph G and the sensor graph Gs, which in general need not be the same.

  • G indicates the minimum number of inter-agent distances that need to be controlled for the desired formation to be successfully reached.
  • Gs indicates the agent pairs that can sense and/or communicate with each other.

We make the following assumptions regarding the desired and actual formations:

Assumption 1

The set where the agents achieve the desired formation is nonempty, i.e., there exist q such that rG(q)=d where d=[,dij2,]Rl.

Assumption 2

The formation and sensor graphs are the same, i.e., Gs=G. Furthermore, inter-agent connectivity is always maintained in the sense that agent i is always within the sensing/communication range of agent j, jNi(E). In other words, G is fixed.

Connectivity maintenance prevents the occurrence of flex ambiguities since temporary loss of edges cannot happen.

Assumption 3

At t=0, the agents do not satisfy the desired inter-agent distance constraints, i.e., qi(0)qj(0)=≠dij,i,jV.

Assumption 4

The only position information being measured is the relative position of agent pairs in E,qiqj,(i,j)E[1]. That is, the global position of the agents, qi,i=1,,n, are not available to the control.

We will deal with three types of control problems:

  • Formation Acquisition
  • Formation Maneuvering
  • Target Interception

Problem 1: Formation Acquisition

The goal is for the agents to acquire and maintain a pre-defined geometric shape in space. The control objective for formation acquisition, which serves as the common, primary objective for the other two problems, can be mathematically described as to design ui such that

(1.2)F(t)Iso(F) as t.

which is equivalent to

(1.3)qi(t)qj(t)dij as t,i,jV.

Since only the inter-agent distances are to be directly controlled, the actual formation can converge to any isometry of F. That is, the meaning is that the formation will converge to one framework in the set Iso(F) with the specific one being determined by the initial position of the agents, qi(0),i=1,,n.

Problem 2: Formation Maneuvering

The agents are required to simultaneously acquire a formation (i.e., satisfy (1.2)) and maneuver cohesively according to some pre-defined trajectory. Thus, the secondary objective is

(1.4)q˙i(t)vdi(t)0 as t,i=1,,n.

where vdiR3 represents the desired rigid body velocity for the swarm of agents. That is, the fixed-shape, desired formation evolves in space as a virtual rigid body undergoing translation and/or rotation.

In practice, the selection of vdi is mission-dependent. For example, it could be related to a path planning algorithm that provides an optimal solution to the coverage problem where agents cooperatively maximize the coverage area of a given mission under certain time and/or fuel consumption constraints.

When vdi only includes a translation velocity, the formation maneuvering problem is also called flocking. For the case where vdi has a rotational component, we assign the nth agent (without lost of generality) to be the “leader” while the remaining agents are “followers”. This assignment is for the sole purpose of one agent serving as a reference point for the axis of rotation of the virtual rigid body. Therefore, F should be constructed with the following additional conditions:

  • qnconv{q1,,qn1}.
  • (i,n)E,i=1,,n1, i.e., there is an edge between each follower and the leader.
1-example
Figure 1.1: Example of the construction of $F^∗$: a tetrahedron formation where L stands for leader and F for follower.

An example of F is illustrated by the 3D formation in Figure 1.1 where the leader is located in the interior of the tetrahedron. The axis of rotation passes through the leader, which is inside the tetrahedron. Since n=5, we need 3n6=9 for the framework to be minimally rigid. The solid lines indicate edges that form the faces of the tetrahedron while the dashed lines are edges in its interior. Notice that edge (1,4) is not necessary.

The association of a leader agent (instead of a virtual leader) with the axis of rotation is done for convenience (not necessity) since the leader’s relative position to the followers can be measured and it will not have to undergo any rotation. Note that if one uses a virtual leader, its location would have to be known in order to calculate its position relative to the agents (see (3) in Distance). This in turn would require extra measurements and/or calculations.

Problem 3: Target Interception

The agents should intercept and surround a (possibly evading) moving target with a pre-defined formation. Here, we will also use the leader–follower approach by taking the nth agent to be the leader while the remaining agents are followers. The control protocol will consist of:

  1. Selecting F such that qnconv{q1,,qn1} (Unlike formation maneuvering with rotation, we do not need the second condition for target interception)
  2. The leader chasing the target
  3. The followers tracking the leader while maintaining the desired formation. Thus, if qTRm denotes the target position, the secondary objective for this problem is that qT(t) approach conv{q1(t),q2(t),,qn1(t)} as time evolves, which (with abuse of notation) we express as

(1.5)qT(t)conv{q1(t),q2(t),,qn1(t)} as t.

Before beginning with the control design, some theorem and corollary statements will be made without proof.

Theorem 1.1 (Originally from [1] of Theorem C. 1)

Consider the SISO LTI system

x˙=Ax+Buy=Cx

where ARn×n is a Hurwitz matrix. Then, the following results hold:

  • If u(t)L2, then y(t)L2L,y(t)L2,y(t) is continuous, and y(t)0 as t.
  • If u(t)L, then y(t)L,y(t)L, and y(t) is uniformly continuous. If, in addition, u(t)0 as t, then y(t)0 as t.

Theorem 1.2 (Originally from [1] of Theorem C.2)

Let V:D×R0R be a continuously differentiable function such that

U1(x)V(x,t)U2(x)V˙=Vt+Vxf(x,t)U3(x)

for all t0 and for all xD, where Ui(x),i=1,2,3 are continuous positive definite functions on D. Then, xe=0 is uniformly asymptotically stable.

Corollary 1.1 (Originally from [1] of Corollary C.1)

If Ui(x)=cixp,i=1,2,3 where ci,p>0 in Theorem 1.1, then xe=0 is exponentially stable.

Input-to-state stability bridges the gap between the notions of Lyapunov stability and input–output stability by quantifying the effects of both initial conditions and external (control or disturbance) inputs on the system state.

Definition: Input-to-State Stability

A dynamical system x˙=f(x,u),x(0)=x0 with f:Rn×RmRn is said to be input-to-state stable if there exist a class KL function β and a class K function α such that, for any x0 and any u(t)L, the solution x(t) exists for all t0 and satisfies

x(t)β(x0,t)+α(sup0τtu(τ)).

The above inequality has several implications.

  • For any bounded input, the state is bounded.
  • As t, the state is ultimately bounded by function α.
  • If u(t)0 as t, so does x(t).

Theorem 1.3 (A corollary to Barbalat's Lemma, originally from [1] of Theorem C.3)

Consider the function F:R0R. If f(t)L,f˙(t)L, and f(t)L2, then

f(t)0 as t.

Theorem 1.4 (Originally from [1] of Theorem C.4)

Consider that f(x,u) in x=f(x,u)˙,x(0)=x0 is locally Lipschitz in (x,u) in some neighborhood of (x=0,u=0). Then, the system is locally input-to-state stable if and only if the unforced system x˙=f(x,0) has a locally asymptotically stable equilibrium point at the origin.

Theorem 1.5 (Originally from [1] of Theorem C.5)

Consider the interconnected system

(1.6)Σ1:x˙=f(x,y)Σ2:y˙=g(y).

If subsystem Σ1 with input y is locally input-to-state stable and y=0 is a locally asymptotically stable equilibrium point of subsystem Σ2, then [x,y]=0 is a locally asymptotically stable equilibrium point of the interconnected system.

Theorem 1.6 (Originally from [1] of Theorem C. 6)

If 0K[f](0,t) in a region QB(0,δ)×[t0,) and V : D×R0R is a regular function satisfying V(0,t)=0,

α1(x)V(x,t)α2(x)x0

and

V˙ a.e. ξV(x,t)ξ[K[f](x,t)1]α3(x)

in Q where αi(),i=1,2,3 are class K functions, K[f](x,t) is an upper semi-continuous, nonempty, compact, convex-valued map on D defined as

K[f](x,t):=δ>0μN=0cof(B(x,δ)N,t),

where μN=0 denotes the intersection over all sets N of Lebesgue measure zero, co is the convex closure, and B was defined as

B(x¯,r)={xRn:xx¯<r}

represents the "ball" of radius r centered at x¯.

Then x=0 is a uniformly asymptotically stable equilibrium point of system x˙=f(x,t),x(t0)=x0 where f:D×R0Rn is discontinuous in x and piecewise continuous in t on D×R0.

2 Single-Integrator Model [1]

This section will set the foundation for the formation control designs. We use here a very simple model for the motion of the agents known as the single-integrator model, which only includes two variables: position and velocity. This is a simplified kinematic model for omnidirectional robots (e.g., mobile robots with Swedish wheels). Specifically, we consider a system of n agents governed by the first-order differential equation

(2.1)q˙i=ui,i=1,,n.

where qiRm is the position and uiRm is the velocity-level control input of the ith agent with respect to an Earth-fixed coordinate frame. The name “single integrator” originates from the fact that the transfer function matrix of (2.1) is

(2.2)Gi(s)=1sIm

where s is the Laplace variable, i.e., the inputs and outputs are separated by one integrator.

Formation controllers based on (2.1) are called high-level control laws because they are often embedded in controllers designed for more refined agent models. Therefore, the control laws introduced in this section will form the basis for all subsequent designs.

2.1 Formation Acquisition

We begin with the formation acquisition problem defined in Section 1. Given (2.1), we seek to design ui=ui(qiqj,dij),i=1,,n and jNi(E), where Ni() was defined in Preliminary of Graph Theory to achieve the control objective described by (1.2) (or equivalently (1.3)).

It is appropriate at this point to elaborate on an issue mentioned at the end of Section of framework ambiguities. The inputs ui,i=1,,n will directly control the distances qiqj,(i,j)E. Therefore, they can only directly ensure that

(2.3)qi(t)qj(t)dij as t,(i,j)E,

which is equivalent to

(2.4)rG(q(t))rG(q)=d as t.

Note that (2.3) is different than (1.3) since it is only defined for (i,j)E while (1.3) is defined for all i,jV. This is potentially problematic since (with abuse of notation) rG(Iso(F))=rG(Amb(F)). Therefore, the control scheme will need to avoid the possibility that F(t)Amb(F) as t. This will be accomplished by initializing the agents sufficiently close to Iso(F) in the sense that dist(q(0),Iso(F))<dist(q(0),Amb(F)).

To simplify the notation in the following derivations, we define the relative position of two agents as

(2.5)q~ij=qiqj.

and let q~=[,q~ij,]Rml,(i,j)E with the same ordering of terms as the edge function rG(). The distance error is given by

(2.6)eij=q~ijdij.

Note that (1.3) is equivalent to eij(t)0 as t,i,jV. The distance error dynamics can be derived from (2.6) and (2.1) as

e˙ij=ddt(q~ijq~ij)=(q~ijq~ij)12q~ij(uiuj)(2.7)=q~ij(uiuj)eij+dij.

Let

(2.8)zij=q~ij2dij2,

which can be rewritten as

(2.9)zij=eij(eij+2dij)

using (2.6). Given that q~ij0 (or equivalently, eijdij), it is not difficult to see that zij=0 if and only if eij=0. We now introduce the following Lyapunov function candidate

(2.10)W(e)=14(i,j)Ezij2=14zz

where e=[,eij,]Rl and z=[,zij,]Rl,(i,j)E are ordered as rG. This function is positive definite in e and its level surfaces, W(e)=c for some c>0, are closed since eijdij. The time derivative of (2.10) along (2.7) is given by

(2.11)W˙=(i,j)Eeij(eij+2dij)q~ij(uiuj).

Using definition of rigidity matrix RD i.e., (6) in Infinitesimal Rigidity, and (2.9), (2.11) can be conveniently written as[2]

(2.12)W˙=zRD(q~)u

where u=[u1,,un]Rmn is the stacked vector of control inputs. Before presenting the main result, we introduce a lemma that establishes the relationship between Corollary of Theorem 2 in Graph Rigidity and the level surfaces of the Lyapunov function candidate.

Lemma 2.1

For nonnegative constants c and δ, the level set W(e)c is equivalent to Ψ(F,F)δ where Ψ and W were defined in Corollary of Theorem 2 in Graph Rigidity and (2.10), respectively.

Proof:

Details of Proof

First, from the definition of Ψ(,) in Corollary of Theorem 2 in Graph Rigidity, (1.1), (2.5), (2.6), we have that

Ψ(F,F)=(i,j)E(qiqjqiqj)2=(i,j)E(qiqjdij)2(2.13)=(i,j)Eeij2

From (2.10), we know W(e)c implies that eij,(i,j)E is bounded. This boundedness along with (2.13) implies Ψ(F,F)δ where δ is some nonnegative constant. Now, given Ψ(F,F)δ, it follows from (2.13) that eij is bounded for (i,j)E. This implies zij,(i,j)E is bounded, and W(e)c where c is some nonnegative constant. Q.E.D.

The control law for solving the formation acquisition problem is given in the following theorem. Its structure is based on (2.12) and Lyapunov stability theory. Specifically, the goal is to make the time derivative of the Lyapunov function candidate negative definite.

Theorem 2.1

Consider the formation F(t)=(G,q(t)), and let the initial conditions of the error dynamics be such that e(0)Ω1Ω2 where

Ω1={eRlΨ(F,F)δ},(2.14)Ω2={eRldist(q,Iso(F))<dist(q,Amb(F))},

and δ is a sufficiently small positive constant. The control law[3]

(2.15)u=ua:=kvRD(q~)z,

where kv>0 is a user-defined control gain, renders e=0 exponentially stable and ensures (1.2) is satisfied.

Proof:

Details of Proof

Given that F and F(t) have the same number of edges and that F is minimally rigid by design, then F(t) is minimally rigid for all t0. Substituting (2.15) into (2.12) yields

(2.16)W˙=kvzRD(q~)RD(q~)z.

Since F is infinitesimally rigid, we know from Corollary of Theorem 2 in Graph Rigidity that F(t) is infinitesimally rigid for e(t)Ω1. Therefore, we know F(t) is infinitesimally and minimally rigid for e(t)Ω1, so we can invoke Corollary of Theorem 3 in Minimal Rigidity to state

(2.17)W˙kλmin(RDRD)zz=4kλmin(RDRD)W for e(t)Ω1

where (2.10) was used. From (2.17), we know that W˙(t)0 for all t0; hence, W(t) is nonincreasing for all t0. Then, since e(t)Ω1 is equivalent to e(t){eR3nW(e)c} from Lemma 2.1, a sufficient condition for (2.17) is given by

(2.18)W˙4kλmin(RDRD)W for e(0)Ω1.

From the form of (2.18) and the fact that W is positive definite in e, we can invoke Corollary 1.1 to conclude that e=0 is exponentially stable for e(0)Ω1. Given that e is only defined for (i,j)E, the exponential stability of e=0 implies that F(t)Iso(F) or F(t)Amb(F) as t. If we choose e(0)Ω1Ω2, we have from (2.14) that

(2.19)dist(q(0),Iso(F(0)))<dist(q(0),Amb(F(0))).

Due to (2.19), the energy-like function W(t) would need to increase for a period of time for F(t)Amb(F) as t, which is a contradiction since (2.18) establishes that W(t) is nonincreasing for all t0. Therefore, we know F(t)Iso(F) as t for e(0)Ω1Ω2. This argument is conceptually illustrated by Figure 2.1, where the ball, representing F(t), would have to overcome the energy barrier to reach Amb(F). Q.E.D.

2-energy
Figure 2.1: Energy landscape showing the two equilibrium points, Iso(\mathcal{F}^{*}) and Amb(\mathcal{F}^{*})$, at the bottom of each well.

The initial condition e(0)Ω1Ω2 in Theorem 2.1 is a sufficient condition for the actual formation F(t) to

  1. Remain infinitesimally rigid for all time and
  2. Be closer to a framework in Iso(F) at t=0 than to one in Amb(F) in order to avoid converging to an ambiguous framework.

The former constraint is satisfied by e(0)Ω1 while the latter is satisfied by e(0)Ω2. The set Ω1Ω2 exists because it is always possible to select F(0) sufficiently close to a framework in Iso(F).

The control (2.15) can be expressed element-wise as

(2.20)ui=kvjNi(E)q~ijzij,i=1,n,

which is only a function of q~ij and dij for (i,j)E. Thus, the control law is decentralized since it only requires the ith agent to measure its relative position to neighboring agents.

Notice that each individual term of the summation in (2.20) is a vector whose direction is along q~ij. If all n agents are positioned collinearly at t=0, the control input of each one will necessarily be directed along the line. As a result, the agents will be stuck in a collinear formation and will never converge to the desired formation. In other words, the collinear formation is an invariant set. However, if at least one agent is not initially collinear with the others, the agents will not necessarily remain collinear because the edges between these agents and the noncollinear ones will create control components whose directions are not parallel to the line.

The stability result of Theorem 2.1 guarantees that the desired formation is acquired up to rotation and translation. In other words, the formation acquisition controller does not regulate the formation to a pre-defined global location in space. This is a reflection of the facts that ui is not a function of qi but only of the relative positions q~ij,(i,j)E and that the control objective is to regulate q~ij.

Since we are only concerned with the inter-agent distances, any coordinate frame can be used to implement ui. That is, although the above analysis was done with the variables defined with respect to a common, fixed coordinate frame for convenience, (2.20) can be implemented in practice with respect to the ith agent's own local coordinate frame. This means that the agents do not need to have a common sense of orientation and (2.20) is rotationally invariant. To see this, let F0 and Fi denote the Earth-fixed coordinate frame and the local coordinate frame of the ith agent, respectively (see Figure 2.2). If Ri0Rm denotes the rotation matrix representing the orientation of Fi with respect to F0, we have that

q~ij:=q~ij0=Ri0q~ijiui:=ui0=Ri0uii

where the superscript denotes the coordinate frame in which the vector is specified. From (2.20), we can then write

uii=kvjNi(E)(Ri0)Tq~ijzij=kvjNi(E)q~ijizij

since zij is independent of the coordinate frame.

2-coordinate
Figure 2.2: Fixed and local coordinate frames.

Finally, the control (2.7) is in fact the standard gradient descent law that often appears in the literature. If we rewrite z as

(2.21)z=rG(q)rG(q)

where rG and (2.8) were used, it follows from (2.10) that

(2.22)W=14rG(q)rG(q)2.

The derivative of (2.22) with respect to q is given by

Wq=12(rG(q)rG(q))rG(q)q=(rG(q)rG(q))RD(q~)

where RD(p) was used. Therefore,

u=qW=(Wq)=RD(q~)z,

which is the same as (2.7) without the control gain. That is, since (2.22) (also called a potential function) has a minimum when rG(q)=rG(q), it is well known from optimization theory that the negative gradient causes the system trajectory to approach the local minimum.

2.2 Formation Maneuvering

In this section, we solve the formation maneuvering problem defined in Section 1.4 using model (2.1). Since formation acquisition is embedded in the formation maneuvering problem, we use (2.12) as the starting point. The control law here will take the form ui=ui(q~ij,dij,vdi),i=1,,n and jNi(E) where vdi(t), which was defined in (1.4), is a bounded continuous function.

Theorem 2.2

Consider the formation F(t)=(G,q(t)) with the initial conditions on e(0) given in Theorem 2.1. Then, the control

(2.23)u=ua+vd,

where ua was defined in (2.15), vd=[vd1,,vdn]R3n is the desired rigid body velocity specified by[4]

(2.24)vdi=v0+ω0×q~in,i=1,,n

v0(t)R3 denotes the desired translation velocity for the formation, ω0(t)R3 is the desired angular velocity, renders e=0 exponentially stable and ensures that (1.2) and (1.4) are satisfied.

Proof:

Details of Proof

Substituting (2.23) into (2.12) yields

(2.25)W˙=kvzRD(q~)RD(q~)z+zRD(q~)vd.

TODO It follows from (1.20) and (2.24) that

(2.26)RD(q~)vd=0.

Therefore, the proof of Theorem 2.1 can be directly followed to show that e=0 is exponentially stable for e(0)Ω1Ω2 and (1.2) holds.

From (2.9) it is clear that z0 as e0. The exponential stability of e=0 implies that q~ is bounded from (2.6). Therefore, RD(q~) is bounded and we know from (2.23) and (2.15) that

(2.27)uvd as e0

Since we proved that e(t)0 as t, it follows from (2.27) and (2.1) that (1.4) holds. Q.E.D.

The control (2.23) has two independent components: the term ua is responsible for formation acquisition while vd is responsible for rigid body maneuvers of the whole formation. We can see from (2.26) that the control exploits the special structure of the rigidity matrix to disassociate the formation acquisition stability analysis from the formation maneuvering analysis.

Another interesting point is that, despite being based on the single-integrator model, (2.24) is generally not open-loop in nature since it depends on feedback of q~in. That is, (2.24) has an open-loop form only when the maneuver is purely translational.

The control law can be written element-wise as

ui=kvjNi(E)q~ijzij+v0+ω0×q~in,i=1,n,

which shows that it is decentralized. Note that in many applications the signals v0 and ω0 are known a priori and therefore can be stored on each agent's onboard computer. Also, since q~nn=0, the formation maneuvering term of the leader only has the translation component v0. This is expected since the leader by design lies on the axis of rotation of the virtual rigid body.

2.3 Flocking

Here, we consider the special case of formation maneuvering where the desired velocity only includes the translation component. Recall from Section 1 that this is commonly referred to as flocking. Unlike last Section, we consider that the desired flocking velocity is only available to a subset of agents. We will overcome this constraint by employing a distributed observer that estimates this velocity by exploiting the connectedness of the formation graph.

Constant Flocking Velocity

We first consider the case where the flocking velocity is constant. Let v0Rm be the constant flocking velocity and V0V be the nonempty subset of agents that have direct access to v0. To solve this flocking problem, we use the continuous controller-observer scheme

(2.28a)u=ua+v^(2.28b)v^˙i=αjNi(E)(v^iv^j)αbi(v^iv0),i=1,n

where

(2.29)bi={1, if iV00, otherwise 

ua was defined in (2.15), v^=[v^1,,v^n]Rmn contains the velocity estimates for each agent, and α>0 is a user-defined observer gain.

Theorem 2.3

Consider the formation F(t)=(G,q(t)) with the initial conditions in Theorem 2.1. Then, the controller-observer scheme (2.28) with any v^(0) renders e=0 asymptotically stable and ensures that (1.2) and (1.4) are satisfied with vdi=v0,i=1,,n.

Proof:

Details of Proof

Let

(2.30)v~i=v^iv0

denote the flocking velocity estimation error for agent i. If v~=[v~1,,v~n]Rmn, then

(2.31)v~=v^1nv0.

As part of this proof, we will show that (2.28b) guarantees v~(t)0 as t. From the time derivative of (2.8), we have that

(2.32)z˙=2RD(q~)u.

After substituting (2.28a) into (2.32), we get the closed-loop system

(2.33)z˙=2kvRD(q~)RD(q~)z+2RD(q~)v^.

Using (2.31) in (2.33) yields

(2.34)z˙=2kvRD(q~)RD(q~)z+2RD(q~)v~

upon application of Property RD(p)(1nx)=0 in Infinitesimal Rigidity.

Now, we turn our attention to deriving the dynamics of the estimation error. First, notice that

jNi(E)(v^iv^j)=j=1naij(v^iv^j)

where aij are the elements of the adjacency matrix. Taking the time derivative of (2.31) and substituting (2.28b) gives

v~˙=α(LIm)v~α(BIm)v~(2.35)=α(MIm)v~

where we used the fact that v^iv^j=v~iv~j,B:=diag(b1,bn),L is the Laplacian matrix defined in (1.4), and M:=L+B is symmetric. Our overall closed-loop system is composed of two interconnected subsystems, (2.34) and (2.35), which are in the form of (1.6). Notice that (2.34) with v~=0 is input-to-state stable by Theorem 1.4 since it reduces to the closed-loop system analyzed in Theorem 2.1. Since the graph of a rigid framework is always connected, we know that G is connected. Therefore, we know from Lemmas 1.1 and nonautonomous (time-varying) system x˙=f(x,t),x(t0)=x0 that M and MIm are positive definite, respectively. It then follows from (2.35) that v~=0 is exponentially stable. We can now invoke Theorem 1.5 to claim that (z,v~)=0 is an asymptotically stable equilibrium point of the interconnected system. Since z=0 if and only if e=0, we know e=0 is asymptotically stable. Finally, by virtue of the initial conditions, we know that F(t)Iso(F) as t as argued in the proof of Theorem 2.1.

Finally, due to the asymptotic stability of e=0, we know ua(t)0 as t and therefore from (2.28a) that u(t)v^(t)0 as t. Since v~i(t)=v^i(t)v00 as t, then we know from (2.1) that (1.4) holds. Q.E.D.

The form of (2.28b) is inspired by multi-agent consensus algorithms. The premise behind the observer is that agents that do not have direct access to v0 can acquire this information from its neighbors since the graph modeling the communication network is connected. Note that the observer (2.28b) can accommodate a leader-follower strategy (only one agent has access to v0 ) as well as the general case where the velocity information exchange happens between any two agents.

Time-Varying Flocking Velocity

The observer scheme in (2.28b) cannot be proven to ensure v~(t)0 as t for the case where the flocking velocity varies with time. In this situation, one can use the variable structure-type observer

(2.36)v^˙i=αsgn(jNi(E)(v^iv^j)+bi(v^iv0)),i=1,n

where v0(t)L is the time-varying flocking velocity, which is assumed to be differentiable with v˙0(t)Lγ for all time, bi was defined in (2.29), and sgn() is the standard signum function:

(2.37)sgn(x)={1 for x>00 for x=01 for x<0

The dynamics of the estimation error now become

(2.38)v~˙=αsgn((MIm)v~)1nv˙0

where sgn(x)=[sgn(x1),,sgn(xn)],xRn. Notice that (2.38) has a discontinuous right-hand side; thus, its solution needs to be studied using nonsmooth analysis. Since sgn() is Lebesgue measurable and essentially locally bounded, one can show the existence of generalized solutions by embedding the differential equation into the differential inclusion

(2.39)v~˙K[f](v~,t)

where K[] is a nonempty, compact, convex, upper semicontinuous set-valued map and f(v~,t)=αsgn((MIm)v~)1nv˙0.

If we define the Lyapunov function candidate

(2.40)Wf=12v~(MIm)v~

we get that

W˙f a.e. Wfv~K[f](v~,t)(2.41)αv~(MIm)sgn((MIm)v~)v~(MIm)(1nv˙0)

where a.e. is the abbreviation for the term "almost everywhere". If we define SGN(x):=[SGN(x1),,SGN(xn)],xRn where

(2.42)SGN(xi)={1 for xi>0[1,1] for xi=01 for xi<0

then (2.41) becomes

W˙f=αv~(MIm)SGN((MIm)v~)v~(MIm)(1nv˙0)=α(MIm)v~1(1nv˙0)(MIm)v~=α(MIm)v~1+v˙0i=1mn[(MIm)v~]iα(MIm)v~1+v˙01(MIm)v~1(2.43)(αγ)(MIm)v~1.

By choosing α>γ, we get that W˙f is negative definite. Therefore, from Theorem 1.6, we know that v~=0 is asymptotically stable.

Now the proof that (2.15) and (2.36) guarantee that (1.2) and (1.4) are satisfied directly follows from the proof of Theorem 2.3.

2.4 Target Interception with Unknown Target Velocity

We now turn our attention to the target interception problem defined in Section 1. We assume the target motion is such that qT(t) is three times continuously differentiable and diqT/dtiL,i=0,1,2,3. Furthermore, we consider the target velocity q˙T to be unknown to all agents, but that the leader can measure the target's relative position qTqn with its onboard sensors and can broadcast this information to the followers.

To simplify the notation, we let vT:=q˙T and

(2.44)eT=qTqn

denote the interception error between the leader and target. The control, which will include a term to "learn" the unknown target velocity, will take the general form ui=ui(q~ij,dij,eT,v^T),i=1,,n and jNi(E) where v^T is the target velocity estimate. This term is generated by the following continuous dynamic estimation mechanism

(2.45)v^T(t)=0t[k1eT(τ)+k2sgn(eT(τ))] dτ

where k1,k2>0 are user-defined control gains. This mechanism allows one to learn or compensate for sufficiently smooth, nonperiodic signals.

Before presenting the control law, a lemma is related to (2.45) is introduced.

Lemma 2.2

Let

(2.46)L:=(k1eT+e˙T)(v˙Tk2sgn(eT)).

If k2 in (2.45) is selected to satisfy the following sufficient condition

(2.47)k2>v˙TL+1k1v¨TL,

then

(2.48)0tL(τ) dτζb

where the positive constant ζb is defined as

(2.49)ζb=k2eT(0)1eT(0)v˙T(0)

Proof:

Details of Proof

Integrating (2.46) over time yields

0tL(τ) dτ=0t(k1eT(τ)+e˙T(τ))[v˙T(τ)k2sgn(eT(τ))] dτ=0tk1eT(τ)[v˙T(τ)k2sgn(eT(τ))] dτ+0te˙T(τ)v˙T(τ) dτ(2.50)0tk2e˙T(τ)sgn(eT(τ)) dτ

After integrating by parts the second integral on the right-hand side of (2.50) and applying Lemma 1 of [44] to the third integral, we obtain

0tL(τ) dτ=0tk1eT(τ)[v˙T(τ)k2sgn(eT(τ))] dτ+eT(τ)v˙T(τ)|0t0teT(τ)v¨T(τ) dτk2eT(τ)1|0t=0tk1eT(τ)[v˙T(τ)1k1v¨T(τ)k2sgn(eT(τ))] dτ(2.51)+eT(t)v˙T(t)eT(0)v˙T(0)k2eT(t)1+k2eT(0)1

Using the fact that x1x for any xRn, we can upper bound the right-hand side of (2.51) by

0tL(τ) dτ0tk1eT(τ)(v˙T(τ)+1k1v¨T(τ)k2) dτ(2.52)+eT(t)(v˙T(t)k2)+k2eT(0)1eT(0)v˙T(0)

Applying (2.47) to (2.52) gives (2.48). Finally, the positiveness of (2.49) follows from the fact that

k2eT(0)1eT(0)v˙T(0)eT(0)(k2v˙T(0))>0

when k2 is selected according to (2.47). Q.E.D.

Theorem 2.4

Consider the formation F(t)=(G,q(t)) with the initial conditions on e(0) given in Theorem 2.1. Then, the control

(2.53)u=ua+1nh,

where ua=[ua1,,uan] was defined in (2.15) and

(2.54)h=(k1+1)eT+v^Tuan,

renders e=0 exponentially stable and ensures that (1.2) and (1.5) are satisfied. Further, the target velocity can be identified in the sense that vT(t)v^T(t)0 as t.

Proof:

Details of Proof

After substituting (2.53) into (2.12), we obtain

(2.55)W˙=kvzR(q~)R(q~)z+zR(q~)(1nh).

Due to Property in the Infinitesimal Rigidity, the second term on the right-hand side of (2.55) disappears and the proof of Theorem 2.1 can be again followed to prove the exponential stability of e=0 and (1.2).

We now proceed to prove (1.5). From (2.53) and (2.54), we have that the leader control input is[5]

(2.56)un=(k1+1)eT+v^T.

Differentiating (2.44) and using (2.56) yields

(2.57)e˙T=vTun(2.58)=vT(k1+1)eTv^T(2.59)=k1eT+w

where

(2.60)w=vTeTv^T

The derivative of (2.60) is given by

(2.61)w˙=v˙Te˙Tk1eTk2sgn(eT)=w+v˙Tk2sgn(eT)

where (2.45) and (2.59) were used. Next, define the auxiliary function

(2.62)P=12ww,

whose derivative along (2.61) is given by

(2.63)P˙=w(w+v˙Tk2sgn(eT))=ww+L

where (2.46) was used. After integrating both sides of (2.63) with respect to time and applying Lemma 2.2, we obtain

0tP˙(τ) dτ=P(t)P(0)=0tw(τ)w(τ) dτ+0tL(τ) dτ(2.64)0tw(τ)w(τ) dτ+ζbζb

[^2]Since P(0) is finite, it follows from (2.64) that P(t)L, which implies that w(t)L from (2.62). From (2.64), we also have that

0tw(τ)w(τ) dτζb+P(0)P(t)<

which means that w(t)L2. Therefore, we know from (2.59) and Theorem 1.1 that eT(t)0 as t. We can also use (2.59) to claim that e˙TL, which implies from (2.57) (together with the boundedness of vT(t) ) that un(t)L. From (2.56), we then know that v^T(t)L. Since (1.26) holds and F is constructed such that qnconv{q1,,qn1}, we know that qn(t)conv{q1(t),q2(t),,qn1(t)} as t. Therefore, from the fact that eT(t)0 as t, we conclude that (1.5) holds.

Finally, we know w˙(t)L from (2.61) since v˙T is assumed bounded. It then follows from Theorem 1.3 that w(t)0 as t. Therefore, we can use (2.59) to show that e˙T(t)0 as t, and then (2.58) to conclude that vT(t)v^T(t)0 as t. Q.E.D.

Similar to the formation maneuvering control, the target interception controller (2.53) and (2.54) has two components with well-defined roles:

  • ua ensures formation acquisition
  • h guarantees target interception

The controller for the followers can be written element-wise as

ui=kvjNi(E)q~ijzij+(k1+1)eT+0t[k1eT(τ)+k2sgn(eT(τ))] dτuan

for i=1,,n1 where

uan=kvjNn(E)q~njznj

whereas the control for the leader is given by (2.56). As one can see, each follower control input depends on its relative position to neighboring agents, the target interception error, and the formation acquisition control term of the leader. Therefore, it is less decentralized than the formation acquisition and maneuvering controllers since now information needs to be wirelessly broadcast from the leader to the followers.

Finally, note that the target interception error (2.44) could be redefined to include a constant offset so that the leader does not collide with the target, i.e., eT=qnqTc where cRm is a constant vector.

2.5 Dynamic Formation Acquisition

So far, we have only considered formation acquisition when the desired formation F is static. In certain applications it may be necessary that the formation size and/or geometric shape change in time, such as to avoid obstacles, dynamically adapt to a change of mission, or adapt to limits in communication range and bandwidth. Thus, we consider now the problem of dynamic formation acquisition in the sense that the desired formation is a function of time, F(t). In control systems jargon, we will deal here with the more general tracking problem instead of the simpler setpoint problem.

Note that dynamic formation acquisition is independent of what we call formation maneuvering. In the former, the time-varying nature is related to the formation itself, whereas in the latter, the formation (whether static or dynamic) maneuvers as a virtual rigid body. The formal statement of the dynamic formation acquisition problem is as follows.

Problem 4 (Dynamic Formation Acquisition)

Let the desired formation be represented by a dynamic, infinitesimally and minimally rigid framework F(t)=(G,q(t))[6] where the time-varying desired distance between agents i and j is given by

(2.65)dij(t)=qi(t)qj(t)>0,i,jV.

We assume the desired distances are sufficiently smooth functions of time[7]. The control objective is to design ui such that

(2.66)F(t)Iso(F(t))0 as t,

or equivalently

(2.67)eij(t)0 as t,i,jV.

Because of the time-varying nature of (2.65), the distance error dynamics is now given by

(2.68)e˙ij=q~ij(uiuj)eij+dijd˙ij,

where (2.6) and (2.1) were used. As a result, the derivative of (2.10) along (2.68) becomes

(2.69)W˙=(i,j)Eeij(eij+2dij)[q~ij(uiuj)dijd˙ij]=z(RD(q~)udv)

where

(2.70)dv=[,dijd˙ij,]Rl,(i,j)E

with elements ordered as rG. We assume dij is a continuously differentiable function of time and dij(t),d˙ij(t)L. The presence of the extra term, dv, in the derivative of the Lyapunov function candidate will dictate a different control structure.

Theorem 2.5

Consider the formation F(t)=(G,q(t)) with the initial conditions given in Theorem 2.1. The control law

(2.71)u=RD(q~)(kvz+dv)

where RD(q~)=RD(q~)[RD(q~)RD(q~)]1 is the Moore-Penrose pseudoinverse, yields e=0 exponentially stable and guarantees that (2.66) is satisfied.

The proof of this theorem is nearly identical to the proof of Theorem 2.1 so the details are omitted. The main difference is that, since RD(q~) has full row rank for e(t)Ω1, then RD(q~)RD(q~)=I for e(t)Ω1. Therefore, substituting (2.71) into (2.69) yields

(2.72)W˙=kvzz=4kvW for e(t)Ω1.

From this point on, the proof of Theorem 2.1 can be directly followed to show that (2.66) holds for e(0)Ω1Ω2.

A fundamental difference exists in the implementation of (2.71) in comparison to the previous controllers of this chapter. Namely, the matrix RD(q~) couples the variables such that ui=ui(q~ij,dij,d˙ij),i=1,,n and (i,j)E. That is, unlike in the previous cases where jNi(E) for the ith input, here each input is dependent on all (i,j)E variables.

Formation maneuvering can be performed on top of dynamic formation acquisition by modifying (2.71) to

(2.73)u=RD(q~)(kvz+dv)+vd

where vd was defined in (2.24). It is straightforward to show that (2.73) ensures (1.4) properties by following the proof of Theorem 2.2.

Notes

The directionality of the information exchange among agents is an important design factor. This issue is of practical importance since it relates to the number of communication, sensing, and/or control channels of the multi-agent system.

In the case of bidirectional information exchange, a pair of agents concurrently controls the distance between them, whereas only one agent in the pair is responsible for this task in the unidirectional case. In terms of graph theory, bidirectional (resp., unidirectional) formation controllers are based on undirected (resp., directed) graphs. Undirected formation controllers have built-in redundancy, providing robustness. However, it can also lead to instability in the formation acquisition if agent pairs use slightly different values for the distance between them due to measurement errors. It was shown that this measurement mismatch causes a distortion of the formation from its desired shape and a circular (resp., helical) orbit of the distorted formation in 2D (resp., 3D).

One possible remedy for this problem is to have the agents communicate their respective measurements to one another and then use a common value for control (e.g., the average of the two measurements).

Yet another solution is to use a directed graph-based controller since it reduces the overall number of communication/sensing/control channels while avoiding the potential conflict between a pair of agents trying to achieve the same objective. However, in directed graphs it is possible to have cycles in the pathways, which are more challenging to control and can lead to formation instability. Therefore, the issue of cyclic versus acyclic graphs is an important consideration for directed formation control.

3 Double-Integrator Model [1]

In this section, we re-discuss the class of formation controllers presented in Chapter 2 in the context of a slightly more refined model, viz., the double-integrator model. We will follow the same format as the previous section for ease of correlation.

The double-integrator model accounts for the agent acceleration by treating the agent as a point mass. Therefore, it can be considered a very simple dynamic model for omnidirectional robots. Given a system of n agents, the equations of motion for the double-integrator model are

(3.1a)q˙i=vi(3.1b)v˙i=ui,i=1,,n

where viRm represents the velocity of the i th agent with respect to an Earth-fixed coordinate frame, uiRm is the acceleration-level control input, and qi is defined as in (2.1). Since the agent velocity is now a system state rather than the control input, the formation control laws in this section will be a function of the agent velocities in addition to the positions.

Note that the system transfer function matrix is now Gi(s)=1/s2Im, which gives rise to the model name. Since the only difference between this transfer function and (2.2) is an additional integrator, the extension of the single-integrator-based control laws to (3.1) is rather seamless if one exploits the integrator backstepping methodology (see Appendix A).

Double-Integrator Model for Formation Control

As in Section 2.1, we begin by deriving the distance error dynamics. To this end, we use (2.6) and (3.1a) to obtain

(3.2)e˙ij=q~ij(vivj)eij+dij

Differentiating (2.10) along (3.2) gives

(3.3)W˙=12zz˙=zRD(q~)v

where v=[v1,,vn]Rmn.

Given that v in (3.3) cannot be directly prescribed since it is a system state, we follow the backstepping technique and introduce the following variable

(3.4)s=vvf

where vfRmn denotes the fictitious (or desired) velocity input, which will be specified later. The variable s quantifies the error between the actual agent velocity and the desired velocity-level input. The design of vf will be problem-specific, and will come from the velocity-level control laws of Chapter 2. That is, generally speaking, vf=uSI where the superscript SI stands for one of the control input designs for the single-integrator model. The block diagrams in Figure 3.1 illustrate the relationship between the control designs for the single- and double-integrator models. As one can see, the velocity-level, position control algorithms from Chapter 2 will be embedded in the acceleration-level, velocity control loop to be designed in this chapter.

diff
Figure 3.1: Relationship between the (a) single- and (b) double-integrator control designs.

Due to the new error variable (3.4), we introduce the augmented Lyapunov function candidate

(3.5)Wd(e,s)=W(e)+12ss

where W was defined in (2.10). Notice that W is a potential energy-like term since it is only position dependent, whereas 12ss is a kinetic energy-like term due to its dependence on velocity. Therefore, Wd captures the total energy of the double-integrator model formation.

After taking the time derivative of (3.5), we obtain

W˙d=zRD(q~)v+ss˙=zRD(q~)(s+vf)+s(uv˙f)(3.6)=zRD(q~)vf+s(u+RD(q~)zv˙f)

where (3.3), (3.1b), and (3.4) were used. Equation (3.6) is the analogue of (2.12) since it will be the starting point for all double-integrator control designs as (2.12) was for the single-integrator designs.

3.1 Cross-Edge Energy

Before presenting the formation controllers, we need to discuss a complication in the stability analysis of the closed-loop system that arises from the double-integrator model. Specifically, this complication is related to the avoidance of flip ambiguities.

Recall that for the single-integrator model, the position of the initial formation needs to be restricted to prevent convergence to a flip ambiguity since the velocity-level control input is designed to promote convergence to Iso(F) or Amb(F), whichever is closer at t=0. Unfortunately, this condition is not sufficient for the double-integrator model. In this case, the agents' velocity will also affect the convergence since it is a system state. This idea is conceptually illustrated by Figure 3.2. Note that even if the formation position is closer to Iso(F), the formation will overcome the energy barrier and converge to Amb(F) if its velocity is large enough. In other words, the total formation energy is now affected by the combination of potential energy and kinetic energy. The implication of this for stability is that a restriction also needs to be imposed on the initial velocity of the formation, which means that we need to limit the initial total energy of the formation.

energy
Figure 3.2: Energy landscape where the formation is at position q with velocity v.

While the need for an upper bound on the initial energy of the formation is evident, its precise value is difficult to calculate in general. For simple formations, one may be able to calculate a conservative value for the energy upper bound as illustrated next. Consider the desired triangular formation in Figure 3.3 along with one of its flipped versions. Note that a flip may occur whenever an agent has enough energy to cross the edge connecting the two other agents, e.g., agent 1 crossing edge (2,3). Once the agent crosses the edge, it is closer to Amb(F) and may be attracted to this undesired equilibrium.

amb
Figure 3.3: Desired formation (solid line) and a flip ambiguity (dashed line).

The question is then: What is the minimum energy needed for this to happen? Hereafter, we refer to this minimum energy as the cross-edge energy, Ec.

A conservative estimate for the cross-edge energy can be made by using the following observations:

  1. The cross-edge energy is related to the energy that drives the agents to a collinear formation
  2. The minimum collinearity energy is given by the agent with the smallest distance to its cross-edge, e.g., the dotted line in Figure 3.3.

These rules facilitate the cross-edge energy estimation because they are only position dependent. Furthermore, we have from (3.5) and (2.10) that WdW=14zz, which is also only position dependent. That is, a sufficient condition for Ec can be determined by calculating the minimum value of W when the three agents are collinear. For example, let d12=d13=2 and d23=2. When agent 1 is collinear with agents 2 and 3 , we have that q~12+q~13=q~23. For notational convenience, we use qC where q=[q1,q2,q3] to denote that the agents are collinear. Therefore,

Ec=minqCW=minqC14(z122+z132+z232)=minqC14[(q~122d122)2+(q~132d132)2+(q~232d232)2]=min14[(q~1222)2+((q~23q~12)22)2+(q~2324)2].

It can be found that the above function reaches a minimum at q~12=q~23/2=10/3 and Ec=0.444. This means that if Wd(0)Ec, the agents will not converge to the flip ambiguity.

Notice that the condition Wd(0)Ec imposes a trade-off between the initial distance error and the initial velocity error. The larger the initial distance error, the smaller the initial velocity error needs to be, and vice versa. Based on (3.4), a small s implies that the agents' velocities are close to vf, which is the desired velocity that ensures convergence to Iso(F).

For formations with n>3, one may apply the above estimation method by triangulating the framework and comparing the cross-edge energy of each triangle to estimate Ec. For example, consider the infinitesimally rigid framework in Figure 3.4. The agents most likely to flip are agents 2 and 6 about cross-edges (1,3) and (1,5), respectively, since they only have two edges (constraints) each. Thus, Ec=min{Ec2,Ec6} where Eci denotes the cross-edge energy of agent i. Note that higher order flips are also possible, but they would require more energy than aforementioned single-agent flips. For example, agents {2,3} or {5,6} could simultaneously also flip about cross-edge (1,4), or agents {2,3,4,5,6} could simultaneously flip about agent 1 , leading to a full reflection of the formation.

hex
Figure 3.4: Triangulated hexagon framework.

3.2 Formation Acquisition

The formation acquisition controller for (3.1) will have the general form ui=ui(qiqj,vivj,vi,dij),i=1,,n and jNi(E). Based on (3.6), the following theorem introduces the control law that solves the formation acquisition problem.

Theorem 3.1

Given the formation F(t)=(G,q(t)), let the initial conditions be such that (e(0),s(0))Ω1Ω2Ω3 where Ω1 and Ω2 were defined in (2.14),

(3.7)Ω3={eRl,sRmnWdEc},

and Ec is the total cross-edge energy of the formation. Then, the control

(3.8)u=kas+v˙fRD(q~)z,

where

(3.9)vf=ua,

ua was defined in (2.15), and ka>0 is a user-defined control gain, renders (e,s)=0 exponentially stable and ensures that (1.2) is satisfied.

Proof:

Details of Proof

Substituting (3.8) and (3.9) into (3.6) yields

(3.10)W˙d=kvzRD(q~)RD(q~)zkass.

Following the arguments used in the proof of Theorem 2.1, we have that

W˙dkvλmin(RDRD)zzkass(3.11)min{2ka,4kvλmin(RDRD)}Wd

for e(0)Ω1. From (3.5) and (3.11), we know that (e,s)=0 is exponentially stable for e(0)Ω1 from Corollary 1.1, and therefore, F(t)Iso(F) or F(t)Amb(F) as t for e(0)Ω1. Now, if the initial conditions are chosen such that (e(0),s(0))Ω1Ω2Ω3, we know that the formation starts closer to Iso(F) than Amb(F) and Wd(e(0),s(0))<Ec. Since W˙d0, we know that Wd(e(t),s(t))<Ec for all t>0, indicating that the formation energy is always less than the minimum energy required for a flip to occur. Thus, we have that (1.2) holds for (e(0),s(0))Ω1Ω2Ω3. Q.E.D.

The expression for v˙f in (3.8) is given by

(3.12)v˙f=kvR˙DzkvRDz˙

where from (6) in Distance

(3.13)R˙D(q~)=RD(v~),

v~=[,vivj,]Rl,(i,j)E, and from (3.3)

(3.14)z˙=2RD(q~)v.

The control (3.8)-(3.9) can be written element-wise as

(3.15)ui=kavijNi(E)[(kakv+1)q~ijzij+kv(zijI2+2q~ijq~ij)v~ij]

for i=1,n and

(3.16)v~ij=vivj,(i,j)E.

This control is decentralized since its implementation only requires each agent to measure its own velocity and the relative position and relative velocity to neighboring agents. The agent's velocity can be measured using onboard sensors such as an odometer and a compass.

3.3 Formation Maneuvering

The formation maneuvering control law for the double-integrator model (3.1a)-(3.1b) is simply a combination of the designs in Sections 2.2 and 3.2. Specifically, u is given by (3.8) with

(3.17)vf=ua+vd

where the formation maneuvering velocity vd was specified in (2.24). Note that (3.17) is exactly the right-hand side of (2.23).

We will not present the formal statement and proof of this result, but only discuss the aspects in which it differs from the proofs of Theorems 2.2 and 3.1. This is namely the proof that (1.4) holds. First, after substituting (3.17) into (3.6), the proofs of the exponentially stability of (e,s)=0 and (1.2) are straightforward given that RD(q~)vd=0 (see (1.20) and (2.24)). Now, since e(t)0 as t, we know from (2.9) that z(t)0 as t. Since RD(q~) is bounded, then ua(t)0 as t from (2.15). Therefore, we have that vf(t)vd(t) as t from (3.17). Since we know s(t)0 as t, it follows from (3.4) that v(t)vf(t)0 as t. Therefore, vi(t)vdi(t)0 as t,i=1,,n, which is the same as (1.4) due to (3.1a).

The term v˙f in (3.8) will contain additional terms from the derivative of vd. Specifically, from (2.24), we have that

(3.18)v˙di=v˙t+ω˙×q~in+ω×v~in,i=1,n

where v˙tR3 denotes the desired translational acceleration and ω˙R3 is the desired angular acceleration for the virtual rigid body. Therefore, for the double-integrator model, vt and ω need to be continuously differentiable functions of time with bounded first derivative for the control input to be continuous and bounded. Note that element-wise the formation maneuvering control law is simply made up of the sum of the right-hand sides of (3.15) and (3.18). Like vt and ω, the signals v˙t and ω˙ can be stored on each agent's onboard computer since they are typically known a priori.

3.4 Target Interception with Unknown Target Acceleration

Solving the target interception problem for the double-integrator model requires a more elaborate solution than the one presented in Section 2.4 for the single-integrator model. Here, we consider that the target position qT(t) is twice continuously differentiable and qT(t),q˙T(t),q¨T(t)L. We also assume the signals qTqn,q˙Tq˙n,q˙n, and q˙T are known and can be broadcast from the leader to the followers; however, the signal q¨T is unknown. A variable structure-type control term will be used to compensate for the unknown target acceleration. As a result, the right-hand side of the resulting error system dynamics will be discontinuous, requiring us to apply some ideas from Lyapunov stability of nonsmooth systems. As in Section 2.4, we let vT:=q˙T to simplify the notation.

Theorem 3.2

Let the initial conditions be such that (e(0),s(0))Ω1Ω2Ω3 where Ω1 and Ω2 were defined in (2.14) and Ω3 was defined in (3.7). Consider the control

(3.19)u=kas+u˙a+1nkT(vTvn)kssgn(s)RD(q~)z

where s was defined in (3.4),

(3.20)vf=ua+1nh,(3.21)h=kTeT+vT,

ua was defined in (2.15), eT was defined in (2.44), ksnv˙TL, and kT>0. Then, (3.19) renders (e,s)=0 asymptotically stable and ensures that (1.2) and (1.5) are satisfied.

Proof:

Details of Proof

First, notice that the differential equations describing the (e,s)-error system dynamics in a closed loop with (3.19)-(3.21) have a discontinuous right-hand side due to the term sgn(s) in (3.19). That is, if ξ˙=f(ξ,t) denotes the closed-loop system where ξ=[e,s], then f(ξ,t) is continuous everywhere except in the set {(ξ,t)s=0}. For such a system, a generalized solution exists by embedding the differential equations into the differential inclusions ξ˙K[f](ξ,t). In this case, the time derivative of (3.5) is given by

W˙d a.e. WdξK[f](ξ,t)(3.22)zRD(q~)vf+s(u+RD(q~)zv˙f)

where (3.6) was used. Substituting (2.15), (3.19), (3.20), and (3.21) into (3.22) and then applying RD(p)(1nx)=0 gives [42]

W˙dkvzRDRDzkasss(kssgn(s)+1nv˙T)=kvzRDRDzkasss(ksSGN(s)+1nv˙T)(3.23)kvzRDRDzkass+s(nv˙TLks)

where sgn() and SGN(x) were defined in (2.37) and (2.42), respectively.

For ksnv˙TL, (3.23) reduces to (3.11) so W˙d is negative definite for e(0)Ω1. Therefore, from Theorem 1.6, we know that (z,s)=0 is asymptotically stable. Since Wd is positive definite in e, we know that (e,s)=0 is asymptotically stable for e(0)Ω1. The proof of (1.2) for (e(0),s(0))Ω1Ω2Ω3 now follows from the same arguments used in the proof of Theorem 3.1.

Next, from (3.20), we have

(3.24)vfn=uan+vT+kTeT

where the subscript n denotes the nth element of the corresponding vector. Differentiating (2.44) and applying (3.24) yields

e˙T=vTvn=vT(vfn+sn)(3.25)=k1eT+r

where r:=snuan. Since (3.25) is a stable linear system with input r and output eT, the output will converge to zero if the input converges to zero (see Theorem 1.1). Given that (z,s)=0 is asymptotically stable, we know that (s(t),z(t))0 as t and therefore ua(t)0 as t. As a result, r(t)0 as t and, from (3.25), eT(t)0 as t. Finally, since (1.2) implies that qn(t)conv{q1(t),q2(t),,qn1(t)} as t due to the manner in which F is constructed for the target interception problem, we conclude from the convergence of eT to zero that (1.5) holds. Q.E.D.

A few observations are in order concerning the structure of (3.19)-(3.21).

  1. v˙f is not included in (3.19) as it is in (3.8) because the derivative of (3.20) is a function of the unknown signal v˙T. Hence, only the measurable terms of v˙f appear in (3.19). Since v˙T cannot be directly cancelled by the control, it is instead dominated by the variable structure term kssgn(s) as shown in (3.23).
  2. Comparing (2.54) and (3.21), notice the absence of the term uan in the latter. Unlike the control in Theorem 2.4, the presence of this term in (3.21) is not necessary for proving the converge of eT to zero. If uan was included (3.21), the above stability analysis would still hold with the exception that the auxiliary variable r in (3.25) would become simply r=sn.

When expressed element-wise, the control (3.19)-(3.21) takes the form

ui=kavi+ka(vT+kTeT)+kT(vTvn)jNi(E)[kv(zijI2+2q~ijq~ij)v~ij+(kakv+1)q~ijzij]kssgn(vivTkTeT+kvjNiq~ijzij)

As one can see, the ith agent's control input is dependent on its own velocity and the relative position/velocity to neighboring agents, eT,vT, and vn.

3.5 Dynamic Formation Acquisition

When solving the dynamic formation acquisition problem (see Problem 4 in Section 2.5) for the double-integrator model, we require that the time-varying distance dij(t) be twice continuously differentiable and dij(t),d˙ij(t),d¨ij(t)L for the control law to be continuous and bounded.

Similar to the formation maneuvering control law of this chapter, the dynamic formation acquisition control input will take the form of (3.8) but with the problem-specific design for vf. That is, vf is set to the right-hand side of (2.71) for dynamic formation acquisition.

The term v˙f in (3.8) can be explicitly calculated from (2.71) as follows

(3.26)v˙f=R˙(q~)(kvz+dv)+R(q~)(kvz˙+d˙v)

where dv was defined in (2.70),

d˙v=[,d˙ij2+dijd¨ij,],(i,j)E,z˙=2(RD(q~)vdv),R˙=R˙D(RDRD)1RD(RDRD)1d(RDRD)dt(RDRD)1,

and R˙D was defined in (3.13). It is not difficult to see that (3.26) is a function of q~ij,v~ij,dij,d˙ij, and d¨ij for (i,j)E. This control also suffers from the coupling issue discussed in Section 2.5 due to the presence of the pseudoinverse matrix Rin (2.71) and (3.26).

The proof of stability uses the same Lyapunov function candidate (3.5) and combines the arguments from the proofs of Theorems 2.5 and 3.1. A sketch of the proof is as follows. Substituting (3.8) and (2.71) into (3.6) yields

(3.27)W˙d=kvzzkass2min(kv,ka)Wd

for e(t)Ω1 from which we conclude that (e,s)=0 is exponentially stable for e(0)Ω1 in the same vein of Theorem 3.1. The proof of (2.66) for (e(0),s(0))Ω1Ω2Ω3 proceeds as in Theorem 3.1.

As in the single-integrator case, formation maneuvering can be performed concurrently with dynamic formation acquisition by setting vf to the right-hand side of (2.73). The derivative of vf will then be given by (3.26) plus v˙d as defined in (3.18).

Appendix

A. Integrator Backstepping Methodology

Integrator backstepping is a recursive control design methodology for systems in so-called strict-feedback form. It provides a systematic way of designing Lyapunov functions and nonlinear controllers for systems of any order. Unlike the feedback linearization method, backstepping can accommodate model uncertainties and avoid the unnecessary cancellation of "useful" (stabilizing) nonlinearities.

Since the dynamic model of the individual agents here have at most order two, we illustrate the backstepping technique by considering the system

(A.1)x˙=f(x)+η(A.2)η˙=u

where [x,η]R2 is the system state, uR is the control input, and f(x) is continuously differentiable with f(0)=0. Say that our control objective is to stabilize the system at the equilibrium point [x,η]=0 for any initial conditions.

Notice that the above system is a cascaded connection of subsystems (A.1) and (A.2). The idea behind backstepping is to first consider η as a control input for subsystem (A.1). Under this assumption, we could design η=f(x)x to obtain the exponentially stable closed-loop system x˙=x. Since in reality η is a system state and thus cannot be directly manipulated, we use the trick of adding and subtracting a fictitious control input ηf to the right-hand side of (A.1) and introducing the variable transformation

ξ=ηηf

As a result, our system becomes

x˙=f(x)+ηf+ξξ˙=uη˙f.

Now, if we design

ηf=f(x)x(A.3)u=η˙fξx

where

η˙f=ηfx(f(x)+η),

we get the closed-loop system

x˙=x+ξ(A.4)ξ˙=ξx

whose unique equilibrium point is [x,ξ]=0. Using the Lyapunov function candidate

V(x,ξ)=12(x2+ξ2)

and taking its time derivative along (A.4) yields

V˙=x2ξ2

From Corollary 1.1, we can conclude that [x,ξ]=0 is exponentially stable. Since ηf(x=0)=0, we know that [x,η]=0 is an exponentially stable equilibrium point for (A.1) and (A.2) in closed-loop with (A.3).


  1. Marcio de Queiroz, Xiaoyu Cai, and Matthew Feemster, *Formation Control of Multi-Agent Systems: A Graph Rigidity Approach. USA: John Wiley & Sons, Ltd, 2019. Accessed: Dec. 31, 2025: Section 2 & 3, Appendix C.

  1. The control could also be a function of other, nonposition-related variables depending on the agent model and formation problem being solved. ↩︎

  2. Although the argument of the rigidity matrix function is commonly written as q, it is obvious from rG and RD that RD is dependent on q~ only. Henceforth, we write RD(q~) so it is clear that the matrix is a function of the relative position. ↩︎

  3. The variable ua in (2.15)$ denotes the basic formation acquisition control term that will be embedded in all control algorithms. ↩︎

  4. Recall from the statement of the formation maneuvering problem in Section 1 that agent n serves as the reference point through which the rotation axis passes. Therefore, q~in  in (2.24) is the relative position between each agent and agent n. ↩︎

  5. The introduction of the term uan in (2.54) is crucial for the following stability analysis of the target interception error since it allows un to have the simple form in (2.56). ↩︎

  6. It is important to point out that the framework F(t) is required to be infinitesimally and minimally rigid for all time. ↩︎

  7. Since the precise smoothness properties are agent model-dependent, they will be specified later. ↩︎

Released under the MIT License.