MINCO: Geometrically Constrained Trajectory Optimization for Multicopters

Source: [1] Z. Wang, X. Zhou, C. Xu, and F. Gao, "Geometrically constrained trajectory optimization for multicopters", IEEE Trans. Robot., vol. 38, no. 5, pp. 3259–3278, Oct. 2022, doi: 10.1109/TRO.2022.3160022.

1 Preliminaries

1.1 Differential Flatness

Consider a dynamical system of the following type

\begin{matrix} (1) & \dot{x} = f (x) + g (x) u \end{matrix}

with $f : R^{n} \mapsto R^{n}$ , $g : R^{n} \mapsto R^{n \times m}$ , state $x \in R^{n}$ , and input $u \in R^{m}$ . The map $g$ is assumed to have rank $m$ . The system is said to be differentially flat, if there exists a flat output $z \in R^{m}$ determined by $x$ and finite derivatives of $u$ , such that $x$ and $u$ can both be parameterized by finite derivatives of $z$ :

\begin{array}{rl} (2) & x = Ψ_{x} (z, \dot{z}, \dots, z^{(s - 1)}), \\ (3) & u = Ψ_{u} (z, \dot{z}, \dots, z^{(s)}), \end{array}

where $Ψ_{x} : {(R^{m})}^{s} \mapsto R^{n}$ and $Ψ_{u} : {(R^{m})}^{s + 1} \mapsto R^{m}$ are both induced by $f$ and $g$ (the exponents count the $s$ and $s + 1$ arguments listed in $(2)$ and $(3)$). Intuitively, the state and control can be determined from $z$ without explicit integration of the system dynamics $(1)$.

Flatness makes trajectory generation convenient when the only constraints are the differential constraints in $(1)$. If we introduce a new control variable $v = z^{(s)}$ and denote $z^{[s - 1]} \in R^{m s}$ as

\begin{matrix} (4) & z^{[s - 1]} = {(z^{⊤}, {\dot{z}}^{⊤}, \dots, z^{{(s - 1)}^{⊤}})}^{⊤}, \end{matrix}

the input $u = Ψ_{u} (z^{[s - 1]}, v)$ then exactly linearizes the original flat system into $m$ decoupled chains of $s$ -integrators. Let $z_{i}$ denote the $i$ -th entry in $z$ , $v_{i}$ the $i$ -th entry in $v$ and $z_{i}^{[s - 1]} = {(z_{i}, {\dot{z}}_{i}, \dots, z_{i}^{(s - 1)})}^{⊤}$ . The $i$ -th integrator chain is

\begin{matrix} (5) & {\dot{z}}_{i}^{[s - 1]} = (\begin{matrix} 0 & I_{s - 1} \\ 0 & 0^{⊤} \end{matrix}) z_{i}^{[s - 1]} + (\begin{matrix} 0 \\ 1 \end{matrix}) v_{i}, \end{matrix}

where $0$ and $I$ are a zero matrix and an identity matrix with appropriate sizes, respectively. Given an initial state and a goal state, boundary values of each integrator chain $(5)$ can be algebraically computed. Thus any trajectory integrated from these $m$ integrator chains can be transformed into a feasible trajectory for the original flat system via $(2)$ and $(3)$ .
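The integrator chain in $(5)$ can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the helper names `integrator_chain`, `poly_derivative`, `poly_eval` and `chain_residual` are hypothetical, and $z_{i} (t)$ is taken to be a polynomial given by its ascending coefficients so that all derivatives are exact.

```python
def integrator_chain(s):
    """Return (A, B) of the s-integrator chain in (5) as nested lists."""
    # A has ones on the first superdiagonal; B feeds v_i into the last state.
    A = [[1 if col == row + 1 else 0 for col in range(s)] for row in range(s)]
    B = [0] * (s - 1) + [1]
    return A, B

def poly_derivative(coeffs, order=1):
    """Differentiate a polynomial given by ascending coefficients."""
    for _ in range(order):
        coeffs = [k * a for k, a in enumerate(coeffs)][1:] or [0]
    return coeffs

def poly_eval(coeffs, t):
    """Evaluate a polynomial with ascending coefficients via Horner's rule."""
    value = 0
    for a in reversed(coeffs):
        value = value * t + a
    return value

def chain_residual(coeffs, s, t):
    """Residual of (5) at time t for a polynomial z_i; all zeros if (5) holds."""
    A, B = integrator_chain(s)
    state = [poly_eval(poly_derivative(coeffs, k), t) for k in range(s)]
    state_dot = [poly_eval(poly_derivative(coeffs, k + 1), t) for k in range(s)]
    v = poly_eval(poly_derivative(coeffs, s), t)       # v_i = z_i^{(s)}
    rhs = [sum(A[r][c] * state[c] for c in range(s)) + B[r] * v for r in range(s)]
    return [lhs - r for lhs, r in zip(state_dot, rhs)]
```

For example, `chain_residual([0, 0, 0, 1], 3, 2)` checks $z_{i} (t) = t^{3}$ against the triple integrator at $t = 2$.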

For dynamics with a small $m$ , the flatness maps $Ψ_{x}$ and $Ψ_{u}$ further reduce the trajectory dimension and eliminate the differential constraints $(1)$ , which is illustrated in Fig. 1. As a side effect, nonlinearity coming from both $Ψ_{x}$ and $Ψ_{u}$ brings additional difficulties in trajectory generation for $z$ when there are additional state-input constraints for $(1)$ . However, such an effect is relieved if the flat-output space coincides with the configuration space of the relevant planning problem.

Figure 1 [1]: The transforms $\Psi_u$ and $\Psi_x$ of a flat system eliminate the differential constraints (blue surface) from the dynamics in the state-input space (left coordinates). The original state-input constraint $\mathcal{G}_\mathcal{D}$ (red area) is also transformed into a new constraint $\mathcal{G}$ (blue volume) in the flat-output space (right coordinates).

1.2 Direct Optimization in Flat-Output Space

Fortunately, the differential flatness of multicopters has been well studied, and multicopters have been shown to have a physically meaningful flat-output space that overlaps with the configuration space. Explicit forms of $Ψ_{x}$ and $Ψ_{u}$ are available in previous research (see the original paper [1] for details) for a variety of underactuated multicopters. More importantly, their flat outputs share the same general form:

\begin{matrix} (6) & z = {(p_{x}, p_{y}, p_{z}, ψ)}^{⊤} \end{matrix}

where ${(p_{x}, p_{y}, p_{z})}^{⊤}$ is the translation of the Center of Gravity (CoG) and $ψ$ the yaw angle of the vehicle. The flat output $z$ , especially its translation, provides a lot of convenience for the multicopter motion planning with complex spatial constraints.

To generate feasible motions for a multicopter, we first optimize the trajectory $z (t) : [0, T] \mapsto R^{m}$ in its flat-output space such that most of the spatial constraints are directly enforced. Then, the flatness maps $Ψ_{x}$ and $Ψ_{u}$ are applied to transform $z (t)$ into the state-input trajectory $x (t)$ and $u (t)$ .

For motion smoothness, the quadratic control effort with time regularization is adopted as a cost functional of $z (t)$ . General constraints on multicopters can be classified into configuration constraints and user-defined dynamic constraints. Normally, a collision-free motion implies

\begin{matrix} (7) & z (t) \in F, \forall t \in [0, T], \end{matrix}

where $F$ is the concerned obstacle-free region in configuration space. Besides, user-defined state-input constraints such as actuator limits or task-specific constraints are denoted by

\begin{matrix} (8) & G_{D} (x (t), u (t)) ⪯ 0, \forall t \in [0, T] . \end{matrix}

Exploiting $Ψ_{x}$ and $Ψ_{u}$ , the corresponding constraints on $z (t)$ are computed as

\begin{matrix} (9) & G_{D} (Ψ_{x} (z^{[s - 1]} (t)), Ψ_{u} (z^{[s]} (t))) ⪯ 0, \forall t \in [0, T] . \end{matrix}

Apparently, via the flatness, a constraint on $x$ and $u$ has its equivalent form on the finite derivatives of $z (t)$ . For simplicity, we denote $(9)$ hereafter by

\begin{matrix} (10) & G (z (t), \dot{z} (t), \dots, z^{(s)} (t)) ⪯ 0, \forall t \in [0, T], \end{matrix}

where $G$ consists of $n_{g}$ equivalent constraints.

It is worth noting that we do not make further assumptions on the multicopter dynamics and flatness maps. In other words, the proposed framework supports a wide range of multicopters.

1.3 Problem Formulation

Summarizing the above descriptions gives the following problem:

\begin{array}{lll} (11a) & \min_{z (t), T} & \int_{0}^{T} v (t)^{⊤} W v (t) \, d t + ρ (T), \\ (11b) & \text{s.t.} & z^{(s)} (t) = v (t), \ \forall t \in [0, T], \\ (11c) & & G (z (t), \dots, z^{(s)} (t)) ⪯ 0, \ \forall t \in [0, T], \\ (11d) & & z (t) \in F, \ \forall t \in [0, T], \\ (11e) & & z^{[s - 1]} (0) = {\bar{z}}_{o}, \ z^{[s - 1]} (T) = {\bar{z}}_{f}, \end{array}

where $W \in R^{m \times m}$ is a positive diagonal matrix, $ρ : [0, \infty) \mapsto [0, \infty]$ the time regularization, ${\bar{z}}_{o} \in R^{m s}$ the initial condition and ${\bar{z}}_{f} \in R^{m s}$ the terminal condition. The control input $v$ is allowed to be discontinuous in a finite number of time instants.

The trajectory optimization $(11)$ is nontrivial because of the continuous-time constraints $G$ and the nonconvex set $F$ . We further specify some reasonable conditions to make it a well-defined problem. As for time regularization $ρ$ , it trades off between the control effort and the expectation of total time,

\begin{matrix} (12) & ρ_{s} (T) = \sum_{i = 0}^{M_{T}} b_{i} T^{i}, \end{matrix}

where $b_{M_{T}}$ is positive. Common choices are $ρ_{s} (T) = k_{ρ} T$ and $ρ_{s} (T) = k_{ρ} (T - T_{Σ})^{2}$ with an expected time $T_{Σ}$ . Besides, $ρ$ can also be defined to strictly fix the total time:

\begin{matrix} (13) & ρ_{f} (T) = \begin{cases} 0 & \text{if } T = T_{Σ}, \\ \infty & \text{if } T \neq T_{Σ} . \end{cases} \end{matrix}
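The two kinds of time regularization can be sketched as follows. This is a minimal illustration, not code from the paper: the function names are hypothetical, and the coefficient list `b` is assumed to store $b_{0}, \dots, b_{M_{T}}$ in ascending order.

```python
import math

def rho_polynomial(b, T):
    """Evaluate rho_s(T) = sum_i b[i] * T**i from (12) with Horner's rule."""
    if b[-1] <= 0:
        raise ValueError("the leading coefficient b_{M_T} must be positive")
    value = 0.0
    for coeff in reversed(b):
        value = value * T + coeff
    return value

def rho_linear(k_rho):
    """Coefficients of rho_s(T) = k_rho * T."""
    return [0.0, k_rho]

def rho_quadratic(k_rho, T_expected):
    """Coefficients of rho_s(T) = k_rho * (T - T_expected)**2, expanded."""
    return [k_rho * T_expected ** 2, -2.0 * k_rho * T_expected, k_rho]

def rho_fixed(T, T_expected):
    """rho_f(T) from (13): zero at the prescribed duration, infinite elsewhere."""
    return 0.0 if T == T_expected else math.inf
```

Both common choices of $ρ_{s}$ reduce to coefficient lists for the same evaluator, while $ρ_{f}$ acts as an indicator that pins $T$ to $T_{Σ}$.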

As for nonlinear constraints $G$ , they are required to be $C^{2}$ , i.e., twice continuously differentiable. As for the feasible region $F$ in the configuration space, we approximate it geometrically by the union of $M_{P}$ closed convex sets as

\begin{matrix} (14) & F ≃ \tilde{F} = ⋃_{i = 1}^{M_{P}} P_{i} . \end{matrix}

For simplicity, locally sequential connection is assumed on these convex sets:

\begin{matrix} (15) & \begin{cases} P_{i} \cap P_{j} = \emptyset & \text{if } | i - j | = 2, \\ Int (P_{i} \cap P_{j}) \neq \emptyset & \text{if } | i - j | ⩽ 1, \end{cases} \end{matrix}

where $Int (\cdot)$ means the interior of a set. The translation of ${\bar{z}}_{o}$ and ${\bar{z}}_{f}$ is inscribed in $P_{1}$ and $P_{M_{P}}$ , respectively. As for $\tilde{F}$ , we consider the case that each $P_{i}$ is a closed $m$ -dimensional ball:

\begin{matrix} (16) & P_{i}^{B} = \{ x \in R^{m} \mid ∥ x - o_{i} ∥_{2} ⩽ r_{i} \}, \end{matrix}

or, more generally, a bounded convex polytope described by its $H$ -representation with potentially redundant constraints:

\begin{matrix} (17) & P_{i}^{H} = \{ x \in R^{m} \mid A_{i} x ⪯ b_{i} \} . \end{matrix}
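Membership in the sets $(14)$, $(16)$ and $(17)$ can be checked with a few lines of Python. The sketch below is only illustrative: the function names and the optional tolerance `tol` are assumptions introduced here, and points are plain sequences of floats.

```python
import math

def in_ball(x, center, radius):
    """Membership test for the closed ball P_i^B in (16)."""
    return math.dist(x, center) <= radius

def in_polytope(x, A, b, tol=0.0):
    """Membership test for the H-representation P_i^H = {x | A x <= b} in (17)."""
    return all(
        sum(a_k * x_k for a_k, x_k in zip(row, x)) <= b_j + tol
        for row, b_j in zip(A, b)
    )

def in_union(x, sets):
    """Membership in the union (14): x lies in at least one convex set P_i.

    sets is a list of callables, each mapping a point to True or False.
    """
    return any(contains(x) for contains in sets)
```

A unit square, for instance, is the polytope with rows `[1, 0], [-1, 0], [0, 1], [0, -1]` and right-hand side `[1, 0, 1, 0]`; redundant rows are harmless, as noted for $(17)$.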

For the optimization in $(11)$ , we aim to construct a computationally efficient solver while retaining the flexibility to handle different task-specific constraints $G_{D}$ in $(8)$ .

2 Multi-Stage Control Effort Minimization

In this section, we analyze the multi-stage control effort minimization without functional constraints.

For this problem, we propose easy-to-use optimality conditions for general cases, which are proved to be necessary and sufficient. Leveraging our conditions, the optimal trajectory is directly constructed in linear complexity of time and space, without evaluating the cost functional explicitly or implicitly.

Based on them, a novel trajectory class along with linear-complexity spatial-temporal deformation is designed to meet user-defined objectives in various trajectory planning scenarios.

2.1 Unconstrained Control Effort Minimization

When constraint $F$ exists, adjusting the waypoints or control points of a trajectory helps to ensure safety. When constraint $G$ exists, adjusting the time allocation also helps to enforce physical limits. Therefore, spatial and temporal parameters are both vital to a flexible trajectory representation. A natural problem is to generate a smooth trajectory subject to these parameters.

We solve Linear Quadratic Minimum-Time (LQMT) problems to generate trajectories from spatial-temporal parameters. Although the LQMT problems have extensive studies and applications, only single-stage problems are considered in the literature. We study the multi-stage problems where intermediate points and time vector are fixed in advance for multi-piece trajectories. Consider an $M$ -stage control effort minimization without $F$ and $G$ ,

\begin{array}{lll} (18a) & \min_{z (t)} & \int_{t_{0}}^{t_{M}} v (t)^{⊤} W v (t) \, d t, \\ (18b) & \text{s.t.} & z^{(s)} (t) = v (t), \ \forall t \in [t_{0}, t_{M}], \\ (18c) & & z^{[s - 1]} (t_{0}) = {\bar{z}}_{o}, \ z^{[s - 1]} (t_{M}) = {\bar{z}}_{f}, \\ (18d) & & z^{[d_{i} - 1]} (t_{i}) = {\bar{z}}_{i}, \ 1 ⩽ i < M, \\ (18e) & & t_{i - 1} < t_{i}, \ 1 ⩽ i ⩽ M . \end{array}

The time interval $[t_{0}, t_{M}]$ is split into $M$ stages by $M + 1$ fixed timestamps, with constant boundary conditions ${\bar{z}}_{o}, {\bar{z}}_{f} \in R^{m s}$. Intermediate conditions ${\bar{z}}_{i} \in R^{m d_{i}}$ with $d_{i} ⩽ s$ specify the values of $z (t_{i}), \dot{z} (t_{i}), \dots, z^{(d_{i} - 1)} (t_{i})$, where $d_{i}$ is the number of derivatives fixed at $t_{i}$. For example, if $z (t)$ is only required to pass a given position at $t_{i}$, then $d_{i} = 1$ because ${\bar{z}}_{i}$ contains the zeroth-order derivative and nothing else.

Existing works focus on the necessary conditions for special cases of $(18)$. In the aerial robotics area, the QP formulation and the closed-form one implicitly or explicitly optimize unknown knot derivatives, taking the parameterization as given a priori. This extra computation makes them less efficient.

In the control area, a special case where $d_{i} = 1$ has also been studied via the controllability Gramian. The result holds for general linear systems with possibly nonpolynomial solutions, but it is less intuitive from a computational standpoint. These necessary conditions can cause degeneracy in the trajectory representation and in its sensitivity if further parametric optimization on spatial-temporal parameters is needed.

2.2 Optimality Conditions

We propose necessary and sufficient optimality conditions for $(18)$ with all possible settings of $d_{i}$ , ${\bar{z}}_{i}$ , and $t_{i}$ . Thus, an optimal trajectory can be directly constructed from spatial-temporal parameters. Furthermore, the existence and uniqueness of the optimal trajectory are always guaranteed.

We transform $(18)$ into the Mayer form in which a new state $y \in R^{m s + 1}$ augmented by $\tilde{y} \in R$ is defined as

\begin{matrix} (19) & y = (\begin{matrix} z^{[s - 1]} \\ \tilde{y} \end{matrix}) . \end{matrix}

The augmented system $\dot{y} = \hat{f} (y, v)$ has the structure

\begin{matrix} (20) & \dot{y} = (\begin{matrix} \bar{A} & 0 \\ 0^{⊤} & 0 \end{matrix}) y + (\begin{matrix} 0 \\ v \\ v^{⊤} W v \end{matrix}), \end{matrix}

where

\begin{matrix} (21) & \bar{A} = (\begin{matrix} 0 & I_{m (s - 1)} \\ 0_{m \times m} & 0^{⊤} \end{matrix}) \in R^{m s \times m s} . \end{matrix}

We design a running process for the augmented system in $M$ stages, of which the $i$ -th is $Δ_{i} = [t_{i - 1}, t_{i}]$ . It is worth noting that state switching occurs in this running process. Strictly speaking, the state switching only occurs on $\tilde{y}$ at the beginning of each stage.

Denote by $y_{[i]} : Δ_{i} \mapsto R^{m s + 1}$ the augmented state trajectory in the $i$ -th stage, which consists of two parts, $z_{[i]}^{[s - 1]}$ and ${\tilde{y}}_{[i]}$ .

At each timestamp $t_{i}$ , the state transfers from $y_{[i]}$ to $y_{[i + 1]}$ , and the part $\tilde{y}$ is reset as

\begin{matrix} (22) & {\tilde{y}}_{[i + 1]} (t_{i}) = 0, 0 ⩽ i < M, \end{matrix}

thus switching the partial state from ${\tilde{y}}_{[i]} (t_{i})$ to $0$ .

The $z^{[s - 1]}$ part remains continuous between stages, which means

\begin{matrix} (23) & z_{[i]}^{[s - 1]} (t_{i}) = z_{[i + 1]}^{[s - 1]} (t_{i}), 1 ⩽ i < M . \end{matrix}

The conditions in $(18 c)$ and $(18 d)$ are still satisfied, i.e.,

\begin{aligned} (24) & z_{[1]}^{[s - 1]} (t_{0}) = {\bar{z}}_{o}, z_{[M]}^{[s - 1]} (t_{M}) = {\bar{z}}_{f}, \\ (25) & z_{[i]}^{[d_{i} - 1]} (t_{i}) = {\bar{z}}_{i}, 1 ⩽ i < M . \end{aligned}

In this process, the cost functional in $(18)$ is converted into the sum of terminal cost of each stage for the augmented system, i.e., $\sum_{i = 1}^{M} {\tilde{y}}_{[i]} (t_{i})$ . Therefore, the optimal trajectories for the augmented system and the original one are identical in $z^{[s - 1]}$ .

We utilize the Hybrid Maximum Principle to derive necessary conditions for the optimal solution.

Theorem 1: Hybrid Maximum Principle

Let $t_{0} < \dots < t_{M}$ be real numbers and $Δ_{k} = [t_{k - 1}, t_{k}]$. For any collection of absolutely continuous functions $x_{k} : Δ_{k} \mapsto R^{n_{k}}$, define a vector $x_{Σ} \in R^{\bar{n}}$, where $\bar{n} = 2 \sum_{k = 1}^{M} n_{k}$, as

\begin{matrix} (26) & x_{Σ} = {(x_{1}^{⊤} (t_{0}), x_{1}^{⊤} (t_{1}), \dots, x_{M}^{⊤} (t_{M - 1}), x_{M}^{⊤} (t_{M}))}^{⊤} . \end{matrix}

On the time interval $Δ = [t_{0}, t_{M}]$ consider the problem

\begin{array}{lll} (27a) & \min_{u_{k}, x_{k}} & J (x_{Σ}), \\ (27b) & \text{s.t.} & {\dot{x}}_{k} (t) = f_{k} (x_{k} (t), u_{k} (t)), \\ (27c) & & u_{k} (t) \in U_{k} \subseteq R^{r_{k}}, \\ (27d) & & \forall t \in Δ_{k}, \ k = 1, \dots, M, \\ (27e) & & η (x_{Σ}) = 0, \end{array}

where $f_{k} : R^{n_{k}} \times R^{r_{k}} \mapsto R^{n_{k}}$ , $J : R^{\bar{n}} \mapsto R$ and $η : R^{\bar{n}} \mapsto R^{q}$ are continuously differentiable, $u_{k} : R \mapsto R^{r_{k}}$ are measurable and bounded on the corresponding $Δ_{k}$ .

Denote an optimal process for $(27)$ by $(x^{*} (t), u^{*} (t))$ . Then, there exists a collection $(α, γ, ψ_{1}, \dots, ψ_{M})$ , where $α ⩾ 0$ , $γ \in R^{q}$ and $ψ_{k} : Δ_{k} \mapsto R^{n_{k}}$ are Lipschitz continuous. It generates $M$ Pontryagin functions

\begin{matrix} (28) & H_{k} (ψ_{k}, x_{k}, u_{k}) = ψ_{k}^{⊤} f_{k} (x_{k}, u_{k}), t \in Δ_{k}, \end{matrix}

and a Lagrange function $L (x_{Σ}) = α J (x_{Σ}) + γ^{⊤} η (x_{Σ})$ . The following conditions are satisfied for all $k = 1, \dots, M$ .

Nontriviality condition:

\begin{matrix} (29) & (α, γ^{⊤}) \neq 0; \end{matrix}

Adjoint equations: for almost all $t \in Δ_{k}$ ,

\begin{matrix} (30) & {\dot{ψ}}_{k} (t) = - \frac{\partial H_{k}}{\partial x_{k}} (ψ_{k} (t), x_{k}^{*} (t), u_{k}^{*} (t)); \end{matrix}

Transversality conditions:

\begin{matrix} (31) & \begin{matrix} ψ_{k} (t_{k - 1}) = L_{x_{k} (t_{k - 1})} (x_{Σ}^{*}), \\ ψ_{k} (t_{k}) = - L_{x_{k} (t_{k})} (x_{Σ}^{*}); \end{matrix} \end{matrix}

Maximality conditions: for all $t \in Δ_{k}$ ,

\begin{matrix} (32) & H_{k} (ψ_{k} (t), x_{k}^{*} (t), u_{k}^{*} (t)) = \sup_{u_{k} \in U_{k}} H_{k} (ψ_{k} (t), x_{k}^{*} (t), u_{k}) = 0 . \end{matrix}

Proof:

The proof can be directly adapted from Theorem 4 by Dmitruk and Kaganovich. Here we only consider each system $f_{k}$ to be time-invariant and all intervals $Δ_{k}$ to be fixed. Besides, no inequality constraints are specified on $x_{Σ}$ .

◻

According to Theorem 1, the costate $ψ_{[i]} : Δ_{i} \mapsto R^{m s + 1}$ in the $i$ -th stage is defined as

\begin{matrix} (33) & ψ_{[i]} = (\begin{matrix} λ_{[i]} \\ μ_{[i]} \end{matrix}) = {({λ_{[i]}}_{1}, {λ_{[i]}}_{2}, \dots, {λ_{[i]}}_{s}, μ_{[i]})}^{⊤}, \end{matrix}

where $μ_{[i]} : Δ_{i} \mapsto R$ and ${λ_{[i]}}_{j} : Δ_{i} \mapsto R^{m}$ is the $j$-th map in $λ_{[i]} : Δ_{i} \mapsto R^{m s}$. The $i$-th Pontryagin function of $(20)$ is

\begin{matrix} (34) & H_{i} (ψ_{[i]}, y_{[i]}, v_{[i]}) = ψ_{[i]}^{⊤} \hat{f} (y_{[i]}, v_{[i]}) = λ_{[i]}^{⊤} \bar{A} z_{[i]}^{[s - 1]} + {λ_{[i]}^{⊤}}_{s} v_{[i]} + μ_{[i]} v_{[i]}^{⊤} W v_{[i]} . \end{matrix}

By applying the adjoint equation $(30)$ for $μ_{[i]}$ , we have ${\dot{μ}}_{[i]} = 0$ , which means $μ_{[i]} (t) = {\bar{μ}}_{i} \in R$ is a constant in $Δ_{i}$ . Therefore, $H_{i}$ is always a quadratic function of $v_{[i]}$ ,

\begin{matrix} (35) & H_{i} (ψ_{[i]}, y_{[i]}, v_{[i]}) = λ_{[i]}^{⊤} \bar{A} z_{[i]}^{[s - 1]} + {λ_{[i]}^{⊤}}_{s} v_{[i]} + {\bar{μ}}_{i} v_{[i]}^{⊤} W v_{[i]} . \end{matrix}

By applying the adjoint equation for $λ_{[i]}$ , we obtain

\begin{matrix} (36) & {\dot{λ}}_{[i]} = - {\bar{A}}^{⊤} λ_{[i]}, \end{matrix}

which is expanded as

\begin{matrix} (37) & {{\dot{λ}}_{[i]}}_{j} = \begin{cases} 0 & \text{if } j = 1, \\ - {λ_{[i]}}_{j - 1} & \text{if } 2 ⩽ j ⩽ s . \end{cases} \end{matrix}

It is obvious that ${λ_{[i]}}_{s} (t)$ is an $s - 1$ degree polynomial.
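As a worked instance, take $s = 3$ and integrate $(37)$ from the first entry downward. With constant vectors $a, b, c \in R^{m}$ (symbols introduced here only for illustration), this gives

\begin{aligned} {λ_{[i]}}_{1} (t) & = a, \\ {λ_{[i]}}_{2} (t) & = - a t + b, \\ {λ_{[i]}}_{3} (t) & = \frac{1}{2} a t^{2} - b t + c, \end{aligned}

so ${λ_{[i]}}_{3}$ is a polynomial of degree at most $2 = s - 1$, and each further entry of the chain raises the degree by one.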

According to maximality conditions $(32)$ , the supremum of $H_{i}$ is always $0$ in $Δ_{i}$ . Thus the positive definiteness of $W$ implies ${\bar{μ}}_{i} ⩽ 0$ . If ${\bar{μ}}_{i} = 0$ , then $(35)$ becomes a linear function of $v_{[i]}$ . The zero supremum means that ${λ_{[i]}}_{s} (t) = 0$ in $Δ_{i}$ . As the result of $(36)$ , $ψ_{[i]} (t) = 0$ holds for all $t$ in $Δ_{i}$ . In such a case, a contradiction occurs that the nontriviality condition $(29)$ and the transversality conditions $(31)$ cannot be satisfied at the same time. Therefore, ${\bar{μ}}_{i} < 0$ always holds in the whole $Δ_{i}$ . The optimal control $v_{[i]}^{*}$ can be obtained from

\begin{matrix} (38) & \frac{\partial H_{i}}{\partial v_{[i]}} (ψ_{[i]}, y_{[i]}^{*}, v_{[i]}^{*}) = {λ_{[i]}}_{s} + 2 {\bar{μ}}_{i} W v_{[i]}^{*} = 0, \end{matrix}

i.e.,

\begin{matrix} (39) & v_{[i]}^{*} (t) = - \frac{1}{2 {\bar{μ}}_{i}} W^{- 1} {λ_{[i]}}_{s} (t), \forall t \in Δ_{i} . \end{matrix}

Then, $z_{[i]}^{*}$ produced by a chain of $s$ -integrators from ${λ_{[i]}}_{s} (t)$ , is a $2 s - 1$ degree polynomial.

To further explore structures of the solution, we generate the Lagrange function using the cost of augmented system along with all constraints in $(23)$ , $(24)$ and $(25)$ . We have

\begin{array}{ll} (40) & L (y_{Σ}) = α \sum_{i = 1}^{M} {\tilde{y}}_{[i]} (t_{i}) + \sum_{i = 0}^{M - 1} γ_{i} {\tilde{y}}_{[i + 1]} (t_{i}) \\ & \quad + \sum_{i = 1}^{M - 1} (ζ_{i}^{⊤}, σ_{i}^{⊤}) (z_{[i]}^{[s - 1]} (t_{i}) - z_{[i + 1]}^{[s - 1]} (t_{i})) \\ & \quad + θ_{o}^{⊤} (z_{[1]}^{[s - 1]} (t_{0}) - {\bar{z}}_{o}) + θ_{f}^{⊤} (z_{[M]}^{[s - 1]} (t_{M}) - {\bar{z}}_{f}) \\ & \quad + \sum_{i = 1}^{M - 1} θ_{i}^{⊤} (z_{[i]}^{[d_{i} - 1]} (t_{i}) - {\bar{z}}_{i}), \end{array}

where $γ_{i} \in R$, $ζ_{i} \in R^{m d_{i}}$, $σ_{i} \in R^{m (s - d_{i})}$, $θ_{o} \in R^{m s}$, $θ_{f} \in R^{m s}$ and $θ_{i} \in R^{m d_{i}}$ are all constant coefficients of the corresponding constraints, and $y_{Σ}$ is defined as in $(26)$. Following the transversality conditions $(31)$, taking the derivative of $L$ w.r.t. $y_{Σ}$ gives the boundary values of the costates $ψ_{[i]}$ and $ψ_{[i + 1]}$, i.e.,

\begin{matrix} (41) & λ_{[i]} (t_{i}) = - (\begin{matrix} ζ_{i} + θ_{i} \\ σ_{i} \end{matrix}), λ_{[i + 1]} (t_{i}) = - (\begin{matrix} ζ_{i} \\ σ_{i} \end{matrix}), \\ (42) & μ_{[i]} (t_{i}) = μ_{[i + 1]} (t_{i + 1}) = - α . \end{matrix}

Because $μ_{[i + 1]} (t) = {\bar{μ}}_{i + 1}$ in $Δ_{i + 1}$ , we have

\begin{matrix} (43) & {\bar{μ}}_{i} = - α, 1 ⩽ i ⩽ M . \end{matrix}

Finally, by substituting $(36)$ , $(41)$ and $(43)$ into $(39)$ , we obtain that the optimal controls of two consecutive stages satisfy

\begin{matrix} (44) & {v^{*}}_{[i]}^{(j)} (t_{i}) = {v^{*}}_{[i + 1]}^{(j)} (t_{i}), 0 ⩽ j < (s - d_{i}) . \end{matrix}

Hence the optimal control of problem $(18)$ is $s - d_{i} - 1$ times continuously differentiable at the timestamp $t_{i}$. Accordingly, the optimal state trajectory, consisting of $M$ polynomial pieces of degree $2 s - 1$, is $2 s - d_{i} - 1$ times continuously differentiable at $t_{i}$.

We now summarize the conditions derived from both $(39)$ and $(44)$ in the following theorem, and they are proved to be necessary and sufficient optimality conditions of $(18)$.

Theorem 2: Optimality Conditions

A trajectory, denoted by $z^{*} (t) : [t_{0}, t_{M}] \mapsto R^{m}$ , is optimal for the problem $(18)$ , if and only if the following conditions are satisfied:

The map $z^{*} (t) : [t_{i - 1}, t_{i}] \mapsto R^{m}$ is parameterized as a $2 s - 1$ degree polynomial for any $1 ⩽ i ⩽ M$ ;
The boundary conditions in $(18 c)$ are satisfied;
The intermediate conditions in $(18 d)$ are satisfied;
$z^{*} (t)$ is ${\bar{d}}_{i} - 1$ times continuously differentiable at $t_{i}$ for any $1 ⩽ i < M$ where ${\bar{d}}_{i} = 2 s - d_{i}$ .

Moreover, a unique trajectory exists for these conditions.

Proof (Sketch):

The proof of necessity is evident in the analyses from $(33)$ to $(44)$ that are directly derived from Theorem 1. The proof of sufficiency is sketched below:

The first and fourth conditions always determine a linear spline space of dimension $2 s + \sum_{i = 1}^{M - 1} d_{i}$ for any sequence of $d_{i}$ ;
The second and third conditions are shown to form a square coefficient matrix on a basis of the spline space;
The matrix is proved to be nonsingular since $t_{i - 1} < t_{i}$ for each $i$ , implying the existence and uniqueness of solution;
The existence and uniqueness for the necessary conditions yield their sufficiency. The proof of sufficiency is detailed in the appendix of the original paper [1].

To further explain the optimality conditions, we take the multi-stage jerk minimization as an example.

In this example, the position, velocity and acceleration are states of the jerk-controlled system ( $s = 3$ ). There are intermediate points ( $d_{i} = 1$ ) that the trajectory should pass through at certain timestamps.

The continuity of state only requires the continuity up to acceleration of the minimum-jerk trajectory, while jerk and snap of the optimal trajectory are also continuous everywhere. Accordingly, if we enforce all these continuity conditions, then Theorem 2 guarantees that only one trajectory exists, which is exactly the optimal one.
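The counting behind Theorem 2 and this example can be reproduced with a short sketch. The function name `minco_structure` and the dictionary keys are hypothetical; the formulas are the ones stated above (degree $2 s - 1$, ${\bar{d}}_{i} = 2 s - d_{i}$, and a spline space of dimension $2 s + \sum_{i} d_{i}$ per output dimension).

```python
def minco_structure(s, d):
    """Summarize the structure implied by Theorem 2 for one output dimension.

    s: order of the integrator chain.
    d: list with d[i - 1] = d_i, the number of derivatives fixed at t_i, 1 <= i < M.
    """
    if any(d_i > s for d_i in d):
        raise ValueError("each d_i must satisfy d_i <= s")
    M = len(d) + 1                              # number of polynomial pieces
    coefficients = 2 * s * M                    # 2s coefficients per piece
    d_bar = [2 * s - d_i for d_i in d]          # continuity equations at each t_i
    return {
        "degree": 2 * s - 1,
        "continuity_order": [n - 1 for n in d_bar],
        "spline_dimension": coefficients - sum(d_bar),
        "num_conditions": 2 * s + sum(d),       # boundary plus intermediate values
    }
```

For the jerk example with two waypoints, `minco_structure(3, [1, 1])` reports quintic pieces that are continuous up to the fourth derivative (snap), and the spline dimension equals the number of boundary and intermediate conditions, which is why the coefficient matrix in the proof sketch is square.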

2.3 Minimization Without Cost Functional

Theorem 2 provides a direct way to construct the unique optimal trajectory. The computation enjoys linear complexity in time and space. It does not even require explicit or implicit evaluation of the cost functional or its gradient.

Consider an $m$ -dimensional trajectory whose $i$ -th piece is denoted by an $N = 2 s - 1$ degree polynomial:

\begin{matrix} (45) & p_{i} (t) = c_{i}^{⊤} β (t - t_{i - 1}), t \in [t_{i - 1}, t_{i}], \end{matrix}

where $β (x) = {(1, x, \dots, x^{N})}^{⊤}$ is the basis and $c_{i} \in R^{2 s \times m}$ the coefficients. For simplicity, we use the timeline relative to $t_{0} = 0$ . The trajectory is described by a coefficient matrix $c \in R^{2 M s \times m}$ and a time vector $T \in R_{> 0}^{M}$ defined by

\begin{matrix} (46) & c = {(c_{1}^{⊤}, \dots, c_{M}^{⊤})}^{⊤}, T = {(T_{1}, \dots, T_{M})}^{⊤}, \end{matrix}

where $T_{i}$ means the duration of the $i$ -th piece. Then we have the timestamp $t_{i} = \sum_{j = 1}^{i} T_{j}$ and the total duration $T = ∥ T ∥_{1}$ . The $M$ -piece trajectory $p : [0, T] \mapsto R^{m}$ is defined by

\begin{matrix} (47) & p (t) = p_{i} (t), \forall t \in [t_{i - 1}, t_{i}), \forall i \in \{ 1, \dots, M \} . \end{matrix}
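Evaluating the piecewise polynomial of $(45)$ to $(47)$ can be sketched as below. The function name and the nested-list layout of `c` are assumptions made for illustration; the final time $t = T$ is assigned to the last piece, since $(47)$ leaves it uncovered.

```python
from bisect import bisect_right
from itertools import accumulate

def evaluate_trajectory(c, T, t):
    """Evaluate the M-piece trajectory p(t) of (45)-(47).

    c: list of M coefficient blocks; c[i][k] is the length-m coefficient of
       (t - t_i)**k in piece i (0-based), so each block has 2s rows.
    T: list of M positive piece durations.
    t: query time in [0, sum(T)].
    """
    stamps = [0.0] + list(accumulate(T))        # t_0, t_1, ..., t_M
    if not stamps[0] <= t <= stamps[-1]:
        raise ValueError("t is outside the trajectory duration")
    # Piece i covers [t_{i-1}, t_i); the final time goes to the last piece.
    i = min(bisect_right(stamps, t) - 1, len(T) - 1)
    tau = t - stamps[i]
    point = [0.0] * len(c[i][0])
    for row in reversed(c[i]):                  # Horner's rule on beta(tau)
        point = [p * tau + a for p, a in zip(point, row)]
    return point
```

Using relative time `tau` inside each piece mirrors the argument $t - t_{i - 1}$ of the basis in $(45)$.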

To compute the unique solution for $(18)$ , we directly enforce optimality conditions on the coefficient matrix $c$ . Denote by $D_{0}, D_{M} \in R^{s \times m}$ and $D_{i} \in R^{d_{i} \times m}$ the specified derivatives at boundaries and intermediate timestamp $t_{i}$ , respectively. Each column of $D_{i}$ is related to one dimension. Then, conditions at $t_{i}$ are formulated by $E_{i}, F_{i} \in R^{2 s \times 2 s}$ :

\begin{matrix} (48) & (\begin{matrix} E_{i} & F_{i} \end{matrix}) (\begin{matrix} c_{i} \\ c_{i + 1} \end{matrix}) = (\begin{matrix} D_{i} \\ 0_{{\bar{d}}_{i} \times m} \end{matrix}), \end{matrix}

\begin{array}{rl} (49) & E_{i} = {(β (T_{i}), \dots, β^{(d_{i} - 1)} (T_{i}), β (T_{i}), \dots, β^{({\bar{d}}_{i} - 1)} (T_{i}))}^{⊤}, \\ (50) & F_{i} = {(0, - β (0), \dots, - β^{({\bar{d}}_{i} - 1)} (0))}^{⊤} . \end{array}

In particular, define $F_{0}, E_{M} \in R^{s \times 2 s}$ as

\begin{aligned} (51) & F_{0} = & {(β (0), \dots, β^{(s - 1)} (0))}^{⊤}, \\ (52) & E_{M} = & {(β (T_{M}), \dots, β^{(s - 1)} (T_{M}))}^{⊤} . \end{aligned}
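The blocks in $(49)$ to $(52)$ only need derivatives of the monomial basis, whose $n$-th entry differentiates to $\frac{n !}{(n - k) !} x^{n - k}$ for $n ⩾ k$ and to zero otherwise. The sketch below builds the blocks as nested lists; the function names are hypothetical, and the leading zero rows of $F_{i}$ correspond to the $d_{i}$ prescribed derivatives.

```python
from math import factorial

def beta_derivative(x, N, k=0):
    """k-th derivative of the monomial basis beta(x) = (1, x, ..., x**N)."""
    return [
        factorial(n) // factorial(n - k) * x ** (n - k) if n >= k else 0
        for n in range(N + 1)
    ]

def intermediate_blocks(T_i, s, d_i):
    """Rows of E_i and F_i in (49)-(50) for one intermediate timestamp."""
    N, d_bar = 2 * s - 1, 2 * s - d_i
    # First d_i rows: prescribed derivatives at the end of piece i.
    E = [beta_derivative(T_i, N, k) for k in range(d_i)]
    F = [[0] * (N + 1) for _ in range(d_i)]
    # Remaining d_bar rows: continuity of derivatives 0 .. d_bar - 1.
    E += [beta_derivative(T_i, N, k) for k in range(d_bar)]
    F += [[-v for v in beta_derivative(0, N, k)] for k in range(d_bar)]
    return E, F

def boundary_blocks(T_M, s):
    """F_0 and E_M in (51)-(52)."""
    N = 2 * s - 1
    F0 = [beta_derivative(0, N, k) for k in range(s)]
    EM = [beta_derivative(T_M, N, k) for k in range(s)]
    return F0, EM
```

Each of `E` and `F` has $d_{i} + {\bar{d}}_{i} = 2 s$ rows, matching the $2 s \times 2 s$ size stated for $E_{i}$ and $F_{i}$.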

The linear system for the optimal coefficient matrix is

\begin{matrix} (53) & M c = b \end{matrix}

where $M \in R^{2 M s \times 2 M s}$ and $b \in R^{2 M s \times m}$ are

\begin{matrix} (54) & M = (\begin{matrix} F_{0} & 0 & 0 & \dots & 0 \\ E_{1} & F_{1} & 0 & \dots & 0 \\ 0 & E_{2} & F_{2} & \dots & 0 \\ ⋮ & ⋮ & ⋮ & ⋱ & ⋮ \\ 0 & 0 & 0 & \dots & F_{M - 1} \\ 0 & 0 & 0 & \dots & E_{M} \end{matrix}), \end{matrix}

\begin{matrix} (55) & b = {(D_{0}^{⊤}, D_{1}^{⊤}, 0_{m \times {\bar{d}}_{1}}, \dots, D_{M - 1}^{⊤}, 0_{m \times {\bar{d}}_{M - 1}}, D_{M}^{⊤})}^{⊤} . \end{matrix}

It is essential that the uniqueness in Theorem 2 ensures the nonsingularity of $M$ for any time vector $T ≻ 0$ . Consequently, the unique solution $c$ can be obtained via linear equation system $(53)$ with a banded matrix $M$ , i.e., a banded system.

As for a nonsingular banded system, its Banded PLU Factorization always exists, which can be employed to compute the solution with $O (M)$ time and space complexity. Therefore, without the need of cost functional, the unique solution of problem $(18)$ is obtained in the lowest complexity, by directly applying our optimality conditions.
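To make the construction concrete, the sketch below assembles $(53)$ for a scalar output ($m = 1$) with position-only waypoints ($d_{i} = 1$) and solves it exactly with rational arithmetic. It is an illustration under stated assumptions: the function names are hypothetical, and the dense Gauss-Jordan solver is a simple stand-in that does not exploit the banded structure, so it does not reproduce the $O (M)$ complexity of a banded PLU factorization.

```python
from fractions import Fraction
from math import factorial

def beta_derivative(x, N, k):
    """k-th derivative of beta(x) = (1, x, ..., x**N), with exact arithmetic."""
    return [Fraction(factorial(n) // factorial(n - k)) * x ** (n - k) if n >= k
            else Fraction(0) for n in range(N + 1)]

def build_system(s, T, start, goal, waypoints):
    """Assemble M and b of (53)-(55) for m = 1 and d_i = 1.

    start, goal: length-s lists (value and derivatives at t_0 and t_M).
    waypoints:   list of M - 1 intermediate positions.
    """
    pieces, N, width = len(T), 2 * s - 1, 2 * s
    rows, rhs = [], []

    def add_row(piece_ids, values, target):
        row = [Fraction(0)] * (width * pieces)
        for p, vals in zip(piece_ids, values):
            row[p * width:(p + 1) * width] = vals
        rows.append(row)
        rhs.append(Fraction(target))

    for k in range(s):                                   # F_0 block
        add_row([0], [beta_derivative(Fraction(0), N, k)], start[k])
    for i in range(pieces - 1):                          # E_i and F_i blocks
        Ti = Fraction(T[i])
        add_row([i], [beta_derivative(Ti, N, 0)], waypoints[i])
        for k in range(width - 1):                       # continuity of order k
            minus = [-v for v in beta_derivative(Fraction(0), N, k)]
            add_row([i, i + 1], [beta_derivative(Ti, N, k), minus], 0)
    for k in range(s):                                   # E_M block
        add_row([pieces - 1], [beta_derivative(Fraction(T[-1]), N, k)], goal[k])
    return rows, rhs

def solve_dense(A, b):
    """Gauss-Jordan elimination with row pivoting (dense stand-in, not banded)."""
    n = len(A)
    aug = [row[:] + [val] for row, val in zip(A, b)]
    for col in range(n):
        pivot = next(r for r in range(col, n) if aug[r][col] != 0)
        aug[col], aug[pivot] = aug[pivot], aug[col]
        aug[col] = [v / aug[col][col] for v in aug[col]]
        for r in range(n):
            if r != col and aug[r][col] != 0:
                factor = aug[r][col]
                aug[r] = [v - factor * p for v, p in zip(aug[r], aug[col])]
    return [row[-1] for row in aug]
```

With one piece, `s = 3`, unit duration and rest-to-rest motion from 0 to 1, the solver returns the coefficients of the familiar quintic $10 t^{3} - 15 t^{4} + 6 t^{5}$, without ever evaluating the jerk cost.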

2.4 MINCO Trajectories With Spatial-Temporal Deformation

For multicopters, there are often task-specific requirements apart from feasibility, such as the perception quality in active SLAM or the occlusion rate in aerial videography. Meeting these user-defined requirements mostly calls for flexible and adaptive deformation of both the spatial and temporal profiles of a trajectory. Therefore, we select the intermediate points and the time vector as two salient parameters in $(18)$. Fortunately, the existence and uniqueness of the solution guarantee that its sensitivity to these parameters is smooth. An iterative procedure is then designed to conduct the spatial-temporal deformation with the lowest computational complexity per iteration.

We denote the intermediate points by $q = (q_{1}, \dots, q_{M - 1})$ where $q_{i} \in R^{m}$ is a specified zero-order derivative at $t_{i}$ . Denote by $T = {(T_{1}, \dots, T_{M})}^{⊤}$ the time vector where $T_{i} \in R_{> 0}$ . For any pair of $q$ and $T$ , Theorem 2 naturally determines a trajectory belonging to a class of control effort minimizers, named MINCO hereafter. The MINCO trajectory class, denoted by $T_{MINCO}$ , is defined as

\begin{aligned} T_{MINCO} = \{ & p (t) : [0, T] \mapsto R^{m} \mid c = c (q, T) \text{ determined} \\ & \text{by Theorem 2}, \ \forall q \in R^{m \times (M - 1)}, T \in R_{> 0}^{M} \} . \end{aligned}

The dimension $m$ , the system order $s$ , initial and terminal conditions are omitted here for brevity.

Intuitively, all trajectories in $T_{MINCO}$ are compactly parameterized by only $q$ and $T$ . Evaluating an element in $T_{MINCO}$ directly follows our linear-complexity formulation.

We denote any user-defined objective (or constraint) on a trajectory by a $C^{2}$ function $K (c, T)$ with available gradient. This objective on $T_{MINCO}$ can be computed as

\begin{matrix} (56) & W (q, T) = K (c (q, T), T) . \end{matrix}

To accomplish deformation of $T_{MINCO}$ , the function $W$ together with its gradient $\partial W / \partial q$ and $\partial W / \partial T$ are needed for a high-level optimizer to optimize the objective.

Obviously, evaluating $W$ shares the same complexity as evaluating any trajectory in $T_{MINCO}$ . The key procedure is to compute the gradient.

Now we give a linear-complexity scheme to compute $\partial W / \partial q$ and $\partial W / \partial T$ from the given $\partial K / \partial c$ and $\partial K / \partial T$ . We first rewrite the linear equation system $(53)$ as

\begin{matrix} (57) & M (T) c (q, T) = b (q) . \end{matrix}

Without causing ambiguity, we omit parameters in $M, b, c, K$ and $W$ temporarily for simplicity. Any notation involving $c$ is interpreted as $c (q, T)$ . Denote by $q_{i, j}$ the $j$ -th entry in $q_{i}$ .

As for the gradient w.r.t. $q$ , we first differentiate both sides of $(57)$ w.r.t. $q_{i, j}$ , which gives

\begin{matrix} (58) & \frac{\partial c}{\partial q_{i, j}} = M^{- 1} \frac{\partial b}{\partial q_{i, j}} . \end{matrix}

Then,

\begin{array}{rl} (59) & \frac{\partial W}{\partial q_{i, j}} = Tr \{ {(\frac{\partial c}{\partial q_{i, j}})}^{⊤} \frac{\partial K}{\partial c} \} \\ & \quad = Tr \{ {(M^{- 1} \frac{\partial b}{\partial q_{i, j}})}^{⊤} \frac{\partial K}{\partial c} \} \\ & \quad = Tr \{ {(\frac{\partial b}{\partial q_{i, j}})}^{⊤} (M^{- ⊤} \frac{\partial K}{\partial c}) \}, \end{array}

where $Tr (\cdot)$ is the trace operation.

The definition of $b (q)$ in $(55)$ implies that, with $d_{i} = 1$, $\partial b / \partial q_{i, j}$ has a single nonzero entry $1$ in row $(2 i - 1) s + 1$ and column $j$. Thus, stacking all the resultant scalars gives

\begin{matrix} (60) & \frac{\partial W}{\partial q_{i}} = {(M^{- ⊤} \frac{\partial K}{\partial c})}^{⊤} e_{(2 i - 1) s + 1}, \end{matrix}

where $e_{j}$ is the $j$-th column of $I_{2 M s}$. Since the banded PLU factorization of $M$ has already been computed when solving for $c$, we can reuse it to avoid inverting $M^{⊤}$. Define a matrix $G \in R^{2 M s \times m}$ as the solution of

\begin{matrix} (61) & M^{⊤} G = \frac{\partial K}{\partial c} . \end{matrix}

We only need to compute $G$ once to obtain $\partial W / \partial q_{i}$ for all $1 ⩽ i < M$ . Denote the factorization of $M$ as $M = P L U$ . Specifically, $L$ is a banded matrix with zero upper bandwidth and all-ones diagonal entries. $U$ is a banded matrix with zero lower bandwidth and nonzero diagonal entries because of the nonsingularity of $M$ . The pivoting matrix $P$ simply changes the row order of the operand, satisfying $P^{⊤} P = I$ . Consequently, the transpose also has a Banded LUP Factorization. Specifically, $M^{⊤} = \bar{L} \bar{U} P^{⊤}$ , where

\begin{matrix} (62) & \bar{L} = U^{⊤} {(U \circ I)}^{- 1}, \bar{U} = (I \circ U) L^{⊤}, \end{matrix}

where the inverse is only taken of a diagonal matrix and $\circ$ denotes the Hadamard product. Then, $G$ can also be computed in linear complexity through this factorization. For convenience, we partition $G$ into

\begin{matrix} (63) & G = {(G_{0}^{⊤}, G_{1}^{⊤}, \dots, G_{M - 1}^{⊤}, G_{M}^{⊤})}^{⊤} \end{matrix}

such that $G_{0}, G_{M} \in R^{s \times m}$ and $G_{i} \in R^{2 s \times m}$ for $1 ⩽ i < M$ . After that, the gradient of $W$ w.r.t. $q$ is determined as

\begin{matrix} (64) & \frac{\partial W}{\partial q} = (G_{1}^{⊤} e_{1}, \dots, G_{M - 1}^{⊤} e_{1}), \end{matrix}

where $e_{1}$ is the first column of $I_{2 s}$ . This operation takes out $M - 1$ specific columns in $G^{⊤}$ .
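
The factors in $(62)$ can be checked directly. As a short worked step, write $D = I \circ U$ for the diagonal part of $U$ (a symbol introduced here only for brevity). Transposing $M = P L U$ and inserting $D^{- 1} D$ gives

\begin{aligned} M^{⊤} & = U^{⊤} L^{⊤} P^{⊤} \\ & = (U^{⊤} D^{- 1}) (D L^{⊤}) P^{⊤} = \bar{L} \bar{U} P^{⊤} . \end{aligned}

Here $U^{⊤} D^{- 1}$ is lower triangular with all-ones diagonal and $D L^{⊤}$ is upper triangular with the diagonal of $U$ , and both keep the banded structure. Under this reading, $(61)$ can be solved by a forward substitution $\bar{L} Y = \partial K / \partial c$ , a backward substitution $\bar{U} Z = Y$ , and a final row permutation $G = P Z$ , each of which costs $O (M)$ for banded factors.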

As for the gradient w.r.t. $T$ , differentiating both sides of $(57)$ w.r.t. $T_{i}$ gives

\begin{matrix} (65) & \frac{\partial M}{\partial T_{i}} c + M \frac{\partial c}{\partial T_{i}} = 0 . \end{matrix}

Thus,

\begin{aligned} \frac{\partial W}{\partial T_{i}} & = \frac{\partial K}{\partial T_{i}} + Tr {{(\frac{\partial c}{\partial T_{i}})}^{⊤} \frac{\partial K}{\partial c}} \\ & = \frac{\partial K}{\partial T_{i}} - Tr {{(M^{- 1} \frac{\partial M}{\partial T_{i}} c)}^{⊤} \frac{\partial K}{\partial c}} \\ (66) & = \frac{\partial K}{\partial T_{i}} - Tr {G^{⊤} \frac{\partial M}{\partial T_{i}} c} . \end{aligned}

The banded structure of $M$ implies that

\begin{matrix} (67) & G^{⊤} \frac{\partial M}{\partial T_{i}} c = G_{i}^{⊤} \frac{\partial E_{i}}{\partial T_{i}} c_{i} . \end{matrix}

Then we obtain the gradient w.r.t. $T_{i}$ computed as

\begin{matrix} (68) & \frac{\partial W}{\partial T_{i}} = \frac{\partial K}{\partial T_{i}} - Tr {G_{i}^{⊤} \frac{\partial E_{i}}{\partial T_{i}} c_{i}}, \end{matrix}

where $\partial E_{i} / \partial T_{i}$ can be analytically derived from $(64)$ . Computing $(68)$ for every $1 ⩽ i ⩽ M$ gives $\partial W / \partial T$ .
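
To make the adjoint trick concrete, the following is a minimal sketch on a toy $2 \times 2$ system with $m = 1$ . The functions `M_of`, `dM_dT`, `b_of`, and `K` are hypothetical stand-ins chosen for illustration and are not the MINCO matrices; the dense Cramer solve replaces the banded factorization. The sketch only shows that a single transposed solve as in $(61)$ yields both $\partial W / \partial q$ and $\partial W / \partial T$ as in $(66)$ .

```python
def solve2(A, b):
    """Solve a 2x2 linear system A x = b by Cramer's rule."""
    det = A[0][0] * A[1][1] - A[0][1] * A[1][0]
    return [(b[0] * A[1][1] - A[0][1] * b[1]) / det,
            (A[0][0] * b[1] - A[1][0] * b[0]) / det]


def transpose2(A):
    return [[A[0][0], A[1][0]], [A[0][1], A[1][1]]]


# Hypothetical toy stand-ins for M(T), b(q) and K(c, T); not the MINCO matrices.
def M_of(T):
    return [[1.0, T], [T * T, 3.0]]


def dM_dT(T):
    return [[0.0, 1.0], [2.0 * T, 0.0]]


def b_of(q):
    return [q, 1.0]


def K(c, T):
    return T * (c[0] ** 2 + 2.0 * c[1] ** 2)


def W(q, T):
    """W(q, T) = K(c(q, T), T), with c obtained from M(T) c = b(q)."""
    return K(solve2(M_of(T), b_of(q)), T)


def grad_W(q, T):
    """Return (dW/dq, dW/dT) using one adjoint solve."""
    c = solve2(M_of(T), b_of(q))                  # forward solve M c = b
    dK_dc = [2.0 * T * c[0], 4.0 * T * c[1]]
    dK_dT = c[0] ** 2 + 2.0 * c[1] ** 2
    G = solve2(transpose2(M_of(T)), dK_dc)        # adjoint solve M^T G = dK/dc
    dW_dq = G[0]                                  # db/dq = e_1 selects one entry of G
    A = dM_dT(T)
    Ac = [A[0][0] * c[0] + A[0][1] * c[1],
          A[1][0] * c[0] + A[1][1] * c[1]]
    dW_dT = dK_dT - (G[0] * Ac[0] + G[1] * Ac[1])  # dK/dT - Tr{G^T (dM/dT) c}
    return dW_dq, dW_dT
```

Comparing `grad_W(0.4, 0.7)` against central finite differences of `W` is a quick way to check the signs and the transposes in such a derivation.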

Finally, we finish the computation of $\partial W / \partial q$ and $\partial W / \partial T$ . It can be verified from both $(64)$ and $(68)$ that the gradient propagation is also done in $O (M)$ complexity. As for $K$ , we make no assumption on its concrete form. In fact, the smoothness of $K$ is not even needed as long as the resultant $W$ is $C^{2}$ . In other words, the linear-complexity gradient propagation enjoys both efficiency and flexibility. By incorporating it into common optimizers, we can accomplish the spatial-temporal deformation of $T_{MINCO}$ for a wide range of planning purposes while maintaining the local smoothness of trajectories.

3 Geometrically Constrained Flight Trajectory Optimization

In this section, we provide a unified framework for flight trajectory optimization with different time regularization $ρ (T)$ , spatial constraints $\tilde{F}$ and continuous-time constraints $G$ . This framework indeed relaxes the original problem into $T_{MINCO}$ .

The spatial-temporal deformation is utilized to meet various feasibility requirements. Lightweight schemes are specially designed to eliminate geometrical constraints such that the trajectory can be freely deformed. For continuous-time constraints, a time integral penalty functional is proposed to ensure the feasibility without sacrificing the scalability.

Finally, our framework transforms the constrained trajectory optimization into a sparse unconstrained one which can be reliably solved.

3.1 Temporal Constraint Elimination

fig-9-2 — Figure 2[1]: Left: Domain of $J$ on an $M$-piece trajectory with total time fixed as $T_\Sigma$. The domain is indeed the relative interior of an $(M-1)$-simplex in $\mathbb{R}^M_{>0}$. Right: Contour of $\ln{J}$ with $M=3$. The function goes to infinity as the time vector approaches the boundary of the open domain in $\mathbb{R}^2_{>0}$.

Deforming MINCO relies on standard optimizers, which are often designed for Euclidean spaces. However, the trajectory definition and the cost functional $(11)$ both restrict the domain of $T$ to simple manifolds, on which frequent retractions are needed during optimization. We give explicit diffeomorphisms for $T$ such that surrogate variables are in Euclidean spaces. Thus, common efficient optimizers can be conveniently applied.

For polynomial splines, the control effort in $(11)$ is a function of $c$ and $T$ , denoted by $J_{c} (c, T)$ . Analytical expressions of $J_{c}$ , $\partial J_{c} / \partial c$ , and $\partial J_{c} / \partial T$ are available in the literature (see the original paper). Since trajectories in $T_{MINCO}$ are polynomial splines with coefficients determined by $c (q, T)$ , the cost functional of $(11)$ can be written as

\begin{matrix} (69) & J (q, T) = J_{q} (q, T) + ρ (∥ T ∥_{1}), \end{matrix}

where $J_{q}$ is defined as $J_{q} (q, T) = J_{c} (c (q, T), T)$ . Obviously, computing $J_{q}$ , $\partial J_{q} / \partial q$ , and $\partial J_{q} / \partial T$ from any provided $J_{c}$ , $\partial J_{c} / \partial c$ , and $\partial J_{c} / \partial T$ can be done in $O (M)$ complexity, as already shown in deformation of $T_{MINCO}$ .

It is natural to optimize $T$ via $\partial J / \partial T$ . However, $J_{q} (q, T)$ has its definition over $T \in R_{> 0}^{M}$ . It becomes unbounded when any $T_{i}$ approaches zero and no consecutively repeating points appear in $q$ . Besides, $ρ_{f}$ defined in $(13)$ further restricts the domain of $J$ to $\sum_{i = 1}^{M - 1} T_{i} < T_{Σ}$ , as shown in Fig. 2.

We use diffeomorphisms to eliminate constraints for $ρ_{f}$ and $ρ_{s}$ . Consider the domain of $ρ_{f}$ in $(13)$ ,

\begin{matrix} (70) & T_{f} = \{T \in R^{M} | ∥ T ∥_{1} = T_{Σ}, T ≻ 0\} . \end{matrix}

It is clear that $J (q, \cdot)$ is finite for a nontrivial $q$ if and only if $T \in RelInt (T_{f})$ , i.e., the relative interior of $T_{f}$ .

Proposition 1:

$T_{f}$ defined by $(70)$ is diffeomorphic to $R^{M - 1}$ . Denote by $τ = (τ_{1}, \dots, τ_{M - 1})$ an element in $R^{M - 1}$ . A $C^{\infty}$ diffeomorphism is given by the map below for $1 ⩽ i < M$ :

\begin{matrix} (71) & T_{i} = \frac{e^{τ_{i}}}{1 + \sum_{j = 1}^{M - 1} e^{τ_{j}}} T_{Σ}, T_{M} = T_{Σ} - \sum_{j = 1}^{M - 1} T_{j} . \end{matrix}

By exploiting the explicit diffeomorphism $(71)$ , we directly minimize the cost function $J$ over $R^{M - 1}$ via $τ$ , because the domain constraints are satisfied by default.

Optimizing $τ$ requires gradient propagation. We partition the gradient as $\partial J_{q} / \partial T = {(g_{a}^{⊤}, g_{b})}^{⊤}$ , where $g_{a} \in R^{M - 1}$ and $g_{b} \in R$ .

Differentiating the layer in $(71)$ yields the gradient of $J$ w.r.t. $τ$ ,

\begin{matrix} (72) & \frac{\partial J}{\partial τ} = T_{Σ} (\frac{(g_{a} - g_{b} 1) \circ e^{[τ]}}{1 + ∥ e^{[τ]} ∥_{1}} - \frac{(g_{a}^{⊤} e^{[τ]} - g_{b} ∥ e^{[τ]} ∥_{1}) e^{[τ]}}{{(1 + ∥ e^{[τ]} ∥_{1})}^{2}}), \end{matrix}

where $e^{[\cdot]}$ is the entry-wise exponential map, and $1$ an all-ones vector. If an initial guess $T$ is specified, the corresponding $τ$ can be computed via the inverse map of the diffeomorphism, given by $τ_{i} = \ln (T_{i} / T_{M})$ for $1 ⩽ i < M$ . As for $ρ_{s}$ in $(12)$ , only $T ≻ 0$ needs to be ensured. It suffices to use $T = e^{[τ]}$ as the diffeomorphism between $R^{M}$ and $R_{> 0}^{M}$ .
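
The fixed-total-time map, its inverse, and the gradient pull-back can be sketched as follows. This is an illustrative sketch in plain Python, not the authors' implementation; the function names are hypothetical. The gradient carries the factor $T_{Σ}$ that the chain rule through $(71)$ produces, and no overflow protection for large $τ_{i}$ is included.

```python
import math


def tau_to_T(tau, T_sum):
    """Map tau in R^(M-1) to durations T in R^M with sum(T) = T_sum, as in (71)."""
    e = [math.exp(t) for t in tau]
    s = 1.0 + sum(e)
    T = [T_sum * ei / s for ei in e]
    T.append(T_sum / s)  # equals T_sum - sum of the first M-1 durations
    return T


def T_to_tau(T):
    """Inverse map tau_i = ln(T_i / T_M) for an initial guess T."""
    return [math.log(Ti / T[-1]) for Ti in T[:-1]]


def grad_tau(tau, T_sum, g):
    """Pull g = dJ/dT (length M) back to dJ/dtau (length M-1)."""
    g_a, g_b = g[:-1], g[-1]
    e = [math.exp(t) for t in tau]
    n1 = sum(e)                     # 1-norm of the entry-wise exponential
    s = 1.0 + n1
    dot = sum(ga * ei for ga, ei in zip(g_a, e))
    return [T_sum * ((ga - g_b) * ei / s - (dot - g_b * n1) * ei / s ** 2)
            for ga, ei in zip(g_a, e)]
```

For the free-total-time case the layer reduces to $T = e^{[τ]}$ , whose pull-back is the entry-wise product of $\partial J / \partial T$ with $e^{[τ]}$ .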

For either $ρ_{f}$ or $ρ_{s}$ , we denote the diffeomorphism by $T (τ)$ . Unconstrained optimization on $τ$ can be directly conducted to minimize $J (q, T (τ))$ . Although $T (τ)$ does not preserve convexity, the original cost $J (q, T)$ is already nonconvex as given in $(57)$ .

Thus, the only concern is whether $T (τ)$ brings new local minima in the space of $τ$ or eliminates local minima in the space of $T$ .

Proposition 2:

Denote by $F : D_{F} \mapsto R$ any $C^{2}$ function with a convex open domain $D_{F} \subseteq R^{N}$ . Given any $C^{2}$ diffeomorphism $G : R^{N} \mapsto D_{F}$ , define $H : R^{N} \mapsto R$ as $H (y) = F (G (y))$ for $y \in R^{N}$ . For any $x \in D_{F}$ and $y \in R^{N}$ satisfying $x = G (y)$ or equivalently $y = G^{- 1} (x)$ , the following statements always hold:

$\nabla F (x) = 0$ if and only if $\nabla H (y) = 0$ ;
$\nabla^{2} F (x)$ is positive-definite (or positive-semidefinite) at $\nabla F (x) = 0$ , if and only if $\nabla^{2} H (y)$ is positive-definite (or positive-semidefinite) at $\nabla H (y) = 0$ .

Proof: See Appendix in the original paper.

Proposition 2 confirms that $T (τ)$ preserves the first/second-order necessary optimality conditions and second-order sufficient optimality conditions. The exponential map in this subsection can also be replaced by any $C^{2}$ diffeomorphism from $R$ to $R_{> 0}$ for better numerical conditioning. In the sense of commonly-used optimality conditions, our constraint elimination neither produces extra spurious local minima nor cancels any existing one.
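
The mechanism behind Proposition 2 can be sketched with the chain rule; this is only an outline, and the complete proof is in the original paper. Let $J_{G} (y)$ denote the Jacobian of $G$ at $y$ (a symbol introduced here), which is nonsingular because $G$ is a diffeomorphism, and let $G_{k}$ be the $k$ -th component of $G$ . Then

\begin{aligned} \nabla H (y) & = {J_{G} (y)}^{⊤} \nabla F (x), \\ \nabla^{2} H (y) & = {J_{G} (y)}^{⊤} \nabla^{2} F (x) J_{G} (y) + \sum_{k = 1}^{N} \frac{\partial F}{\partial x_{k}} (x) \nabla^{2} G_{k} (y) . \end{aligned}

Since $J_{G} (y)$ is nonsingular, $\nabla H (y) = 0$ holds exactly when $\nabla F (x) = 0$ . At such a point the sum vanishes, so $\nabla^{2} H (y)$ is congruent to $\nabla^{2} F (x)$ , and congruence preserves positive (semi)definiteness. As a one-dimensional check, take $F (x) = (x - 2)^{2}$ on $R_{> 0}$ and $G (y) = e^{y}$ : then $H (y) = (e^{y} - 2)^{2}$ has its only critical point at $y = \ln 2$ , where $H'' (y) = 2 e^{2 y} = 8 > 0$ , matching the minimum of $F$ at $x = 2$ .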

3.2 Spherical Spatial Constraint Elimination

fig-9-3 — Figure 3[1]: Inverse stereographic projection $f_s$ maps the Euclidean space $\mathbb{R}^n$ onto a sphere without north pole $\mathcal{S}^n_\odot$ in an $(n+1)$-dimensional space. The orthographic projection $f_o$ maps $\mathcal{S}^n_\odot$ onto an $n$-dimensional ball $\mathcal{B}^n$. The variable $\xi$ moves freely in $\mathbb{R}^n$ while the transformed variable $q$ stays in $\mathcal{B}^n$. Optimization on $\xi$ becomes unconstrained when $q$ is constrained by a ball.

We enforce motion safety by confining trajectories into the feasible region $\tilde{F}$ . Although $\tilde{F}$ is nonconvex, it is a union of convex primitives that are sequentially connected. If all pieces have been assigned into these primitives, the safety constraint on each piece becomes convex and thus can be conveniently encoded in $G$ . Owing to the feature of MINCO, the traverse time for every primitive can be directly optimized. Thus, we fix the piece assignment before optimization, rather than resorting to integer variables during optimization. Consequently, intermediate points should be contained by the overlap between primitives, forming inequalities. For Inequality Constrained Problems (ICPs), general methods successively approximate the constraints via additional parameters. However, we aim to apply the constraints directly and efficiently. Therefore, we propose spatial constraint elimination to enforce them exactly, leveraging their geometrical properties.

Consider the constraint $q \in P \subset R^{n}$ where $P$ is a closed convex set. Its dimension satisfies $n ⩽ m$ since a low-dimensional constraint also exists in $R^{m}$ . If $P$ is a closed ball $P^{B}$ centered at point $o$ with radius $r$ ,

\begin{matrix} (73) & P^{B} = \{x \in R^{n} | ∥ x - o ∥_{2} ⩽ r\}, \end{matrix}

we utilize a smooth surjection to map $R^{n}$ to $P^{B}$ such that optimization over $R^{n}$ implicitly satisfies the constraint $P^{B}$ . As illustrated in Fig. 3, the map is a composition of the inverse stereographic projection and the orthographic projection. First, we utilize the inverse stereographic projection to map $R^{n}$ to $S_{⊙}^{n}$ , where $S_{⊙}^{n}$ is a unit sphere without north pole, i.e.,

\begin{matrix} (74) & S_{⊙}^{n} = \{x \in R^{n + 1} | ∥ x ∥_{2} = 1, x_{n + 1} < 1\} . \end{matrix}

The inverse stereographic projection $f_{s}$ is defined as

\begin{matrix} (75) & f_{s} (x) = \frac{{(2 x^{⊤}, x^{⊤} x - 1)}^{⊤}}{x^{⊤} x + 1} \in S_{⊙}^{n}, \forall x \in R^{n} . \end{matrix}

Note that $f_{s}$ is a diffeomorphism between $R^{n}$ and $S_{⊙}^{n}$ . We then project $S_{⊙}^{n}$ from $R^{n + 1}$ back into $R^{n}$ to obtain

\begin{matrix} (76) & B^{n} = \{x \in R^{n} | ∥ x ∥_{2} \leq 1\} . \end{matrix}

The map is described by

\begin{matrix} (77) & f_{o} (x) = {(x_{1}, \dots, x_{n})}^{⊤} \in B^{n}, \forall x \in S_{⊙}^{n}, \end{matrix}

which is indeed a smooth surjection onto $B^{n}$ . Each interior point of $B^{n}$ , except the center, is paired with two points in $S_{⊙}^{n}$ , whereas the center and each boundary point have a single preimage. The composition of $f_{s}$ , $f_{o}$ , and an affine transformation is a smooth surjection:

\begin{matrix} (78) & f_{B} (x) = o + \frac{2 r x}{x^{⊤} x + 1} \in P^{B}, \forall x \in R^{n} . \end{matrix}

The map $f_{B}$ introduces a new coordinate, denoted by $ξ$ , such that optimizing $ξ$ over $R^{n}$ always satisfies the constraint on $q$ described by $P^{B}$ . For the $i$ -th intermediate point $q_{i}$ , constrained by a ball with center $o_{i}$ and radius $r_{i}$ , denote by $ξ_{i}$ the corresponding new coordinate.

Accordingly, denote by $ξ$ the collection of new coordinates for $q$ . Optimizing $ξ$ requires propagating the gradient $\partial J / \partial q$ through $f_{B}$ .

Denote by $g_{i}$ the $i$ -th entry $\partial J / \partial q_{i}$ in $\partial J / \partial q$ . Differentiating the layer $f_{B}$ gives the gradient

\begin{matrix} (79) & \frac{\partial J}{\partial ξ_{i}} = \frac{2 r_{i} g_{i}}{ξ_{i}^{⊤} ξ_{i} + 1} - \frac{4 r_{i} (ξ_{i}^{⊤} g_{i}) ξ_{i}}{(ξ_{i}^{⊤} ξ_{i} + 1)^{2}} . \end{matrix}

If the optimization needs to start from an initial guess $q$ , the corresponding $ξ$ can be recovered through a local inverse of $f_{B}$ , given for $1 ⩽ i < M$ by

\begin{matrix} (80) & ξ_{i} = \frac{r_{i} - \sqrt{r_{i}^{2} - ∥ q_{i} - o_{i} ∥_{2}^{2}}}{∥ q_{i} - o_{i} ∥_{2}^{2}} (q_{i} - o_{i}) . \end{matrix}
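
The ball layer can be sketched in a few lines. The following is an illustrative sketch in plain Python with hypothetical function names, covering the forward map $(78)$ , the gradient pull-back $(79)$ , and the local inverse $(80)$ . Returning $ξ_{i} = 0$ at the ball center is an assumption made here to avoid the $0 / 0$ form in $(80)$ ; it is consistent with $f_{B} (0) = o$ .

```python
import math


def f_ball(xi, o, r):
    """Smooth surjection (78): any xi in R^n lands inside the ball (o, r)."""
    s = sum(x * x for x in xi) + 1.0
    return [oi + 2.0 * r * x / s for oi, x in zip(o, xi)]


def grad_xi(xi, r, g):
    """Pull g = dJ/dq back to dJ/dxi, as in (79)."""
    s = sum(x * x for x in xi) + 1.0
    dot = sum(x * gi for x, gi in zip(xi, g))
    return [2.0 * r * gi / s - 4.0 * r * dot * x / s ** 2
            for x, gi in zip(xi, g)]


def f_ball_inv(q, o, r):
    """Local inverse (80): returns the preimage with norm at most 1."""
    d = [qi - oi for qi, oi in zip(q, o)]
    d2 = sum(x * x for x in d)
    if d2 == 0.0:
        return [0.0] * len(q)       # ball center: xi = 0 since f_ball(0) = o
    # max(...) guards against tiny negative values caused by rounding
    a = (r - math.sqrt(max(r * r - d2, 0.0))) / d2
    return [a * x for x in d]
```

Because every point strictly inside the ball, other than the center, has two preimages, the inverse only recovers the one with $∥ ξ_{i} ∥_{2} ⩽ 1$ ; a $ξ_{i}$ with larger norm maps to the same $q_{i}$ as its inverted counterpart $ξ_{i} / ∥ ξ_{i} ∥_{2}^{2}$ .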

fig-9-4 — Figure 4[1]: Constrained minimum $q^*$ of a convex function $J(q)$ within a 2-D ball. Transformed by $f_\mathcal{B}$, the resultant function $J(f_\mathcal{B}(\xi))$ becomes nonconvex but it preserves the local minimum $\xi^*$ satisfying $q^*=f_\mathcal{B}(\xi^*)$ with no additional local minimum introduced.

Similarly, we analyze influences that the smooth surjection $f_{B}$ imposes on the constrained local minima in $P^{B}$ . Although $f_{B}$ lacks the one-to-one correspondence as diffeomorphisms possess, its components are all well-formed. Firstly, $f_{o}$ only takes the first $n$ entries of a point. This operation preserves at least the first-order necessary conditions for local minima in either $B^{n}$ or $S_{⊙}^{n}$ . Secondly, $f_{s}$ is a diffeomorphism between $S_{⊙}^{n}$ and $R^{n}$ , thus satisfying Proposition 2. Therefore, we can also confirm that $f_{B}$ does not produce extra spurious local minima or cancel any existing one. As shown in Fig. 4, the constrained minimum within a 2-D ball is transformed into an unconstrained minimum.

3.3 Polyhedral Spatial Constraint Elimination

fig-9-5 — Figure 5[1]: Transformations on a convex polytope. A convex polytope $\mathcal{P}^\mathcal{H}$ with $\hat{n}+1$ vertices is indeed a standard $\hat{n}$-simplex in the barycentric coordinate. The simplex $\mathcal{P}^\mathcal{H}_w$ is then the image of an entry-wise square map $[\cdot]^2$ with ball-shaped domain, which can be eliminated as in Fig. 3.

Now we consider the elimination of polyhedral constraints. Specifically, $P$ is a closed convex polytope $P^{H}$ defined by

\begin{matrix} (81) & P^{H} = \{x \in R^{n} | A x ⪯ b\}, \end{matrix}

where $Int (P^{H}) \neq \emptyset$ according to $(15)$ . Common optimization algorithms use the $H$ -representation of $P^{H}$ as linear inequality constraints. In our framework, we exploit their geometrical property to eliminate these constraints so that $T_{MINCO}$ can be freely deformed.

To achieve this, we use the $V$ -representation of $P^{H}$ instead, where any $q \in P^{H}$ has a (general) barycentric coordinate, i.e., a convex combination of vertices. To obtain the vertices, we apply the efficient convex hull algorithm to the dual of $P^{H}$ based on an interior point calculated by Seidel's algorithm. Note that this procedure produces negligible overhead in our case ( $n ⩽ 4$ ).

The procedure to eliminate a polytope constraint is illustrated in Fig. 5. We denote all $\hat{n} + 1$ vertices of $P^{H}$ by $(v_{0}, \dots, v_{\hat{n}})$ , where $v_{i} \in R^{n}$ for each $i$ . The barycentric coordinate of a point $q \in P^{H}$ consists of the weights for these vertices. To obtain a more compact form, define ${\hat{v}}_{i} = v_{i} - v_{0}$ and $\hat{V} = ({\hat{v}}_{1}, \dots, {\hat{v}}_{\hat{n}})$ , then the position can be calculated as

\begin{matrix} (82) & q = v_{0} + \hat{V} w, \end{matrix}

where $w = {(w_{1}, \dots, w_{\hat{n}})}^{⊤} \in R^{\hat{n}}$ consists of the last $\hat{n}$ entries of the barycentric coordinate. The set of coordinates in convex combinations can also be written as

\begin{matrix} (83) & P_{w}^{H} = \{w \in R^{\hat{n}} | w ⪰ 0, ∥ w ∥_{1} ⩽ 1\} . \end{matrix}

The Main Theorem of Polytope Theory confirms the equivalence between $P_{w}^{H}$ and $P^{H}$ under $(82)$ . The polytope is exactly converted into a standard $\hat{n}$ -simplex by simply adding auxiliary variables and applying a linear map to $q$ .

This process does not produce additional nonlinearity in the optimization problem; it only increases the dimension of the decision variables. Therefore, we hereafter take the corresponding $w$ as the decision variable in place of $q$ .

The simplex $(83)$ can be eliminated by nonlinear transformations. We first use an entry-wise square map $[\cdot]^{2} : R^{\hat{n}} \mapsto R^{\hat{n}}$ to eliminate the nonnegativity constraints using $w = [x]^{2}$ . Since $∥ w ∥_{1} = ∥ x ∥_{2}^{2}$ under this map, the constraint $P_{w}^{H}$ on $w$ is transformed into a closed unit ball $B^{\hat{n}}$ on $x$ ,

\begin{matrix} (84) & B^{\hat{n}} = \{x \in R^{\hat{n}} | ∥ x ∥_{2} ⩽ 1\} . \end{matrix}

Consequently, we can utilize the smooth surjection $f_{B}$ in $(78)$ again. The composition of $(82)$ , $[\cdot]^{2}$ , and $f_{B}$ yields a smooth surjection $f_{H}$ from $R^{\hat{n}}$ onto $P^{H}$ :

\begin{matrix} (85) & f_{H} (x) = v_{0} + \frac{4 \hat{V} [x]^{2}}{(x^{⊤} x + 1)^{2}} \in P^{H}, \forall x \in R^{\hat{n}} . \end{matrix}

A new coordinate $ξ$ is introduced by $f_{H}$ , where any $ξ \in R^{\hat{n}}$ ensures $q \in P^{H}$ . The boundary of $P^{H}$ is also attainable. Similarly, $ξ$ is the new coordinate for $q$ . Optimizing $ξ$ requires gradient propagation. Denote by $g_{i}$ the $i$ -th gradient $\partial J / \partial q_{i}$ in $\partial J / \partial q$ , then differentiating the layer $f_{H}$ gives

\begin{matrix} (86) & \frac{\partial J}{\partial ξ_{i}} = \frac{8 ξ_{i} \circ {\hat{V}}^{⊤} g_{i}}{(ξ_{i}^{⊤} ξ_{i} + 1)^{2}} - \frac{16 g_{i}^{⊤} \hat{V} [ξ_{i}]^{2}}{(ξ_{i}^{⊤} ξ_{i} + 1)^{3}} ξ_{i} . \end{matrix}

If an initial guess $q$ is specified, the corresponding $ξ$ can be computed via the local inverse of $f_{H}$ . The barycentric coordinate of each $q_{i}$ can be obtained using the analytic approach by Warren et al. After that, the analytic local inverses of $[\cdot]^{2}$ and $f_{B} (\cdot)$ give the desired $ξ_{i}$ . Another flexible way is to directly minimize the squared distance between $f_{H} (ξ_{i})$ and the given $q_{i}$ . Both approaches take negligible time and yield promising results.
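
The polytope layer can be sketched in the same way. The code below is an illustrative sketch with hypothetical function names: `f_poly` evaluates $(85)$ and also returns the weights $w$ , `grad_xi_poly` implements $(86)$ , and `xi_from_weights` composes the local inverses of $[\cdot]^{2}$ and of the unit-ball map, assuming the weights $w$ of the initial guess are already known (for example from a barycentric-coordinate routine, which is not shown).

```python
import math


def f_poly(xi, v0, V_hat):
    """Evaluate (85). V_hat is a list of edge vectors v_k - v_0."""
    s = sum(x * x for x in xi) + 1.0
    w = [4.0 * x * x / s ** 2 for x in xi]      # w >= 0 and sum(w) <= 1
    q = list(v0)
    for wk, vk in zip(w, V_hat):
        for d in range(len(q)):
            q[d] += wk * vk[d]
    return q, w


def grad_xi_poly(xi, V_hat, g):
    """Pull g = dJ/dq back to dJ/dxi, as in (86)."""
    s = sum(x * x for x in xi) + 1.0
    h = [sum(vk[d] * g[d] for d in range(len(g))) for vk in V_hat]  # V_hat^T g
    c = sum(hk * x * x for hk, x in zip(h, xi))                     # g^T V_hat [xi]^2
    return [8.0 * x * hk / s ** 2 - 16.0 * c * x / s ** 3
            for x, hk in zip(xi, h)]


def xi_from_weights(w):
    """Local inverse: x = sqrt(w) entry-wise, then invert the unit-ball map."""
    n = sum(w)                                  # equals the squared 2-norm of x
    if n == 0.0:
        return [0.0] * len(w)
    a = (1.0 - math.sqrt(max(1.0 - n, 0.0))) / n
    return [a * math.sqrt(wk) for wk in w]
```

For a unit square with $v_{0}$ at the origin, `V_hat = [[1.0, 0.0], [1.0, 1.0], [0.0, 1.0]]` gives $\hat{n} = 3$ , and any `xi` in $R^{3}$ produces a point inside the square.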

The map $[\cdot]^{2}$ in $f_{H}$ introduces additional nonlinearity into the optimization. Fortunately, variable transformation via $[\cdot]^{2}$ is a special case of the inequality-to-equality conversion. Concretely, the inequality constraints are $- w ⪯ 0$ . By introducing additional variables $x$ , the equivalent equality constraints are $- w + [x]^{2} = 0$ , yielding $w = [x]^{2}$ . This type of constraint conversion was proved by Bertsekas to preserve first/second-order necessary conditions and second-order sufficient conditions for ICPs. Hence, the additional nonlinearity in $f_{H}$ neither excludes the desired minimum nor produces undesired minima in practice.

Direct constraints on $q$ are eliminated for either $P^{B}$ or $P^{H}$ using a smooth surjection $q (ξ)$ . We can conduct unconstrained optimization on $ξ$ to minimize $J (q (ξ), T (τ))$ hereafter.

3.4 Time Integral Penalty Functional

After eliminating direct constraints, $T_{MINCO}$ can be freely deformed to meet the continuous-time constraints $G$ . However, enforcing $G$ over the entire trajectory involves infinitely many inequalities that cannot be solved by constrained optimization. It further needs temporal discretization that usually produces a large number of decision variables. To preserve the sparsity of trajectory parameterization, we decouple the resolution of constraint evaluation from the number of decision variables. Inspired by the constraint transcription, we transform $G$ into finite constraints by integral of constraint violations.

For a trajectory $p : [0, T] \mapsto R^{m}$ , we define

\begin{matrix} (87) & I_{G}^{k} [p] = \int_{0}^{T} max {[G (p (t), \dots, p^{(s)} (t)), 0]}^{k} d t, \end{matrix}

where $k \in R_{> 0}$ and $max {[\cdot, 0]}^{k}$ is the composition of the entry-wise maximum and an entry-wise power function. Specifically, smoothing is needed if $k \leq 1$ . The functional-type constraint is then equivalent to equality constraints $I_{G}^{k} [p] = 0$ . Actually, $I_{G}^{k} [p]$ is a function of trajectory parameters, which we adopt as penalty terms. If $k = 1$ , it forms a nonsmooth but exact penalty. If $k > 1$ , it forms a differentiable strictly convex penalty. Thus either $I_{G}^{3} [p]$ or a smoothing approximation of $I_{G}^{1} [p]$ can be adopted. For simplicity, we utilize $I_{G}^{3} [p]$ hereafter unless otherwise specified. There are two reasons for choosing a penalty function method. Firstly, the integral in $(87)$ can only be evaluated numerically, making the constraint approximation inevitable. Secondly, penalty methods do not require a feasible initial guess, which is nontrivial to construct.

We define the time integral penalty functional for $p (t)$ as

\begin{matrix} (88) & I_{G} [p] = χ^{⊤} I_{G}^{k} [p], \end{matrix}

where $χ \in R_{⩾ 0}^{n_{g}}$ is a weight vector. Normally, $χ$ should contain large constants. If no constraint is violated, $I_{G} [p]$ remains zero. Otherwise, if any part on $p (t)$ violates any constraint in $G$ , the penalty functional $I_{G} [p]$ grows rapidly. By incorporating $I_{G} [p]$ into the cost functional, continuous-time constraints are enforced within an acceptable tolerance.

Practically, $I_{G} [p]$ can only be evaluated by quadrature. To conduct the quadrature, we first define a sampled function $G_{τ} : R^{2 s \times m} \times R_{> 0} \times [0, 1] \mapsto R^{n_{g}}$ as

\begin{matrix} (89) & G_{τ} (c_{i}, T_{i}, τ) = G (c_{i}^{⊤} β (T_{i} \cdot τ), \dots, c_{i}^{⊤} β^{(s)} (T_{i} \cdot τ)), \end{matrix}

where $τ \in [0, 1]$ is a normalized stamp. Then the quadrature for $I_{G} [p]$ , denoted by $I : R^{2 M s \times m} \times R_{> 0}^{M} \mapsto R_{⩾ 0}$ , is computed as a weighted sum of the sampled penalty,

\begin{matrix} (90) & I (c, T) = \sum_{i = 1}^{M} \frac{T_{i}}{κ_{i}} \sum_{j = 0}^{κ_{i}} {\bar{ω}}_{j} χ^{⊤} max {[G_{τ} (c_{i}, T_{i}, \frac{j}{κ_{i}}), 0]}^{k}, \end{matrix}

where $κ_{i}$ controls the resolution. We choose the trapezoidal rule $({\bar{ω}}_{0}, {\bar{ω}}_{1}, \dots, {\bar{ω}}_{κ_{i} - 1}, {\bar{ω}}_{κ_{i}}) = (1 / 2, 1, \dots, 1, 1 / 2)$ because of its reliable performance for ill-shaped $C^{2}$ integrands in our practice. Intuitively, $I (c, T)$ is a differentiable approximation to $I_{G} [p]$ , whose precision is adjustable through $κ_{i}$ . The value and gradient at most timestamps can be computed in parallel and then directly combined.
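
The quadrature $(90)$ can be sketched for a scalar trajectory ( $m = 1$ ) as follows. This is an illustrative sketch, not the authors' implementation: the names `poly_derivs`, `penalty_quadrature`, and `speed_limit` are hypothetical, each piece is given by monomial coefficients in local time, the same $κ$ is used for all pieces, and the speed limit is just one example of a constraint $G$ .

```python
def poly_derivs(c, t, order):
    """Return [p(t), p'(t), ..., p^(order)(t)] for p(t) = sum_k c[k] * t**k."""
    out = []
    coef = list(c)
    for _ in range(order + 1):
        out.append(sum(ck * t ** k for k, ck in enumerate(coef)))
        coef = [k * ck for k, ck in enumerate(coef)][1:]  # differentiate once
    return out


def penalty_quadrature(pieces, constraint, chi, kappa, s=1, k=3):
    """Trapezoidal approximation of the penalty, following (90).

    pieces: list of (coefficients, duration); constraint maps the list of
    derivatives up to order s to a list of values that should be <= 0.
    """
    total = 0.0
    for c, T in pieces:
        acc = 0.0
        for j in range(kappa + 1):
            omega = 0.5 if j in (0, kappa) else 1.0       # trapezoidal weights
            viol = constraint(poly_derivs(c, T * j / kappa, s))
            acc += omega * sum(x * max(v, 0.0) ** k for x, v in zip(chi, viol))
        total += T / kappa * acc
    return total


def speed_limit(v_max):
    """Example constraint G = v^2 - v_max^2 <= 0 on the first derivative."""
    return lambda derivs: [derivs[1] ** 2 - v_max ** 2]
```

For a single piece $p (t) = 2 t$ of duration $1.5$ with `v_max = 1`, the violation is constant, so the rule returns $1.5 \cdot 3^{3} = 40.5$ for any $κ$ with unit weight and $k = 3$ . A piece that respects the limit contributes exactly zero.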

3.5 Trajectory Optimization via Unconstrained NLP

Due to $G$ and $F$ in $(11)$ , the optimal trajectory parameterization is generally hard to know. Unlike traditional methods approximating solutions via a large number of variables, we propose to solve a lightweight relaxed optimization via unconstrained NLP, where the spatial-temporal deformation of $T_{MINCO}$ is applied. The relaxation to $(11)$ is defined as

\begin{matrix} (91) & min_{ξ, τ} J (q (ξ), T (τ)) + I (c (q (ξ), T (τ)), T (τ)), \end{matrix}

where $J$ is the time-regularized control effort $(69)$ for $T_{MINCO}$ and $I$ is the quadrature of penalty functional $(90)$ . Note that any task-specific requirement, either objectives or constraints, can be combined in $(91)$ without affecting its structure.

To generate trajectories for a flat multicopter, we first parameterize its flat-output trajectory as $T_{MINCO}$ . After assigning a fixed number of pieces into each $P_{i}$ , variable transformations are applied to eliminate all direct constraints.

User-defined $G_{D}$ are also transformed into $G$ via $Ψ_{x}$ and $Ψ_{u}$ . Finally, we obtain the cost function $(91)$ .

The gradient propagation has been derived for all layers except $Ψ_{x}$ and $Ψ_{u}$ . One can either apply Automatic Differentiation (AD) to $Ψ_{x}$ and $Ψ_{u}$ or derive the gradient propagation analytically by following the reverse-mode AD. By the Baur-Strassen theorem, evaluating the gradient costs at most a constant multiple of evaluating the flatness map itself. The differentiation is only needed for the given flat dynamics once and for all. With available gradient, the relaxation $(91)$ is then solved by the L-BFGS algorithm.
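
The structure of $(91)$ can be illustrated on a deliberately tiny toy problem. The sketch below is not the MINCO pipeline: it has a single point $q (ξ)$ confined to the unit disk, a single duration $T = e^{τ}$ , a hypothetical cost $∥ q - a ∥_{2}^{2} / T + T$ with the target $a$ outside the disk, no penalty term, and plain gradient descent as a stand-in for L-BFGS. It only shows how the layer gradients chain together and that the unconstrained iterates settle on the constrained minimizer.

```python
import math


def q_of(xi):
    """Unit-disk surjection, i.e. (78) with o = 0 and r = 1 in two dimensions."""
    s = xi[0] ** 2 + xi[1] ** 2 + 1.0
    return [2.0 * xi[0] / s, 2.0 * xi[1] / s]


def cost_and_grad(xi, tau, target, weight):
    """Toy cost J = |q - target|^2 / T + weight * T and its gradient in (xi, tau)."""
    T = math.exp(tau)
    q = q_of(xi)
    d = [q[0] - target[0], q[1] - target[1]]
    d2 = d[0] ** 2 + d[1] ** 2
    J = d2 / T + weight * T
    g_q = [2.0 * d[0] / T, 2.0 * d[1] / T]       # dJ/dq
    g_T = -d2 / T ** 2 + weight                   # dJ/dT
    s = xi[0] ** 2 + xi[1] ** 2 + 1.0
    dot = xi[0] * g_q[0] + xi[1] * g_q[1]
    g_xi = [2.0 * g_q[i] / s - 4.0 * dot * xi[i] / s ** 2 for i in range(2)]
    g_tau = g_T * T                               # chain rule through T = exp(tau)
    return J, g_xi, g_tau


def solve(xi, tau, target=(2.0, 0.0), weight=1.0, step=0.1, iters=2000):
    """Fixed-step gradient descent; a simple stand-in for L-BFGS."""
    xi = list(xi)
    for _ in range(iters):
        _, g_xi, g_tau = cost_and_grad(xi, tau, target, weight)
        xi = [xi[i] - step * g_xi[i] for i in range(2)]
        tau -= step * g_tau
    return xi, tau
```

Starting from `solve([0.3, 0.2], 0.5)`, the point $q (ξ)$ moves to the boundary point of the disk closest to the target and $T$ balances the two cost terms, without any explicit constraint handling in the loop.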

4 Conclusion (By GPT-5.2 in Github Copilot)

MINCO can be viewed as a back-end trajectory generator + lightweight unconstrained optimizer: given a piece assignment and boundary/intermediate conditions, it constructs the unique minimum-control-effort spline (solving a multi-stage LQMT like $(18)$ ), and then deforms its spatial and temporal parameters to satisfy geometry and continuous-time constraints (relaxing the original problem $(11)$ into an unconstrained NLP $(91)$ ).

Below is a practical procedure you can follow to implement the MINCO pipeline.

MINCO procedure (implementation-oriented)

Inputs

- Flatness order $s$ (e.g., jerk $s = 3$ or snap $s = 4$ ), weights $W$ , and time regularization $ρ (\cdot)$ in $(11)$ , $(12)$ , $(13)$ .
- Start/goal boundary derivatives ${\bar{z}}_{o}, {\bar{z}}_{f}$ (see $(11 e)$ ).
- Free-space approximation $\tilde{F} = ⋃_{i} P_{i}$ (see $(14)$ – $(17)$ ), plus piece assignment (e.g., $K$ pieces per primitive).
- Continuous-time dynamic constraints $G$ (obtained from $G_{D}$ via flatness maps, see $(9)$ – $(10)$ ).

Decision variables

- Spatial variables $ξ$ that generate intermediate points $q (ξ)$ using a smooth surjection (e.g., ball case $(78)$ , polytope case via $f_{H}$ ).
- Time variables $τ$ that generate $T (τ)$ via a diffeomorphism (e.g., fixed-sum case $(71)$ ; gradient uses $(72)$ ).

Algorithm

1. Discretize the topology (piece assignment)
   - Choose $M$ polynomial pieces and assign each piece to a convex primitive $P_{i}$ (sequential connection as in $(15)$ ).
   - Define intermediate points $\{q_{i}\}$ that must lie in overlaps of adjacent primitives (handled later by the surjection $q (ξ)$ ).
2. Parameterize the flat-output trajectory as a MINCO spline
   - Represent each piece with polynomial coefficients $c_{i}$ and duration $T_{i}$ .
   - Use the MINCO construction to obtain coefficients from spatial-temporal parameters: $c = c (q, T)$ (banded linear system such as $(57)$ ).
   - Compute the smoothness+time cost $J (q, T) = J_{q} (q, T) + ρ (∥ T ∥_{1})$ as in $(69)$ .
3. Eliminate direct constraints by variable transformations
   - Time positivity / fixed total time: optimize over $τ$ and map to $T (τ)$ (e.g., $(71)$ ).
   - Geometry of intermediate points: optimize over $ξ$ and map to $q (ξ)$ (e.g., $(78)$ for balls; analogous smooth surjection for polytopes).
   - After this step, constraints like $T ≻ 0$ and $q_{i} \in P$ are satisfied by construction.
4. Handle continuous-time constraints via integral penalty
   - Convert the infinite constraints $G (z (t), \dots, z^{(s)} (t)) ⪯ 0$ into a penalty functional $(87)$ – $(88)$ .
   - Numerically evaluate it by quadrature, yielding $I (c, T)$ in $(90)$ .
5. Solve the unconstrained NLP
   - Minimize the relaxed objective $min_{ξ, τ} J (q (ξ), T (τ)) + I (c (q (ξ), T (τ)), T (τ))$ in $(91)$ .
   - Use L-BFGS (as described at the end of §3.5). For gradients, propagate through layers:
     - MINCO layer $c (q, T)$ : linear-complexity gradients w.r.t. $q$ and $T$ (see $(64)$ , $(68)$ ).
     - Time map $T (τ)$ : use $(72)$ (or the exponential map case).
     - Spatial map $q (ξ)$ : use the analytic Jacobians (e.g., $(86)$ ).
     - Flatness maps $Ψ_{x}, Ψ_{u}$ : use AD or derive reverse-mode gradients once (see §3.5).
6. Recover the feasible multicopter trajectory
   - Output the optimized flat trajectory $z (t)$ , then map to state/input using $(2)$ – $(3)$ .
   - Validate $z (t) \in F$ and $G ⪯ 0$ using the same sampling resolution $κ_{i}$ as in $(90)$ .

This procedure is exactly the “MINCO-as-a-module” viewpoint: (i) construct the unique minimum-effort spline from $q, T$ , and (ii) optimize $q, T$ (via $ξ, τ$ ) to satisfy geometry and dynamics.
