A nonlinear tracking model predictive control scheme for dynamic target signals

Page created by Beth Thomas
 
CONTINUE READING
A nonlinear tracking model predictive control scheme for
                     dynamic target signals ?

                         Johannes Köhler1 , Matthias A. Müller2 , Frank Allgöwer1
          1
              Institute for Systems Theory and Automatic Control, University of Stuttgart, 70550 Stuttgart, Germany
                    2
                        Institute of Automatic Control,Leibniz University Hannover, 30167 Hannover, Germany

Abstract

We present a nonlinear model predictive control (MPC) scheme for tracking of dynamic target signals. The scheme combines
stabilization and dynamic trajectory planning in one layer, thus ensuring constraint satisfaction irrespective of changes in the
dynamic target signal. For periodic target signals we ensure exponential stability of the optimal reachable periodic trajectory
using suitable terminal ingredients and a convexity condition for the underlying periodic optimal control problem. Furthermore,
we introduce an online optimization of the terminal set size to automate the trade-off between fast convergence and operation
close to the constraints. In addition, we show how stabilization and dynamic trajectory planning can be formulated as
partially decoupled optimization problems, which reduces the computational demand while ensuring recursive feasibility and
convergence. The main tool to enable the proposed design is a novel reference generic offline computation that provides
suitable terminal ingredients for tracking of dynamic reference trajectories. The practicality of this approach is demonstrated
on benchmark examples, which demonstrates superior performance compared to state of the art approaches.

Key words: model predictive control; control of constrained systems; output regulation; periodic references; reference
tracking; nonlinear systems; stability

1   Introduction                                                    In many applications, the control goal goes beyond the
                                                                    stabilization of a pre-determined setpoint. These prac-
Model Predictive Control (MPC) is a well established                tical challenges include tracking of changing reference
control method, that computes the control input by re-              setpoints, stabilization of dynamic trajectories, output
peatedly solving an optimization problem online [29].               regulation and general economic optimal operation. De-
The main advantages of MPC are the ability to cope with             signing MPC schemes that provide theoretical guaran-
nonlinear dynamics, hard state and input constraints,               tees (recursive feasibility, stability and performance) for
and the inclusion of performance criteria.                          such control problems is the focus of much research.

Motivation: Most of the existing theoretical results for            Related work: In [10,15] tracking of reachable dynamic
MPC consider the problem of stabilizing some given                  reference trajectories is studied and stability is ensured
steady-state [29]. Theoretical properties, such as recur-           by using suitable terminal ingredients. In [14], a track-
sive feasibility and asymptotic stability can be ensured            ing MPC scheme without terminal ingredients for un-
by including suitable terminal ingredients (terminal set            reachable target signals is studied and (practical) sta-
and terminal cost) in the optimization problem, which               bility of the optimal reachable trajectory is established.
are computed offline, compare [5,24].                               The MPC scheme is simple to implement, however, the
                                                                    theoretical guarantees depend on a sufficiently large pre-
? The material in this paper was partially presented at the         diction horizon that may be conservative for many ap-
6th IFAC Conference on Nonlinear Model Predictive Con-
                                                                    plications.
trol, August 19—22, 2018, Madison, Wisconsin (USA).
  Email addresses:                                                  A promising alternative to tackle the problem of un-
johannes.koehler@ist.uni-stuttgart.de (Johannes                     reachable target signals, is the simultaneous opti-
Köhler1 ), mueller@irt.uni-hannover.de (Matthias A.                mization of an artificial reference, which is pursued
Müller2 ), frank.allgower@ist.uni-stuttgart.de (Frank              in [18,19,21,9,26,11]. By using terminal constraints for
Allgöwer1 ).                                                       the artificial reference, these schemes provide a large

Preprint submitted to Automatica                                                                                 14 April 2020
region of attraction and ensure recursive feasibility in-         sive exposition of the subject, unify the consideration
dependent of the (typically exogenous) target signal.             of different terminal ingredients, include an online opti-
In particular, in [18] a setpoint tracking MPC scheme             mization of the terminal set size, discuss how to partially
for linear systems has been introduced based on simul-            decouple the reference update and add an example with
taneous optimization of an artificial steady state and            a quantitative comparison to state of the art approaches.
tracking of this steady state. Compared to a standard
MPC formulation, this scheme ensures recursive feasi-             Outline: Sec. 2 discusses preliminaries regarding track-
bility independent of the target signal and provides a            ing MPC. Sec. 3 contains the proposed MPC scheme, in-
large region of attraction. In [31], the complexity of the        cluding theoretical analysis and extensions. Sec. 4 shows
polyhedral invariant set for tracking from [18] has been          the applicability and advantages of the proposed method
reduced by considering an online optimization of the              using numerical examples. Sec. 5 concludes the paper.
terminal set size. In [21], for linear systems the approach
in [18] has been extended to periodic target signals by
using an artificial periodic trajectory and a terminal            Notation: The quadratic norm with respect to a pos-
equality constraint. Similarly, in [20] for linear systems        itive definite matrix Q = Q> is denoted by kxk2Q =
the economically optimal periodic trajectory is stabi-            x> Qx. By K∞ we denote the class of functions α :
lized using a tracking formulation. The method in [18]            R≥0 → R≥0 , which are continuous, strictly increasing,
has been extended to setpoint tracking for nonlinear              unbounded and satisfy α(0) = 0. The identity matrix is
systems in [19]. Economic MPC schemes based on ar-                In ∈ Rn×n . The interior of a set X is denoted by int(X ).
tificial steady states have been considered in [9,26,11],
for both linear and nonlinear systems.
                                                                  2    Tracking MPC for reachable references
Contribution: The goal of this paper is to generalize and
unify the methodologies from [18,19,21] to design non-
linear MPC schemes that exponentially stabilize the op-           We consider the following nonlinear discrete-time sys-
timal reachable periodic trajectory given a possibly un-          tem xt+1 = f (xt , ut ) with state x ∈ Rn , control input
reachable periodic output target signal, which is a funda-        u ∈ Rm , and time t ∈ N. We impose point-wise in time
mental step towards practical nonlinear MPC schemes.              constraints on the state and input (xt , ut ) ∈ Z, with
We first generalize the conditions on the terminal ingre-         some compact set Z. As a preliminary problem, we con-
dients, such that we can present a unified theorem that           sider tracking of a given reachable reference trajectory
incompases terminal equality constraints (TEC) (under             rt = (xrt , urt ) ∈ Rn+m , analogous to [15].
suitable controllabiltiy conditions) and suitable termi-
nal cost and terminal set (which can be designed using            Assumption 1 The reference trajectory r satisfies rt ∈
the reference generic offline computation in [15]). Then          Zr , for all t ≥ 0, with some set Zr ⊆ int(Z). Further-
we design a nonlinear tracking MPC scheme for unreach-            more, the evolution of the reference trajectory is restricted
able periodic target signals. We provide a novel proof            by rt+1 ∈ R(rt ), with R(r) = {(xr+ , ur+ ) ∈ Zr | xr+ =
to show that the optimal reachable periodic trajectory            f (xr , ur )}.
is exponentially stable for the resulting closed-loop sys-
tem, if the set of feasible periodic output trajectories
is convex. Furthermore, we extend this method to al-              This assumption characterizes that the reference tra-
low for an online optimization of the terminal set size,          jectory r is reachable, i.e., follows the dynamics f and
which significantly improves the performance, similar             (strictly) satisfies the constraints Z for all times. Denote
to [31]. This extension requires one additional scalar op-        the tracking error by et := xt − xrt . The control goal is
timization variable and automates the trade-off between           stability of the tracking error et = 0 and constraint sat-
fast convergence and operation close to the constraints,          isfaction (xt , ut ) ∈ Z, ∀t ≥ 0. We define the quadratic
which typically needs to be decided offline. In addition,         reference tracking stage cost
we provide a novel algorithm that partially decouples
the reference trajectory updates and the computation of                        `(x, u, r) = kx − xr k2Q + ku − ur k2R ,
the closed-loop input. Finally, we demonstrate the ap-
plicability and practicality of the proposed methodology          with positive definite weighting matrices Q, R. Denote
using two nonlinear benchmark examples. Furthermore,              the reference r over the prediction horizon N by r·|t
we showcase superior performance compared to state of             with rk|t = rt+k , k = 0, . . . , N . Given a predicted state
the art approaches in a quantitative comparison by in-
cluding the proposed terminal ingredients and optimiz-            and input sequence x·|t ∈ Rn×(N +1) , u·|t ∈ Rm×N the
ing terminal set size online.                                     tracking cost with respect to the reference r·|t is given by

A preliminary version of the proposed approach can be                                           N
                                                                                                X −1
found in the conference proceedings [13]. Compared to             JN (x·|t , u·|t , r·|t ) :=          `(xk|t , uk|t , rk|t ) + Vf (xN |t , rN |t ),
the conference version, we provide a more comprehen-                                            k=0

                                                              2
with the terminal cost Vf . The MPC scheme is based on                sible and the tracking error e = 0 is uniformly exponen-
the following (standard) MPC optimization problem                     tially stable for the resulting closed-loop system.

         Vt := min JN (x·|t , u·|t , r·|t )                (1a)
                u·|t                                                  PROOF. This theorem is a straight forward exten-
          s.t. xk+1|t = f (xk|t , uk|t ),                  (1b)       sion of standard MPC results [29], compare also [15,10].
               (xk|t , uk|t ) ∈ Z, k = 0, . . . , N − 1,   (1c)       Given the optimal solution u∗·|t , the candidate sequence
               x0|t = xt , xN |t ∈ Xf (rN |t ),            (1d)                         (
                                                                                            u∗k+1|t              0≤k ≤N −2
                                                                            uk|t+1 =                                       ,
where Xf (r) ⊂ Rn denotes the terminal set. The solution                                    kf (x∗N |t , rN |t ) k = N − 1
to this optimization problem are the value function Vt
and the optimal state and input trajectories x∗·|t , u∗·|t .          is a feasible solution to (1) and implies
For simplicity, we assume throughout the paper that
the optimization problems admit a unique minimizer 1 .                          Vt+1 − Vt ≤ −`(xt , ut , rt ) ≤ −ket k2Q .           (4)
In closed-loop operation we apply the first part of the
optimized input trajectory to the system, leading to the              Compactness of Z in combination with the local
closed-loop system xt+1 = f (xt , u∗0|t ) = x∗1|t , t ≥ 0. As         quadratic upper bound (3) imply ket k2Q ≤ Vt ≤ cv ket k2Q ,
discussed in the introduction, we need suitable terminal              for some cv ≥ cu ≥ 1 and all xt such that Problem (1) is
ingredients to ensure stability and recursive feasibility             feasible, compare [29, Prop. 2.16]. Uniform exponential
for the closed loop.                                                  stability follows from standard Lyapunov arguments
                                                                      based on the value function V .                         
Assumption 2 There exist a terminal controller kf :
Rn × Zr → Rm , a terminal cost Vf : Rn × Zr and a ter-
minal set Xf (r) ⊂ Rn , such that the following properties            Thus, the closed-loop tracking MPC scheme given by (1)
hold for any r ∈ Zr , any x ∈ Xf (r) and any r+ ∈ R(r)                has all the (standard) desirable properties, in case that a
                                                                      reachable reference trajectory shall be tracked. The main
       Vf (x+ , r+ ) ≤Vf (x, r) − `(x, kf (x, r), r),      (2a)       contribution of this work (see Sec. 3) is the development
      (x, kf (x, r)) ∈Z,                                   (2b)       of MPC schemes for more general tracking problems, in-
                                                                      cluding cases where an unreachable dynamic target sig-
                 x+ ∈Xf (r+ ),                             (2c)       nal is given. We note that the tracking MPC scheme (1)
                                                                      (and correspondingly also the proposed scheme in Sec-
with x+ = f (x, kf (x, r)). Furthermore, there exist con-             tion 3) can be easily modified to ensure robust reference
stants cu ,  > 0, such that for any reference r·|t satisfying        tracking, compare [16], [15, Thm. 2], and the discussion
Ass. 1, and any xt ∈ Rn with ket kQ ≤ , Problem (1) is               in Remark 9.
feasible and the value function satisfies
                                                                      In the following, we briefly detail how Assumption 2 can
                         Vt ≤ cu ket k2Q .                  (3)       be satisfied using either a terminal equality constraint
                                                                      or a suitable terminal cost.

The first set of conditions (2) is standard in (reference             Terminal equality constraint - controllability
tracking) MPC, compare for example [5,29,10,15]. Con-
dition (3) ensures that the value function admits a local             The following proposition shows that a simple terminal
quadratic upper bound. 2 In Prop. 4 and Lemma 5 be-                   equality constraint (TEC) with Xf (r) = xr , Vf = 0 sat-
low, we provide sufficient conditions for Assumption 2                isfies Ass. 2, if the system is locally uniformly exponen-
using controllability and stabilizability conditions.                 tially finite time controllable.

Theorem 3 Let Ass. 1–2 hold. Assume that Problem (1)                  Proposition 4 Suppose there exist constants N0 ∈ N,
is feasible at t = 0. Then Problem (1) is recursively fea-            cu , TEC > 0, such that for any reference trajectory
                                                                      r·|t satisfying Ass. 1, and any state xt ∈ Rn satisfying
1
  Existence of a minimizer is, e.g., guaranteed if f, `, Vf are       ket kQ ≤ TEC , there exists an input trajectory u·|t ∈
continuous and the constraint set is compact. If the mini-            Rm×N0 such that
mizer is not unique, an arbitrary minimizer can be chosen.
2
  The weak controllability condition [29, Ass. 2.17], which                   xk+1|t = f (xk|t , uk|t ),       xN0 |t = xr,N0 |t ,
does not require feasibility of Problem (1) locally around
the reference trajectory r, would be sufficient to prove The-                 JN0 (x·|t , u·|t , r·|t ) ≤ cu ket k2Q .
orem 3. Assumption 2, similar to condition [19, Ass. 4] used
for setpoint stabilization, is stronger and will be used in the       Then for any N ≥ N0 , Ass. 2 holds with Xf (r) = xr ,
proof of Theorem 8 in Section 3.                                      Vf (x, r) = 0, kf (x, r) = ur and some constant  > 0.

                                                                  3
PROOF. Satisfaction of (2) is standard. Consider                                     set Xf the terminal controller kf is a feasible solution
Problem (1) at time t with initial condition satisfying                              to (1) and thus using (2a) the terminal cost Vf is an up-
ket kQ ≤  ≤ TEC , some prediction horizon N ≥ N0 ,                                 per bound on the value function V , which ensures (3). 
and the input u·|t appended by uk|t = urk|t , k ≥ N0 . This
candidate input u satisfies JN (x·|t , u·|t , r·|t ) ≤ cu ket k2Q
and kxk|t − xrk|t k2Q + kuk|t − urk|t k2R ≤ cu 2 . Given that                       Proposition 3 in [15] provides a semidefinite program
rk|t ∈ Zr ⊆ int(Z), there exists a small enough con-                                 (SDP) to compute matrices Pf , Kf satisfying (6) using
stant  ∈ (0, TEC ], such that (xk|t , uk|t ) ∈ Z. Thus, the                        a parametrization of the form Pf = X −1 , Kf = Y Pf ,
                                                                                     θj : Z → R,
candidate solution is a feasible solution to (1) and the
bound (3) holds.                                                                                      X                             X
                                                                                         X(r) = X0 +       θj (r)Xj , Y (r) = Y0 +       θj (r)Yj ,
                                                                                                       j                             j
The considered controllability condition holds, for exam-
ple, if the dynamic f is continuously differentiable and                             using methods for (quasi) linear parameter varying
the linearized dynamic f around any reachable trajec-                                (LPV) systems and gain scheduling [30]. This reference
tory r is N0 -step uniformly controllable.                                           generic offline design is only done once for a given sys-
                                                                                     tem f and class of reference trajectories (Ass. 1) and
Terminal cost - reference generic offline computations                               the resulting terminal ingredients can be applied if the
                                                                                     reference r is optimized online (cf. Sec. 3). Details on
                                                                                     the computation with numerical examples can be found
In the following, we discuss how to compute a terminal
                                                                                     in [15].
cost Vf and a terminal set Xf satisfying Ass. 2. Denote
the Jacobian of f evaluated around an arbitrary point
r ∈ Zr by                                                                            Discussion: The main advantage of using a terminal
                                                                                     equality constraint (TEC, Prop. 4) is the fact that no of-
            
                ∂f
                                               
                                                    ∂f
                                                                                    fline design is needed. One of the main benefits of using
   A(r) =                          ,   B(r) =                          .   (5)       a terminal cost/set (QINF, Lemma 5) is that the desired
                ∂x       (x,u)=r                    ∂u       (x,u)=r                 properties also hold for an arbitrarily small prediction
                                                                                     horizon N . Furthermore, the values of cu ,  can be com-
                                                                                     puted explicitly and are typically significantly less con-
Lemma 5 Suppose that f is twice continuously differen-                               servative than the ones obtained when using a terminal
tiable. Assume that there exist a matrix Kf (r) ∈ Rm×n                               equality constraint (TEC, Prop 4), which impacts the
and a positive definite matrix Pf (r) ∈ Rn×n continuous                              closed-loop convergence rate. This impact on closed-loop
in r, such that for any r ∈ Zr , r+ ∈ R(r), the following                            performance is quantitatively investigated with numer-
matrix inequality is satisfied                                                       ical examples in Section 4.

    (A(r) + B(r)Kf (r))> Pf (r+ )(A(r) + B(r)Kf (r))                                 3     Nonlinear tracking MPC for dynamic target
  ≤Pf (r) − (Q + Kf (r)> RKf (r)) − ˜In                                   (6)             signals

with some constant ˜ > 0. Then there exists a sufficiently                          In the following, we design a nonlinear MPC scheme for
small α, such that Ass. 2 is satisfied for any N ≥ 0 with                            tracking of exogenous (unreachable) periodic target sig-
                                                                                     nals using the terminal ingredients in Assumption 2. For
  Vf (x, r) = kx − xr k2Pf (r) , cu = sup λmax (Pf (r), Q),                          the resulting closed-loop system, Theorem 8 establishes
                                      r∈Zr
                                           p                                         exponential stability of the optimal reachable periodic
  Xf (r) = {x ∈ Rn | Vf (x, r) ≤ α},  = α/cu ,                                      trajectory. In Section 3.3 we discuss how online optimiza-
  kf (x, r) = ur + Kf (r) · (x − xr ),                                               tion of the terminal set size α can speed up convergence,
                                                                                     while at the same time allowing optimal operation (ar-
                                                                                     bitrarily) close to the constraints. Algorithm 1 in Sec-
where λmax (P, Q) denotes the maximal generalized eigen-
                                                                                     tion 3.4 provides a simple means to reduce the compu-
value solving (P − λQ)v = 0, for some v ∈ Rn .
                                                                                     tational demand by partially decoupling the trajectory
                                                                                     planning and the reference tracking problem.
PROOF. Satisfaction of conditions (2) follows from [15,
Lemma 1] based on standard differentiability/Lipschitz                               3.1    Nonlinear periodic tracking MPC
continuity arguments, compare also [5,29,10]. Further-
more, kx−xr k2Q ≤  implies Vf (x, r) ≤ cu kx−xr k2Q ≤ α                             In [18,19,21], tracking MPC schemes based on simulta-
and hence x ∈ Xf (r). Using standard arguments (com-                                 neous optimization of an artificial reference have been
pare for example [29, Prop. 2.35]), within the terminal                              introduced. Compared to a standard reference tracking

                                                                                 4
MPC formulation such as (1), these schemes ensure re-                             using JT . As we will see later in the theoretical analysis
cursive feasibility independent of the (potentially un-                           (Thm. 8) and the numerical examples (Sec. 4), this for-
reachable) target signal and provide a large region of at-                        mulation ensures that the closed loop smoothly tracks
traction. In the following, we extend these methods to                            the optimal reachable periodic trajectory xT ∗ .
nonlinear periodic reference tracking using general ter-
minal ingredients (Ass. 2, using a terminal equality con-                         3.2   Theoretical analysis
straint (TEC, Prop. 4) or a terminal cost/set (QINF,
Lemma 5)).
                                                                                  In the following, we derive the theoretical properties of
                                                       r         r   r            the closed-loop system based on (8). The following con-
We consider a nonlinear output function y = h(x , u ) ∈                           dition is used to ensure exponential stability of the opti-
Rp and assume that at time t an exogenous T -periodic                             mal trajectory xT ∗ , similar to [21, Ass. 2],[19, Ass. 1-2].
               e
target signal y·|t ∈ Rp×T is given. For some T -periodic
reference trajectory r·|t = (xr·|t , ur·|t ) ∈ R(n+m)×T , the                     Assumption 6 There exist (unique) locally Lipschitz
tracking cost w.r.t. this target signal y e is defined as                         continuous functions gx : Rp×T → Rn×T , gu : Rp×T →
                                                                                  Rm×T such that gx (y·|t r
                                                                                                            ) = xr·|t , gu (y·|t
                                                                                                                             r
                                                                                                                                 ) = ur·|t , for any
                       T
                       X −1                                                       feasible solution to (7). The set of feasible solutions to (7)
            e
JT (r·|t , y·|t ) :=       k h(xrj|t , urj|t ) −yj|t
                                                 e 2        r
                                                     kS = ky·|t    e 2
                                                                − y·|t kS ,       is convex 3 in y·|t
                                                                                                   r
                                                                                                      .
                       j=0   |    {z        }
                                        r
                                      =yj|t
                                                                                  This assumption implies that (7) is a strictly convex
                                                                                  problem and the minimizer rT ∗ is unique. Thus, for any
with some positive definite weighting matrix S ∈ Rp×p .                           y r 6= y T ∗ it is possible to incrementally change y r , such
The objective is to stabilize the reachable (Ass. 1) T -                          that it remains feasible and the cost JT decreases. Fur-
                               T∗
periodic reference trajectory r·|t = (xT·|t∗ , uT·|t∗ ) that min-                 thermore, due to convexity the directional derivative of
imizes the distance to the signal y e , which is defined as                       JT at y T ∗ in any feasible direction is non-negative, i.e.,
the minimizer to the following optimization problem                               for any reference r·|t that satisfies the constraints in (7),
                                                                                                                   r
                                                                                  the corresponding output y·|t      satisfies
          e                       e
     VT (y·|t ) = min JT (r·|t , y·|t )                               (7)
                       r·|t
                                                                                                   r      T∗ >
s.t. rj+1|t ∈ R(rj|t ) ⊆ Zr , r0|t = rT |t , j = 0, . . . , T − 1.                               (y·|t − y·|t ) ∇yr JT |yr =yT ∗ ≥ 0,             (10)

                                                          T∗                      which can be equivalently written as
The corresponding output reference is denoted by y·|t        ,
       T∗       T∗
with yk|t = h(rk|t ). In order to find and stabilize this pe-                                           e
                                                                                            JT (r·|t , y·|t          e
                                                                                                            ) ≥ VT (y·|t       T∗
                                                                                                                         ) + ky·|t    r 2
                                                                                                                                   − y·|t kS .    (11)
riodic trajectory, we consider the following optimization
problem, similar to [21]
             e                                                  e
   WT (xt , y·|t ) = min JN (x·|t , u·|t , r·|t ) + JT (r·|t , y·|t )             Remark 7 Similar to [19, Remark 1], existence of gx ,
                         u·|t ,r·|t
                                                                                  gu can be ensured based on the implicit function theorem,
             s.t. xk+1|t = f (xk|t , uk|t ), x0|t = xt ,      (8a)                if a rank condition on the linearization of a suitably de-
                  (xk|t , uk|t ) ∈ Z, xN |t ∈ Xf (rN |t ),    (8b)                fined T -step system is satisfied and f , h are continuously
                  rj+1|t ∈ R(rj|t ) ⊆ Zr , rl+T |t = rl|t , (8c)                  differentiable.
                                                                                  For a linear output h(x, u) = Cx + Du, the convexity
                  j = 0, . . . , T − 1, k = 0, . . . , N − 1,                     assumption is ensured if the constraint set in (7) (de-
                  l = 0, . . . , max{0, N − T }.                                  scribing reachable periodic orbits) is convex. For T = 1,
                                                                                  this reduces to convexity of the steady-state manifold Zr ,
The optimal state and input trajectory is given by u∗·|t ,                        which is often easy to verify. Furthermore, even in case
x∗·|t , with the artificial reference r·|t
                                        ∗
                                           = (xr∗     r∗                          the set of reachable periodic trajectories is non-convex,
                                               ·|t , u·|t ) and the
output yk|t r∗        ∗
               = h(rk|t  ). In closed-loop operation we apply                     it may be possible to choose a suitable nonlinear output
                                                                                  h, such that the convexity condition (Ass. 6) is satisfied,
the first part of the optimized input trajectory to the                           compare [7].
system, leading to the following closed-loop system                               If the convexity condition in Assumption 6 is not satis-
                                                                                  fied, the MPC scheme will not necessarily stabilize the
               xt+1 = f (xt , u∗0|t ) = x∗1|t ,     t ≥ 0.            (9)         optimal reachable trajectory xT ∗ , but could instead stabi-
                                                                                  lize a suboptimal periodic trajectory, similar to [26]. The
The rational behind this optimization problem is to pe-
                                                                                  3
nalize the (standard) tracking cost JN w.r.t. some arti-                            Given two feasible solutions r1 , r2 with corresponding
ficial periodic reference r together with the distance of                         outputs y1r , y2r , the reference r·|t = (gx (y·|t
                                                                                                                                 r           r
                                                                                                                                     ), gu (y·|t )) is a
                                                                                                                  r       r              r
the output of this artificial reference to the target signal                      feasible solution to (7) with y·|t = βy1 + (1 − β)y2 , β ∈ [0, 1].

                                                                              5
main alternative to using a tracking scheme with simulta-               with some constant Lg . Thus, strict convexity (com-
neous optimization of the artificial trajectory such as (8),            pare (11)) implies
would be to directly solve (7) and then apply a tracking
MPC for this reachable reference trajectory, compare Sec-                              ∗
                                                                                  JT (r·|t               ∗T
                                                                                           , t) − VT ≥ ky·|t    r∗ 2
                                                                                                             − y·|t kS
tion 2. If (7) is solved with a standard convex solver, the
solver will most likely end in the same local minimum as                        ≥1/L2g kxT·|t∗ − xr∗   2      2   T∗     r∗ 2
                                                                                                  ·|t kQ ≥ 1/Lg kx0|t − x0|t kQ .
the closed-loop MPC scheme. Thus, even if the convexity
condition is not satisfied, the proposed scheme (8) is still            Correspondingly, using the fact that a2 +b2 ≥ 1/2(a+b)2
a good choice.                                                          for all a, b ∈ R yields the lower bound

The following theorem establishes exponential stability                      Wt ≥ kxt − xr∗   2        ∗
                                                                                         0|t kQ + JT (r·|t , t) − VT
of the optimal reachable trajectory xT ∗ given suitable                                            ∗T
terminal ingredients (Ass. 2) and the convexity condi-                     ≥kxt − xr∗   2      2          r∗ 2          T 2
                                                                                   0|t kQ + 1/Lg kx0|t − x0|t kQ ≥ αW ket kQ ,
tion on the set of feasible periodic orbits (Ass. 6), which
is the main result of this paper. This result generalizes               with αW = 21 min{1, 1/L2g }. In case keTt k2Q ≤ 2 , In-
and unifies the results in [19,21], by considering non-                 equality (3) in Assumption 2 ensures that r·|t = r·|t  T∗
                                                                                                                                  is
linear dynamics, periodic reference trajectories, estab-                                                                       T 2
lishing exponential stability, and unifying the considera-              a feasible solution to (8), which satisfies Wt ≤ cu ket kQ .
tion of different terminal ingredients (Ass. 2, Lemma 5,                As in Theorem 3, compact constraints together with this
Prop. 4).                                                               local upper bound imply the imply the upper bound
                                                                        in (12b), compare [29, Prop. 2.16]. Inequalities (12) im-
Theorem 8 Let Assumptions 2 and 6 hold. Assume that                     ply (uniform) stability of xT ∗ for the closed-loop system,
h is bounded on Zr and that the Problem (8) is feasible at              but not necessarily asymptotic or exponential stability.
t = 0. Then the Problem (8) is recursively feasible for the             Part III: Exponential stability - case distinction:
resulting closed-loop system (9), for arbitrary target sig-             Case 1: Consider
nals y e . Furthermore, for a T -periodic target signal y e ,
                                                                                                                 T∗ 2
the optimal reachable trajectory xT ∗ is (uniformly) expo-                             kxt − xr∗   2      r∗
                                                                                              0|t kQ ≥ γky·|t − y·|t kS ,           (14)
nentially stable for the resulting closed-loop system (9).
                                                                        with a later specified positive constant γ. Then, (12a)
                                                                        and (13) imply
PROOF. Part I: Recursive Feasibility: It suffices to
note that feasibility of (8) does not depend on the target                                       (12a)
signal y e . Correspondingly, the input sequence u·|t+1 in                        Wt+1 − Wt ≤ −kxt − xr∗   2
                                                                                                      0|t kQ
                                                      ∗
Theorem 3 with the shifted reference rk|t+1 = rk+1|t         is              (14)
                                                                                                                  T∗ 2
a feasible solution to (8).                                                     ≤ − 1/2(kxt − xr∗   2      r∗
                                                                                               0|t kQ + γky·|t − y·|t kS )
Part II: Stability: Consider a periodic target signal                        (13)
 e       e                               e        e                                                                    T∗ 2
yk|t = yk−1|t+1   , which is denoted by yt+k := yk|t . Thus,                    ≤ − 1/2(kxt − xr∗   2      2   r∗
                                                                                               0|t kQ + γ/Lg kx0|t − x0|t kQ )
                                                        ∗
the minimizer of (7) is a periodic trajectory, i.e., xTk+t  :=                  ≤ − 1/4 min 1, γ/L2g kxt − xTt ∗ k2Q .
                                                                                            
  T∗        T∗                                            e
xk|t = xk−1|t+1 , and we write JT (r, t) := JT (r, y·|t      ),
               e
VT := VT (y·|t   ), with JT (periodically) time-varying in              Case 2: Assume
the second argument and VT constant in time. Define the
                                                                                                                 T∗ 2
candidate Lyapunov function Wt := WT (xt , y·|t    e
                                                      ) − VT                           kxt − xr∗   2      r∗
                                                                                              0|t kQ ≤ γky·|t − y·|t kS .           (15)
                  T           T∗
and the error et := xt − xt . In the following, we show
that there exists a positive constant αW , such that                    Boundedness of h on Zr implies that there exists a con-
                                                                                                  r      r 2
                                                                        stant ymax , such that ky·|t − ỹ·|t kS ≤ ymax , for any tra-
               Wt+1 ≤ Wt − kxt − xr∗   2
                                  0|t kQ ,                  (12a)                  r    r
                                                                        jectories y , ỹ that satisfy the constraints in (7). This
               αW keTt k2Q   ≤ Wt ≤   cv keTt k2Q ,         (12b)       implies

                                                                                                 (15)
holds for all xt such that Problem (8) is feasi-                                 kxt − xr∗   2      r∗     T∗ 2
                                                                                        0|t kQ ≤ γky·|t − y·|t kS ≤ γymax .
ble. The shifted reference r·|k+1 in Part I satisfies
                           ∗
JT (r·|t+1 , t + 1) = JT (r·|t , t). Thus, feasibility in com-
                                                           ∗            For γ ≤ γ1 := 2 /ymax , we have kxt − xr∗   2    2
                                                                                                                0|t kQ ≤  . Thus,
bination with (4) implies Wt+1 − Wt ≤ −`(xt , ut , r0|t      ),
                                                                        Assumption 2 implies
which implies (12a). Lipschitz continuity of g implies
                  T∗                T∗                                    kx∗1|t − xr∗   2        ∗      ∗      ∗
                                                                                    1|t kQ ≤ JN (x·|t , u·|t , r·|t )               (16)
         kxr∗                r∗
           0|t − x0|t kQ ≤ kx·|t − x·|t kQ                   (13)
                                                                          (3)                     (15)
              r∗           T∗             r∗     T∗                                                          T∗ 2
       =kgx (y·|t ) − gx (y·|t )kQ ≤ Lg ky·|t − y·|t kS ,                  ≤ cu kxt − xr∗   2         r∗
                                                                                       0|t kQ ≤ γcu ky·|t − y·|t kS ≤ γcu ymax .

                                                                    6
For γ ≤ γ2 := 2 /(4cu ymax ) ≤ γ1 this implies kxt+1 −                       continuity, compare (13). Correspondingly, we have
xr∗                       r
  1|t kQ ≤ /2. Consider yk|t+1 = h(rk|t+1 ), where r·|t+1
is the candidate reference from Part I of the proof. At                       Wt+1 − Wt
time t + 1, define an auxiliary reference                                                                               ∗
                                                                              ≤ JN (x̂, û, r̂) + JT (r̂, t + 1) − JT (r·|t , t) − kxt − xr∗   2
                                                                                                                                          0|t kQ
                                                                              (19),(20)
          ŷ r := βy·|t+1
                    r               T∗
                          + (1 − β)y·|t+1 , β ∈ [0, 1],          (17)            ≤        2cu kxt+1 − xr∗   2          r∗ 2
                                                                                                       1|t kQ − kxt − x0|t kQ
                                                                                                                          T∗ 2
                                                                                                                   r∗
                                                                              − ((1 − β 2 ) − 2cu L2g (1 − β)2 ) ky·|t − y·|t kS
                                                                                |             {z              }
with the corresponding state and input trajectory x̂r =                                        =:c2 (β)
gx (ŷ r ), ûr = gu (ŷ r ), r̂ = (x̂r , ûr ). Convexity (Ass. 6) en-       (13),(16)
                                                                                                                      T∗ 2
sures that the auxiliary reference r̂ is a feasible solution                     ≤        (2c2u γ − c2 (β)/2)ky·|t
                                                                                                               r∗
                                                                                                                   − y·|t kS
to (7) at t + 1. This definition implies                                                               T∗ 2
                                                                              − c2 (β)/(2L2g )kxr∗                     r∗ 2
                                                                                                0|t − x0|t kQ − kxt − x0|t kQ
                                                                                                           T∗ 2
                                                                              ≤(2c2u γ − c2 (β)/2)ky·|t
                                                                                                    r∗
                                                                                                        − y·|t kS
                                       T∗
              ŷ r − y·|t+1
                      r
                            = (1 − β)(y·|t+1    r
                                             − y·|t+1 ).         (18)         − min{1/2, c2 (β)/(4L2g )}kxt − xT0|t∗ k2Q .

The cost JT satisfies                                                         Let β = β2 := arg maxβ∈[β1 ,1] c2 (β), with c2 (β2 ) > 0.
                                                                              For γ ≤ γ3 := c2 (β2 )/(4c2u ), this implies
                                ∗
          JT (r̂, t + 1) − JT (r·|t , t)                                      Wt+1 − Wt ≤ − min 1/2, c2 (β2 )/(4L2g ) kxt − xTt ∗ k2 .
                                                                                               

     =(ŷ r − y·|t+1
               r
                     )> S(ŷ r + y·|t+1
                                  r         e
                                        − 2y·|t+1 )
   (17)
                                                                              Combine: Combining these two cases yields
               T∗
    = (1 − β)(y·|t+1    r
                     − y·|t+1 )> S
                   r
          ((1 + β)y·|t+1           T∗
                         + (1 − β)y·|t+1     e
                                         − 2y·|t+1 )                           Wt+1 ≤Wt − γT kxt − xTt ∗ k2 ,                         (21)
                                                                                                             
                            T∗ 2                                                            c2 (β2 ) 1 γ
     = − (1 − β 2 )ky·|t
                     r∗
                         − y·|t kS                                              γT := min           , ,         , γ := min{γ1 , γ2 , γ3 }.
                     T∗       r
                                                                                             4L2g 4 4L2g
          + (1 − β)(y·|t+1 − y·|t+1 )∇yr JT (y r , y e )|yr =yT ∗
   (10)                                                                       Uniform exponential stability follows using inequali-
                           T∗ 2
    ≤ − (1 − β 2 )ky·|t
                    r∗
                        − y·|t kS .                              (19)         ties (12b), (21) and Lyapunov arguments.          

Lipschitz continuity (compare (13)) implies
                                                                              Theorem 8 ensures exponential stability of the optimal
                                                                              reachable trajectory xT ∗ by showing quadratic lower and
        kxt+1 − x̂r0 kQ ≤ kxt+1 − xr∗         r∗      r
                                   1|t kQ + kx1|t − x̂0 kQ                    upper bounds and an exponential decay of the Lyapunov
                                                                                                           e
                 r
     ≤/2 + Lg ky·|t+1 − ŷ r kS                                              function Wt := WT (xt , y·|t   ) − VT . The exponential de-
                                                                              cay of Wt is shown by utilizing two distinct candidate
   (18)                 r        T∗
    = /2 + Lg (1 − β)ky·|t+1 − y·|t+1 kS .                                   solutions, namely (x̂, û, r̂) and the standard candidate
                                                                              solution from Theorem 3. In particular, we distinguish
                                                                              whether the tracking error kxr∗           2
                                                                                                               0|t −xt kQ is large/small (γ)
                                         √
For β ∈ [β1 , 1] with β1 := 1−/(2Lg ymax ), this implies                     compared to the output tracking cost JT = ky·|t     r∗   e 2
                                                                                                                                     −y·|t kS .
kxt+1 − x̂r0 kQ ≤ . Thus, Assumption 2 ensures that                          If the reference tracking error is large, then the stan-
there exists some state and input sequence (x̂, û), such                     dard candidate solution, e.g. used in Theorem 3, ensures
that (x̂, û, r̂) is a feasible solution to (8) at time t + 1                 a sufficient exponential decrease in the Lyapunov func-
and the tracking cost satisfies                                               tion Wt . On the other hand, if the reference tracking
                                                                              error is small enough (γymax ), then the convexity con-
                      (3)                                                     dition (Ass. 6) ensures that the artificial reference r can
      JN (x̂, û, r̂) ≤ cu kxt+1 − x̂r0 k2Q                      (20)         be incrementally (β) moved towards the optimal reach-
                  r∗ 2                                                        able reference rT ∗ , which decreases the output tracking
   ≤2cu (kxt+1 − x1|t kQ + kxr∗       r 2
                              1|t − x̂0 kQ )                                  cost JT . The local quadratic bound (3) (Ass. 2) on the
   ≤2cu kxt+1 − xr∗   2         2   r
                 1|t kQ + 2cu Lg ky·|t+1 −          ŷ r k2S ,                value function V ensures that the optimization problem
 (18)                                                                         is feasible with the incrementally moved reference r̂ and
                                                  T∗ 2
  ≤ 2cu kxt+1 − xr∗   2        2        2  r∗
                 1|t kQ + 2cu Lg (1 − β) ky·|t − y·|t kS ,                    that the increase in the tracking cost JN is quadratically
                                                                              bounded. Finally, there exists a sufficiently small change
                                                                              (β2 ), such that this auxiliary candidate solution (x̂, û, r̂)
where the second to last inequality follows from Lipschitz                    ensures an exponential decay in Wt .

                                                                          7
Remark 9 Similar to the derivations in [19,21], Theo-              Remark 11 We note that this result guarantees expo-
rem 8 assumes no model mismatch, which is rarely the               nential stability of the optimal reachability trajectory
case in practical applications. To ensure robust recur-            xT ∗ using either terminal equality constraints (Prop. 4)
sive feasibility despite disturbances, the MPC problem (8)         or a terminal cost (Lemma 5), respectively. However,
needs to be adjusted using constraint tightening tech-             the quantitative bounds regarding convergence rate may
niques from robust MPC. A corresponding formulation                differ significantly. In particular, including suitably de-
for nonlinear robust tracking MPC systems can be found             signed terminal ingredients greatly improves the closed-
in the recent paper [27], which combines the formulation           loop performance, which is demonstrated in the numeri-
presented in Section 3.3 with the nonlinear robust MPC             cal examples in Section 4, compare also examples in [15].
formulation in [16].
                                                                   3.3   Online optimized terminal set size
In addition to possible feasibility issues, model mismatch
typically also implies non-zero offset, even in case of con-       In case we use a terminal cost and terminal set
stant references. For the special case of setpoint track-          (Lemma 5), the fact that Zr is chosen in advance (as also
ing (T = 1), this issue is typically resolved using offset-        done in [19,21]) can be disadvantageous. In particular,
free MPC formulations (cf. [25] and references therein),           similar to setpoint stabilization in [5], the terminal set
which rely on a disturbance estimator for constant off-            size α in Lemma 5 is the minimum of two values: α1 and
sets. In order to transfer this concept to T -periodic tra-        α2 . The first (α1 ) is independent of Zr and needs to be
jectories, the dimension of the disturbance model must             such that (2a) holds (cf. [15, Alg. 1]). The second (α2 )
be correspondingly increased. An alternative approach to           ensures constraint satisfaction (2b) and depends on the
ensure offset-free tracking is to use a parameter estima-          difference between Z and Zr . Thus, by choosing Zr we
tion scheme with an adaptive MPC formulation, under                trade achievable terminal set size (α2 ) (and hence con-
appropriate assumptions on the model mismatch, com-                vergence speed of the closed-loop state) against opera-
pare e.g. [4]. Extending the proposed approach to ensure           tion close to the boundary of the constraint set Z. In the
offset free tracking despite deterministic model mismatch          following, we show how α can be optimized online, in-
is an open problem.                                                stead of using a preassigned reference constraint set Zr .
                                                                   In [31], a similar dynamic scaling of the terminal set has
Remark 10 The proposed approach can be extended to                 been suggested for linear setpoint tracking with a poly-
stabilize the economically optimal reachable reference r,          topic terminal set (albeit with a different motivation).
as an extension to the linear approach in [20], assuming           The proposed formulation is geared towards the termi-
that (7) remains (strictly) convex (Ass. 6). The exten-            nal ingredients from Lemma 5 and polytopic constraints
sion of the approaches in [26,9] for economically opti-            of the form Z = {r ∈ Rn+m | Li r ≤ li , i = 1, . . . , nz }.
mal steady-state operation, to optimal dynamic operation           To this end, we define the following functions
based on this result is part of current research. Recently,
                                                                                             −1/2
in [2] for the special case of linear systems, the analysis         LP K,i (r) := kPf               (r)[In , Kf> (r)]L>
                                                                                                                      i k, i = 1, . . . , nz .
in Theorem 8 was extended to stage costs ` with Q semi-
definite using suitable observability conditions, which is         The modified optimization problem is given by
especially relevant for input-output models.
                                                                                                                                  e
                                                                             min           JN (x·|t , u·|t , r·|t ) + JT (r·|t , y·|t )
                                                                         u·|t ,r·|t ,αst
The consideration of nonperiodic dynamic target signals
in this framework is still an open topic. Setpoint sta-             s.t. xk+1|t = f (xk|t , uk|t ), x0|t = xt ,                           (22a)
bilization is a special case in Theorem 8 with T = 1,                    (xk|t , uk|t ) ∈ Z, k = 0, . . . , N − 1                         (22b)
R(r) = r and the feasible steady-state manifold Zr , which                                               √              √
has also been considered in [19]. Compared to [19], Theo-                Vf (xN |t , rN |t ) ≤ (αts )2 , αmin ≤ αts ≤ α1 ,                (22c)
rem 8 ensures exponential stability with convergence rate                xrj+1|t = f (xrj|t , urj|t ), rl+T |t = rl|t ,                   (22d)
cv −γT
   cv   < 1 and Lyapunov function Wt , while [19] uses a                 Li rj|t + LP K,i (rj|t )αts ≤ li , i = 1, . . . , nz   (22e)
proof of contradiction to establish convergence. Further-                j = 0, . . . , T − 1, l = 0, . . . , max{0, N − T }, .
more, the general assumptions on the terminal ingredi-
ents (Ass. 2) allow us to use the continuously parame-             Compared to (8), the reference constraints (8c) and
terized and thus differentiable terminal cost Vf based on          in particular the polytopic constraints Zr are replaced
Lemma 5 (cf. [15]), instead of partitioning Zr , as done           by (22d) and (22e) and we√have one additional op-
in [19]. The practicality of this result is demonstrated in        timization variable αs =       α, where αmin > 0 is
the numerical examples in Section 4. In the setpoint sta-          needed to avoid robustness issues and retain the closed-
bilization case (T = 1) with N = 0, the candidate solu-            loop properties of the original scheme (Thm. 8). The
tions in Theorem 8 correspond to an inner-loop controller                        −1/2
kf (x, r) with a corresponding (explicit) reference gover-         expression Pf      (r) in LP K,i denotes any matrix
nor (cf. [12]) for r given by (17), compare the numerical          square root of X(r) = Pf−1 (r), which can be com-
example in Section 4.                                              puted using the command sqrtm(X) or chol(X) in

                                                               8
Matlab. In the numerical examples (Sec. 4), we imple-                  the polytopic characterization is reduced by online opti-
ment (22e) usingP the symbolic Cholesky decomposition                  mizing a scaling of the polytopic terminal set. The pro-
of X(r) = X0 + i θi (r)Xi , which is suitable for auto-                posed approach, especially the simplified formula (23),
matic differentiation used in CasADi [1].                              can be viewed as an extension of this approach to nonlin-
                                                                       ear systems with ellipsoidal terminal sets. For compari-
Proposition 12 Suppose there exist matrices Pf , Kf                    son, in [19] the reference constraint set Zr is partitioned
and a constant α > 0, such that condition (2a) holds                   and a fixed size αi is considered for each partition Yi ,
for all r, r+ ∈ Z, Vf (x, r) ≤ α1 with xr+ = f (xr , ur ),             which is conservative compared to the online optimized
x+ = f (x, ur + Kf (r)(x − xr )). Assume further that h is             value αs , compare Fig. 1 in Section 4.
bounded on Z and Ass. 6 holds with Zr in (7) replaced by
                                                                       Remark 13 The proposed approach with online opti-
                             √                                         mization of αs also provides a simple means to avoid ter-
  Z̃r = {r| Li r + LP K,i (r) αmin ≤ li , i = 1, . . . , nz }.
                                                                       minal equality constraints as a valuable extension of the
Then the MPC scheme based on (22) satisfies the theo-                  linear periodic tracking MPC in [21]. In case of linear
retical properties in Theorem 8. Consider the scheme (8)               system dynamics with a polytopic terminal set and online
with a given constraint set Zr and correspondingly com-                optimized αs (cf. [31]), the overall optimization problem
puted α2 . If α2 ∈ (αmin , α1 ), the scheme (22) can stabilize         is a quadratic program (QP) with one additional scalar
references r ∈ / Zr and has a larger region of attraction.             variable αs and linear inequality constraints instead of
                                                                       the terminal equality constraint.

PROOF. First, note that (22e) is equivalent to (2b),                   Remark 14 It is possible to further relax the tightened
compare e.g. [6, Equation (10)] based on the support                   reference constraints (22e) or (23), by taking into ac-
function. Thus, every feasible solution                                count the fact that the terminal set is contractive with
                                            √ to (8) is also a         some constant ρ ∈ (0, 1), i.e.,
feasible solution to (22) with αts = α. Recursive fea-
sibility follows with the same candidate input by using
  s
αt+1   = αts∗ , where αts∗ denotes the solution to (22). Parts                 Vf (f (x, kf (x, r)), r+ ) ≤ ρ2 Vf (x, r),     (24)
II and III of Theorem 8 remain true sincepInequality (3)                                               +         r+       r
                                                                               ∀ Vf (x, r) ≤ α1 , r, r ∈ Z, x = f (x , u ), r
holds for all references r·|t ∈ Z̃r with  = αmin /cu > 0.
Satisfaction of Ass. 6 with Z̃r as defined above ensures                                                                     s
                                                                       compare
                                                                       √      √ [15, Prop. 1]. In particular, by redefining α =
that the reference
               √     ŷ r (17) satisfies the constraints
                                               √         in (22)         α − αmin we can replace the constraints (22c),(22e)
with α = αmin . Furthermore, αs > α2 provides a
        s
                                                                       by the following constraints
larger terminal set and thus enlarges
                                    √        the set of feasible
initial conditions. For αs < α2 we can consider ref-                                                        √
                                                                             Vf (xN |t , rN |t ) ≤ (αts +αmin )2 ,
erences r ∈ / Zr (close to the boundary of Z) and thus                                                    √
provide a feasible solution for initial conditions close to                  Lmax,i (αts ρmod(j+T −N,T ) + αmin ) ≤ li − Li rj|t ,
                                                                                      √       √
the constraints and stabilize references r ∈     / Zr .                     αts ∈ [0, α1 − αmin ],

                                                                       where mod denotes the modulo operator. The theoretical
Proposition 12 essentially confirms that this modifica-                properties in Theorem 8 and Proposition 12 remain valid
                                                                                                     s
tion preserves the theoretical properties in Theorem 8.                with the candidate solution αt+1  = ραts∗ , which satisfies
                                                                                                                        √
In summary, the optimization over αs provides an addi-                 αt+1 ≥ max{ρ αt , αmin }, with αt∗ = (αts∗ + αmin )2 .
                                                                                       2 ∗
tional degree of freedom which can significantly enlarge               These relaxed constraints are especially useful in tran-
the terminal set and lead to faster convergence. For nu-               sient operation with active constraints on the reference r
merical reasons, one can also replace the constraint (22e)             and ρT  1.
with the more conservative constraint
                                                                       3.4     Partially decoupled reference updates
       Lmax,i αts + Li rj|t ≤ li , j = 0, . . . T − 1,     (23)
       Lmax,i := max LP K,i (r), i = 1, . . . , nz .
                   r∈Z                                                 In the following, we demonstrate that the joint stabiliza-
                                                                       tion and trajectory planning (Sec. 3) can be partially de-
This formulation also retains the properties in Theo-                  coupled, which can significantly reduce the online com-
rem 8 and the constraint (23) is linear in the optimiza-               putational demand.
tion variables αs , r. This constraint on the reference r is           Motivation: The main premise of the proposed approach
similar to the constraint tightening in robust MPC with                (Sec. 3) is that the operating conditions change on a
a variable tube size αs . In the linear polytopic setpoint             time scale similar to the system dynamics, which in turn
tracking case, a polyhedral invariant set for tracking can             necessitates online updates of the reference trajectory.
be used to constrain the reference r and the terminal                  The most challenging problems are those, where the op-
state x, compare [18, Ass. 2]. In [31] the complexity of               erating conditions change at a similar time scale to the

                                                                   9
system dynamics, while the target signal and hence the                        The result essentially follows from the continuously dif-
optimal system operation is determined based on long                          ferentiable parametrization and the triangular inequal-
term considerations that involve a significantly larger                       ity. In the special case of constant matrices Pf , condi-
time scale, i.e., the period length T is very large. An                       tion (26) is satisfied with Lp = 0. In addition to the
example of such a multi time-scale problem would be                           continuity property (26), we use the fact that the termi-
energy systems, compare e.g. [17], where real time deci-                      nal cost is contractive with some factor ρ ∈ (0, 1), com-
sions are made every 5 min, while the planning horizon                        pare (24).
is 7 days yielding T ≥ 2 · 103 . For such problems, it is
vital that the reference r is updated frequently, while at                    In the following, we summarize the basic approach to
the same time it may be computationally too expensive                         partially decouple the optimization problem (22). Sup-
to solve the joint planning and regulation problem (8)                        pose at time ti , we have trajectories x·|ti , u·|ti , r·|ti ,
in each time step t.                                                          αtsi , that satisfy the constraints in (22). For the next
Continuity terminal cost: In the following, we demon-                         M ∈ N time-steps t = ti , . . . , ti + M − 1, the track-
strate how the optimization problem (8) can be decom-                         ing MPC considers the shifted reference r·|t   ∗
                                                                                                                                 , i.e., rk|t =
                                                                                                                               i
posed into two partially decoupled optimization prob-                          ∗
                                                                              rmod(k+t−t             and the following updated terminal      set
lems that may be solved at different time scales by using                                  i ,T )|ti

the following continuity property of the terminal cost.                       size
                                                                                                                            ∗
Proposition 15 Suppose that the conditions in Lemma 5                              αttr =ρ2(t−ti ) max{αmin , Vf (xN |ti , rN |ti )},        (27)
are satisfied with Zr = Z, Pf = X −1 , Kf = Y Pf ,
                                                                              with the contraction rate ρ according to (24). The closed-
                     p                               p
                     X                               X                        loop input is computed based on the following reference
 X(r) = X0 +               θj (r)Xj , Y (r) = Y0 +         θj (r)Yj ,         tracking MPC (similar to (1))
                     j=1                             j=1
                                                                                       min JN (x·|t , u·|t , r·|t )                         (28a)
with θj continuously differentiable. Then there exists a                                u·|t
constant Lp , such that for any r, r̃ ∈ Z the terminal cost                       s.t. xk+1|t = f (xk|t , uk|t ), (xk|t , uk|t ) ∈ Z,       (28b)
Vf satisfies the following continuity condition
                                                                                       Vf (xN |t , rN |t ) ≤ αttr , x0|t = xt ,             (28c)
                                                                                       k = 0, . . . , N − 1.
         q
             Vf (x, r̃)                                           (26)
         q
     ≤       Vf (x, r)(1 + Lp kr − r̃k) + kxr − x̃r kPf (r̃) .                Note that the contractive terminal constraint (28c) with
                                                                              αttr according to (27) is similar to a contractive MPC [8].
                                                                              In parallel, the following reference optimization problem
                                                                              is solved at time ti in order to obtain an updated refer-
PROOF. The fact that θj is continuously differentiable
                                                                              ence at ti+1 = ti + M
directly implies that X is continuously differentiable
w.r.t. r. Thus, also the matrix Pf = X −1 is continuously                                                                      e
differentiable in r for any r ∈ Z, using the fact X is posi-                                     min            JT (r·|ti+1 , y·|t i+1
                                                                                                                                       )
                                                                                         r·|ti+1 ,αst
                                                                                                        i+1
tive definite with uniform lower and upper bounds. This                                          q
                                                                                             M              ∗
property in combination with compact constraints and                                s.t. ρ           αttri (1
                                                                                                  + Lp krN    +M |ti − rN |ti+1 k)
uniform bounds on Pf ensures that there exists a local                                    q
                                                                                                                         s
Lipschitz constant Lp ≥ 0, such that                                                     + Vf (xr∗
                                                                                                N +M |ti , rN |ti+1 ) ≤ αti+1 ,             (29a)

               Pf (r̃) − Pf (r) ≤ Lp Pf (r)kr − r̃k,                                     xrj+1|ti+1       =     f (xrj|ti+1 , urj|ti+1 ),   (29b)
                                                                                         rl+T |ti+1 = rl|ti+1 ,                             (29c)
which implies                                                                            Li rj|ti+1 + LP K,i (rj|ti+1 )αtsi+1 ≤ li ,        (29d)
                                                                                         √                     √
             q                                                                             αmin ≤ αtsi+1 ≤ α1 ,                             (29e)
 kxkPf (r̃) ≤ kxk2Pf (r) + Lp kxk2Pf (r) kr − r̃k
           q                                                                             i = 1, . . . , nz , j = 0, . . . , T − 1,
=kxkPf (r) 1 + Lp kr − r̃k ≤ kxkPf (r) (1 + Lp kr − r̃k),                                l = 0, . . . , max{0, N + M − T }.

for any r, r̃ ∈ Z. Thus, condition (26) follows from                          Note, that the constraints on the reference r can be fur-
                                                                              ther relaxed using the formula in Remark 14. Since we
                                                                                                                                  e
                                                                              start to solve (29) at time ti , the target signal y·|t     is
   q
     Vf (x, r̃) = kx − x̃r kPf (r̃)                                                                                                   i+1
                                                                              not yet available and instead the currently available tar-
≤kx − xr kPf (r̃) + kxr − x̃r kPf (r̃)                                                    e
                                                                              get signal y·|t i
                                                                                                needs to be shifted by M time steps (as-
 q
                                                                              suming it is T -periodic). The overall algorithm is sum-
≤ Vf (x, r)(1 + Lp kr − r̃k) + kxr − x̃r kPf (r̃) .                 
                                                                              marized in Alg. 1.

                                                                         10
Algorithm 1 Partially decoupled reference updates                          Theorem 3 and the contractivity (24). Correspondingly,
Execute at each time step ti = i · M , i ∈ N                               at time ti+1 , the candidate solution satisfies
           ∗
  Obtain r·|t i
                from reference planner (29).
                                                                                                          ∗              2M tr
  Get xN |ti from Tracking MPC (28).                                                      Vf (xN |ti+1 , rN +M |ti ) ≤ ρ   αti .            (30)
  Compute αttri using (27).
    Tracking MPC                                                           The constraint (29a) ensures
    for t = ti , . . . , ti + M − 1 do
       Update r·|t , αttr .
                                                                                           q
                                                                                                             ∗
                                                                                             Vf (xN |ti+1 , rN |ti+1 )
       Solve tracking MPC (28).
       Apply control input ut = u∗0|t .                                            (26)(30)   q
                                                                                                                  ∗             ∗
                                                                                      ≤ ρM αttri (1 + Lp krN         +M |ti − rN |ti+1 k)
    end for
                                                                                                                            (29a)
    Reference planner
                                                                                             q
                                                                                                                   ∗                s∗
                                e                                                           + Vf (xr∗ N +M |ti , rN |ti+1 ) ≤ αti+1 ,
    Obtain target signal y·|t     i+1
                                      .
    Solve trajectory planning problem (29).
                                                                           which in combination with the update (27) and (29e)
The optimization problem (28) represents a standard                        implies αttri+1 ≤ αts∗i+1 . At time ti+1 a feasible solution
tracking MPC (Sec. 2) that is executed in each time                        to (29) is given by the previous reference r shifted by
                                                                                                        ∗
step t with a fixed (periodic) reference trajectory r and                  M steps, i.e., rj|ti+1 = rmod(j+M,T   )|ti , j = 0, . . . , T − 1,
a shrinking terminal set. On the other hand, the op-                       with the candidate terminal set size
timization problem (29) can be solved in the interval
[ti , ti+M ], thus allowing to solve larger planning prob-
                                                                                                  q                       √      √
                                                                            αtsi+1 = max{ρM           αttri + 0.5(1 − ρM ) αmin , αmin }
lems (T >> 1) by updating the reference r less fre-
quently (M ≥ 1). Condition (29a) constrains how the
updated reference r may deviate from the previous so-                      and condition (29a) strictly satisfied using the fact hat
lution, which partially couples the planning (29) and                      Vf (xr∗
                                                                                N +M |ti , rN |ti+1 ) = 0 by definition.
regulation problem (28). Compared to a joint optimiza-                     Part II. Convergence: Consider the auxiliary candidate
tion, as in (8), the practical convergence under chang-                    reference r̂ based on ŷ r from (17) with some βti ∈ [0, 1].
ing operation conditions may be slower, as the reference                   Suppose βti is chosen, such that kr̂ − r·|ti+1 k ≤ , with
is updated less frequently and the constraint (29a) lim-                   some constant . There exists a constant 1 > 0, such
its the rate of change, compare the numerical example                      that for  ≤ 1 this auxiliary reference satisfies the con-
in Sec. 4. However, the partially decoupled updates in                     straint (29a), with
Alg. 1 can significantly reduce the computational de-
mand, especially in case of longer planning horizons T .                           q                              q
                                                                              ρMαttri (1 + Lp krN |ti+1 − r̂N k) + Vf (xrN |ti+1 , r̂N )
Proposition 16 Suppose the conditions in Prop. 12                             q           √              √
and 15 hold and Alg. 1 is initialized with r·|t   ∗
                                                       , αts∗0 sat-        ≤ρM αttri + ( α1 ρM Lp + cu )1
                                                     0
isfying (29b)–(29e), αttr0 ≤ αts∗0 and xt0 such that (28)
                                                                              q                        √
                                                                           =ρM αttri + (1 − ρM )0.5 αmin ≤ αtsi+1 .
is feasible. Then Alg. 1 is recursively feasible for the re-
sulting closed-loop system. Assume further 4 that there
exists a constant c > 0, such that for every constraint                    We show satisfaction of (29d) for the auxillary reference
i = 1, . . . , nz , we have either inf r∈Z̃r LP K,i (r) ≥ c or             r̂ with a case distinction.        √
supr∈Z̃r LP K,i (r) = 0. For a T -periodic target signal                   Case 1: Suppose that αtsi+1 = αmin . In this case con-
                                ∗
y e , the resulting reference r·|t i
                                     converges to the optimal              dition (29d) is equivalent to r̂ ∈ Z̃r , which is guaranteed
                          T∗                                               by the convexity condition     (Ass. 6) as in Prop. 12.
reachable trajectory r in finite time and the state xt
converges exponentially fast to xTt ∗ .                                                s        M
                                                                                                  q                      √
                                                                           Case 2: αti+1 = ρ        αti + 0.5(1 − ρM ) αmin . Given
                                                                                                       tr

                                                                           Pf , Kf and Pf−1 continuous, Z compact and continuity
PROOF. Part I. Recursive feasibility: First, for t =                       of the quadratic norm, there exists a function δ ∈ K∞ ,
ti + k, k = 1, . . . , M − 1 the reference r·|t satisfies the              such that for any r, r̃ ∈ Z:
tightened constraints (29d) with αts∗i . Thus, feasibility
of (28) at time ti implies recursive feasibility of (28) at                    LP K,i (r) − LP K,i (r̃) ≤ δ(kr − r̃k), i = 1, . . . , nz .
t with the updated terminal set size αttr ≤ αts∗i accord-
ing to (27), the standard MPC candidate solution from                      For constraints i with LP K,i (r̂(j)) = 0 feasibility
                                                                           of (29d) is independent of αs and thus follows from
4
 This condition excludes the special case where Li,x =                     convexity (Ass. 6). For the other constraints i satisfac-
−Li,u Kf (r) for some (but not all) r ∈ Z̃r .                              tion of condition (29d) at ti+1 follows from feasibility

                                                                      11
of (29d) at ti together with the definition of the candi-               to uniform stability in Thm. 8. It is possible to adjust
                          ∗                  tr    s∗
date reference rj|ti+1 = rmod(j+M,T  )|ti , αti ≤ αti and               Alg. 1, such that the reference planner does not require
LP K,i (r̂j ) > 0:                                                      explicit information from the system, by replacing the
                                                                        update αttr in (27) and just using the fact that the termi-
    LP K,i (r̂j )αtsi+1 + Li r̂j                                        nal set is ρ-contractive. Although this may simplify the
                  q                          √                        computation, the closed-loop convergence of the tracking
   ≤LP K,i (r̂j ) ρM αttri + 0.5(1 − ρM ) αmin                          MPC is typically significantly faster, which is why the
                                                   q                    update (27) can speed up the convergence rate of the ref-
     + Li (r̂j − rj|ti+1 ) + li − LP K,i (rj|ti+1 ) αttri               erence planner. Furthermore, it is possible to implement
        √                                                               Alg. 1 in an asynchronous fashion with M changing on-
   ≤li + α1 δ(kr·|ti+1 − r̂k) + kLi kkr·|ti+1 − r̂k
                                                                        line, if the constraint (29a) is adjusted to hold for any
                                 q            √                         M ∈ [Mmin , Mmax ] ⊂ N. This way, the reference plan-
     − (1 − ρM ) LP K,i (r̂j ) ( αttri − 0.5 αmin ) ≤ li ,
                     | {z } |            {z          }                  ner needs to solve (29) until ti + Mmax , but the reference
                             ≥c                   √
                                              ≥0.5 αmin                 can also be updated earlier starting at ti + Mmin .

where the last inequality holds for  ≤ 2 with some
2 > 0. The reference satisfies                                         4      Numerical examples

                      kr·|ti+1 − r̂k ≤ Lg kŷ r − y·|t
                                                   r
                                                       k
                                                      i S               The following examples show the general applicability
               (18)                 T∗             r                    of the proposed method and illustrate advantages com-
               = (1 −      βt )Lg ky·|t i+1
                                              −   y·|t    k ,
                                                       i+1 S            pared to existing approaches. We first benchmark the
                                                                        performance of the proposed approach at the example
with some Lipschitz constant Lg , compare Ass. 6. Thus,                 of setpoint tracking of a continuous stirred tank reac-
                                   r        T∗
choosing βt = max{1 − /(Lg ky·|t    i+1
                                         − y·|t i+1
                                                    kS ), 0} < 1        tor (CSTR). Then, we demonstrate the applicability of
with  = min{1 , 2 }, the candidate reference r̂ is feasi-            nonlinear periodic reference tracking at the example of
               r         T∗
ble. In case ky·|ti+1
                      − y·|t kS ≥ /Lg , this implies                   a ball and plate system. The offline and online compu-
                                                                        tation is done using SeDuMi-1.3 [32] and CasADi [1],
                                 ∗
           JT (r̂, ti+1 ) − JT (r·|t  , ti )                            respectively.
                                    i
        (19)                              √
         ≤ − (1 − βt )(/Lg )2 ≤ −3 /(L3g ymax ),                      4.1     Setpoint tracking - CSTR
                                                  r
with ymax as defined on page 6. In case ky·|t       i+1
                                                        −               The following example demonstrates the performance
  T∗
y·|t kS ≤ /Lg , the candidate reference converges                      benefits of the proposed method, especially the param-
to the optimal reference trajectory in one step, i.e.,                  eterized terminal ingredients (Lemma 5) and the online
βt = 0, JT (r̂, ti+1 ) = VT . This ensures conver-                      optimized terminal set (Prop. 12), at the example of set
gence of JT√to VT and thus r to rT ∗ in at most                         point tracking (T = 1).
Tmax = M (( ymax Lg /)3 + 1) time steps. Exponential                   System model: We consider a CSTR model 5
convergence of x to xT ∗ follows from exponential stabil-                                                                               
                                                                                                                                  M
ity (Thm. 3) and finite time convergence of the reference                                                      (1 − x1 ) − kx1 e− x2
                                                                                      !
                                                                                                           1
                                                                                ẋ1                       θf
r.                                                                     ẋ =              =                            M
                                                                                                                                           
                                                                                ẋ2             1
                                                                                               θf   (xf − x2 ) + kx1 e− x2 − αf u(x2 − xc )

The result in Proposition 16 essential builds on three
                                                                        with the concentration x1 , the temperature x2 and the
properties. First, the constraints on the reference plan-
                                                                        coolant flow rate u, taken from [23]. The discrete-time
ner (29a) and the tracking MPC (28) with the contract-
                                                                        model is defined with an Euler discretization and the
ing terminal set size (27) are such that the proposed al-
                                                                        sampling time h = 0.1s. We consider the setpoint track-
gorithm is recursively feasible. Second, in each time step
                                                                        ing problem using the nonlinear MPC schemes (8), (22)
ti , it is possible to incrementally reduce the tracking cost
                                                                        in Section 3 with T = 1 and the output y = x2 .
JT , which in combination with convexity (Ass. 6) and
                                                                        Offline computations and terminal set: The constraint
compact constraints implies that the reference planner
                                                                        set is given by Z = [0, 1]2 ×[0, 2], the stage cost is Q = I2 ,
converges to the optimal reference rT ∗ in finite time. Fi-
                                                                        R = 0.01, the output weighting is S = 103 and the fea-
nally, once the reference planner converged in finite time,
                                                                        sible reference manifold is
the tracking MPC (28) ensures exponential convergence
to the reference r = rT ∗ (c.f. Thm. 3) and thus expo-
nential convergence of xt to xT ∗ .                                             Zr = {(x, u)| f (x, u) = x, x2 ∈ [0.43, 0.86]}.

Remark 17 We point out that we only showed con-                         5
                                                                          The parameters are θf = 20, k = 300, M = 5, xf =
vergence for this partially coupled approach, as opposed                0.3947, xc = 0.3816, αf = 0.117.

                                                                   12
You can also read