Collision Avoidance with Stochastic Model Predictive Control for Systems with a Twofold Uncertainty Structure - arXiv

Page created by Barry Navarro
 
CONTINUE READING
This work has been accepted to the IEEE 2021 International Conference on Intelligent Transportation Systems.

                                                 Collision Avoidance with Stochastic Model Predictive Control for
                                                           Systems with a Twofold Uncertainty Structure
                                                                       Tim Brüdigam1 , Jie Zhan1 , Dirk Wollherr1 , and Marion Leibold1

                                              Abstract— Model Predictive Control (MPC) has shown to be           handled robustly by using Robust Model Predictive Control
                                           a successful method for many applications that require control.       (RMPC) methods for bounded uncertainties [1], [2]. How-
                                           Especially in the presence of prediction uncertainty, various         ever, these robust controllers are often highly conservative.
                                           types of MPC offer robust or efficient control system behavior.
                                           For modeling, uncertainty is most often approximated in such a           Stochastic Model Predictive Control (SMPC) approaches
arXiv:2106.08463v1 [eess.SY] 15 Jun 2021

                                           way that established MPC approaches are applicable for specific       [3], [4] provide more efficient solutions compared to RMPC
                                           uncertainty types. However, for a number of applications,             by utilizing probabilistic chance constraints instead of hard
                                           especially automated vehicles, uncertainty in predicting the          constraints. These chance constraints enable increased effi-
                                           future behavior of other agents is more suitably modeled by           ciency by allowing a small probability of constraint violation,
                                           a twofold description: a high-level task uncertainty and a low-
                                           level execution uncertainty of individual tasks. In this work, we     limited by a predefined acceptable risk. Various SMPC
                                           present an MPC framework that is capable of dealing with this         methods exist, approximating the chance constraint to obtain
                                           twofold uncertainty. A scenario MPC approach considers the            a tractable representation that may be solved in an optimal
                                           possibility of other agents performing one of multiple tasks,         control problem. In general, each SMPC method considers
                                           with an arbitrary probability distribution, while an analytic         one type of uncertainty within the prediction model.
                                           stochastic MPC method handles execution uncertainty within a
                                           specific task, based on a Gaussian distribution. Combining both          Analytic SMPC approaches [5]–[7] yield an analytic ap-
                                           approaches allows to efficiently handle the twofold uncertainty       proximation of the chance constraint, but these approaches
                                           structure of many applications. Application of the proposed           are mostly restricted to Gaussian uncertainties. In particle-
                                           MPC method is demonstrated in an automated vehicle simula-            based SMPC [8] and Scenario Model Predictive Control
                                           tion study.                                                           (SCMPC) [9], samples of the uncertainty are drawn that
                                                                                                                 are then used to approximate the chance constraint. While
                                                                 I. I NTRODUCTION
                                                                                                                 arbitrary uncertainty distributions are possible, large numbers
                                              Advances in research on automated systems are facilitating         of samples are required to provide sufficient approxima-
                                           the use of controllers for complex applications, which is             tions for some uncertainty distributions, which increases
                                           especially evident for automated vehicles. In many of these           computational complexity. If mixed uncertainty structures
                                           applications, there is one controlled agent, e.g., a vehicle or       best describe the system behavior, the chance constraint
                                           mobile robot, which is required to act and move among other           approximations of these SMPC approaches are not neces-
                                           agents. In order to move efficiently and avoid collisions, it         sarily suitable. In [10] an SMPC framework, S+SC MPC,
                                           is necessary for the controlled agent to anticipate the future        was introduced that utilizes both SCMPC and a Gaussian
                                           behavior of the surrounding agents.                                   uncertainty-based SMPC method, specifically designed for a
                                              The challenge here is that future behavior of other agents is      simple automated vehicle example.
                                           subject to uncertainty. In many applications, this uncertainty           In this paper, we propose an S+SC MPC framework
                                           consists of two types, task uncertainty and task execution            that significantly generalizes the work of [10]. In [10] a
                                           uncertainty. Using automated vehicles as an example, the              simple S+SC MPC framework was specifically designed for
                                           future motion of other surrounding vehicles is first subject          automated vehicles, where only one surrounding vehicle and
                                           to specific maneuvers, such as lane keeping or lane changing.         two possible maneuvers are considered. Here, we present
                                           Second, the execution of these maneuvers may vary again.              a general S+SC MPC framework, applicable to a variety
                                           A lane change may be executed quickly and aggressively, or            of automated systems. We specifically focus on collision
                                           slowly over a longer period of time.                                  avoidance, which requires considering multiple other agents
                                              Model Predictive Control (MPC) is a suitable method                that may perform multiple different tasks.
                                           to plan motion and trajectories for automated systems in                 The proposed S+SC MPC approach utilizes an SCMPC
                                           environments with uncertainty. In MPC an optimal control              approach for task uncertainty and an analytic SMPC ap-
                                           problem is solved on a finite horizon, utilizing prediction           proach for task execution uncertainty. Combining these two
                                           models to take into account the controlled agent dynamics             approaches into a single MPC optimal control problem al-
                                           and the future behavior of other agents. Constraints subject to       lows to efficiently consider the twofold uncertainty structure
                                           environment uncertainty, e.g., for collision avoidance, may be        of many practical applications with task and task execution
                                                                                                                 uncertainty, e.g., automated vehicles [7], [11]–[14]. An au-
                                              1 T. Brüdigam, J. Zhan, D. Wollherr, and M. Leibold are with
                                                                                                                 tomated vehicle simulation study illustrates the applicability
                                           the Chair of Automatic Control Engineering at the Technical Univer-
                                           sity of Munich, Germany. {tim.bruedigam; jie.zhan; dw;                of the proposed S+SC MPC framework.
                                           marion.leibold}@tum.de                                                   The paper is structured as follows. Section II introduces
the problem statement. The S+SC MPC method is derived            denoted by k. The SMPC OCP is given by
in Section III. A simulation study is presented in Section IV,
                                                                      J ∗ = min JN (ξ0 , U )                                             (5a)
followed by conclusive remarks in Section V.                                  U
                                                                     s.t. ξk+1 = f (ξk , uk )                                            (5b)
                II. P ROBLEM S TATEMENT
                                                                          DO
                                                                         ξk+1     =   ADO ξkDO   +B   DO
                                                                                                           uDO
                                                                                                            k    (Ti ) + G   DO
                                                                                                                                  wkex   (5c)
                                                                         uk ∈ U,                            k = 0, ..., N − 1            (5d)
  MPC for collision avoidance with multiple agents requires
                                                                         ξk ∈ X ,                           k = 1, ..., N                (5e)
two prediction models, one for the controlled agent (CA) and
                                                                         Pr ξk ∈ Ξsafe
                                                                                       
one for the dynamic obstacles (DOs) to be avoided.                                k      ≥ β,               k = 1, ..., N                (5f)
  We consider the CA dynamics                                    with U = [u0 , ..., uN −1 ], cost function JN , horizon N ,
                                                                 actuator constraints U, and deterministic state constraints
                     ξk+1 = f (ξk , uk )                   (1)   X . As the DO dynamics (5c) are subject to uncertainty, the
                                                                 chance constraint (5f) is employed for collision avoidance.
depending on the nonlinear function f with state ξk and          At each time step k, the probability of the CA state ξk
input uk at time step k.                                         lying within the safe set Ξsafe       must be larger than the risk
                                                                                                    k
   Two types of uncertainties are considered for the DOs: task   parameter β = h (β ta , β ex ), 0 ≤ β ≤ 1. The function
uncertainty and task execution uncertainty. This distinction     h (β ta , β ex ) indicates that β depends on a task uncertainty risk
reflects the situation of many applications, where the motion    parameter β ta and a task execution uncertainty risk parameter
of surrounding agents is divided into discrete tasks with        β ex .
multiple task execution possibilities.                              It is not possible to directly solve the chance-constrained
   Definition 1 (Tasks): At each time step, a DO decides         OCP. In the following, a method is derived that approximates
to execute exactly one task Ti defined by the task set           the chance constraint (5f) to obtain a tractable OCP. We first
T = {Ti | i = 1, ..., nT }. Each task Ti is assigned a prob-     focus on task uncertainty in Section III-A, followed by task
PnT pi , subject to the probability distribution PT , where
ability                                                          execution uncertainty in Section III-B, which then allows to
   i=1 pi = 1 and 0 < p1 ≤ ... ≤ pnT ≤ 1. A DO input             consider both uncertainties simultaneously as described in
corresponding to task Ti is denoted by uDO (Ti ).                Section III-C.
   Definition 2 (Task Execution): Each task Ti is subject to
                                                                                            III. M ETHOD
a nominal motion governed by the DO dynamics, a reference
state, and an additive Gaussian uncertainty wkex ∼ N (0, Σex        In the following, the S+SC MPC framework is derived,
                                                            k)
with covariance matrix Σex    , representing uncertainty while   starting with individually focusing on SCMPC and SMPC.
                            k
executing task Ti .                                              A. SCMPC for Task Uncertainty
   We consider multiple DOs. The dynamics for a single DO           We first focus on task uncertainty. At each time step,
is then given by                                                 one task is performed. The control action, corresponding
                                                                 to different tasks, may significantly vary between different
        DO
       ξk+1 = ADO ξkDO + B DO uDO
                               k (Ti ) + G
                                           DO ex
                                             wk            (2)   tasks. Therefore, describing task uncertainty with Gaussian
                                                                 noise is impractical, rendering analytic SMPC approaches
with the DO state ξkDO , the input uDO
                                    k as well as the state and   inapplicable. Considering every possible task may lead to
input matrices ADO , B DO , GDO . A DO stabilizing feedback      highly conservative control behavior. However, applying
controller is assumed of the form                                SCMPC is a suitable approach to handle task uncertainty.
                         DO                                      With SCMPC, task uncertainty may be approximated by a
            uDO             ξkDO − ξk,ref
                                    DO
                                               
             k (Ti ) = K                  (Ti )            (3)   small number of samples, as the number of possible tasks is
                                                                 usually small. In this section no task execution uncertainty
with feedback matrix K DO and a reference state ξk,ref
                                                    DO
                                                                 is considered, i.e., wkex = 0.
depending on task Ti . The nominal state, assuming zero             Here, an SCMPC approach inspired by [9] is used. By
uncertainty and task Ti , follows                                drawing K samples from the probability distribution PT , the
                      DO DO
                                                                 task uncertainty is approximated, yielding the set of samples
             ξ DO
               k+1 = A  ξ k + B DO uDO
                                    k (Ti ) .              (4)
                                                                                       S = {si | i = 1, ..., K} ,                         (6)
   Collisions with DOs are avoided by determining a set of       where a task Ti is assigned to each sample si . An agent may
safe states for the CA.                                          execute the same task for multiple time steps. However, the
   Definition 3: The safe set Ξsafe
                               k    for time step k ensures      agent task may change at every time step.
that all CA states ξk ∈ Ξsafe
                          k   guarantee collision avoidance         Assumption 1: Within each SCMPC OCP, each sampled
at time step k.                                                  task is assumed to be executed for the entire prediction
   We now formulate the optimal control problem (OCP) to         horizon.
be solved within this work. Without loss of generality, the         In other words, within the prediction, a sampled task
SMPC OCP starts at time step 0 where prediction steps are        is assumed to continue. This assumption is reasonable, as
multiple tasks may be sampled and a new OCP with new                    with stage cost l and terminal cost Jf .
samples is initiated at each time step.                                   The constraint ξk ∈ Ξsafek  may be described by a set of
   If Assumption 1 holds, a DO input sequence is ob-                    functions
tained for each sample of S.DOThe resulting        input sequence
                                                                                    dk ξk , ξkDO ≥ 0 ⇔ ξk ∈ Ξsafe
                                                                                                 
                                                                                                                              (10)
U DO (si ) = uDO 0 (si )  , ..., u  N −1 (si )  depends  on the indi-                                              k
vidual inputs uDO  (T   ),  performing      task T  corresponding  to
                 k    i                           i
                                                                        with dk = [dk,1 , ..., dk,nd ]> , where nd denotes the number
sample si . Based on U DO (si ), the predicted DO states for
                                                                        of constraint functions.
each sample are obtained according to the DO dynamics (2)
                                                                           In order to find an analytic approximation for the chance
with wkex = 0, resulting in the predicted states ξkDO (si ) for
                                                                        constraint (5f) with only one task, a linearized description
k = 1, ..., N .
                                                                        of the chance constraint is required. Therefore, the nonlinear
   Depending on the predicted DO states, a safe set Ξsafe     k (si )   constraint (10) is linearized around the nominal states with
may be computed for each drawn sample si . Each safe set
                                                                        ξkDO = ξ DO       DO                             DO
                                                                                 k + ek and the prediction error ek . Based on
requires an individual constraint in the SCMPC OCP. There-                 ex
                                                                        wk , the prediction error follows ek ∼ N (0, Σek ) where
                                                                                                              DO
fore, for the SCMPC approach, the chance constraint (5f) is
adapted to                                                                                                                >
                                                                                     Σek+1 = ΦΣek Φ> + GDO Σex
                                                                                                            kG
                                                                                                               DO
                                                                                                                                    (11)
     Pr ξk ∈ Ξsafe               ta
                        
                 k (s) ≥ β , k = 1, ..., N, s ∈ S.                (7)
                                                                        with Φ = ADO + B DO K DO .
Multiple methods exist to generate safe sets, e.g., signed                The resulting linearized description of (10) is
distance [15] or grid-based methods [13].
                                                                                    dk ξk , ξ DO   + ∇dDO   DO
                                                                                                 
   The sample size K depends on the chosen risk parameter.                                    k         k ek ≥ 0                    (12)
We propose a strategy to obtain K that focuses on the least
                                                                        with
likely task T1 in T .
   Theorem 1: The sample size                                                                       ∂dk
                                                                                         ∇dDO
                                                                                           k =                        .             (13)
                                                                                                    ∂ξkDO
                                 1 − β ta
                                         
                                                                                                            ξk ,ξDO
                                                                                                                 k
                  K > log1−p1                           (8)
                                   p1                                     The linearized chance constraint is then given by
ensures that the probability of not having sampled the least
                                                                               Pr ∇dDO    DO               DO
                                                                                                                 ≥ β ex ,
                                                                                                              
probable task T1 , if it later occurs, is lower than the allowed                      k ek ≥ −dk ξk , ξ k                           (14)
risk 1 − β ta , i.e., (7) is satisfied.                                 which is still a probabilistic expression. However, (14) may
      Proof: The proof is based on [10]. Given independent              be approximated into an analytic expression similar to [10].
and identically distributed samples, the worst-case probabil-             Theorem 2: The probabilistic chance constraint (14) may
ity of not sampling task T1 , if it later occurs, is given by           be approximated by the analytic expression
p1 (1 − p1 )K . The sample size K in (8) then follows from
solving for K with 1 − β ta > p1 (1 − p1 )K , i.e., bounding                 dk,i ξk , ξ DO
                                                                                            
                                                                                         k    ≥ γk,i                            (15a)
the worst-case probability given the risk parameter β ta .                          q
                                                                                                       DO >    −1
   If the least likely task T1 is actually performed by the DO,              γk,i = 2∇dDO          e
                                                                                              k,i Σk ∇dk,i erf    (1 − 2β ex )  (15b)
this worst-case probability of not having sampled task T1 is
                                                                        with γk = [γk,1 , ..., γk,nd ]> and 0.5 ≤ β ex ≤ 1.
lower than the acceptable risk, defined by the SCMPC risk
                                                                             Proof: The proof follows     [7], [10]. Dueto (11) it holds
parameter β ta .                                                                                  
                                                                                                                    DO >
   After having introduced an SCMPC approach to handle                  that ∇dDOk e DO
                                                                                     k    ∼    N    0, ∇d DO e
                                                                                                          k,i Σk ∇dk,i     in (14). The
task uncertainty, the following section introduces an analytic          quantile function for univariate normal distributions allows
SMPC approximation for task execution uncertainty.                      to reformulate (14) into (15).
                                                                           Note that ∇dDO
                                                                                        k,i is defined similar to (13). The individual
B. SMPC Task Execution Uncertainty                                      approaches for handling task uncertainty and task execution
   We now focus on task execution uncertainty, assuming                 uncertainty are combined in the following section.
only one task is possible. In the DO dynamics (2), task
execution uncertainty is described by the additive Gaussian             C. S+SC MPC Algorithm
uncertainty, representing uncertainty considering the nominal              The results of Section III-A and Section III-B are now
trajectory of a task. Approximating a Gaussian distribution             combined in order to obtain the S+SC MPC framework,
potentially requires a large number of samples, therefore,              which is able to efficiently handle the mixed uncertainty
an analytic SMPC approach is more suitable than SCMPC.                  structure. In addition, multiple DOs are considered with the
The cost function (5a) may depend on the DO uncertainty.                DO dynamics
Therefore, the cost is adjusted based on the expectation                                                      
value, yielding                                                           DO,j
                                                                        ξk+1   = ADO,j ξkDO,j + B DO,j uDO,j
                                                                                                         k    Tij + GDO,j wkex,j (16)
                    −1
                  N
                                                           !
                                                                        with stabilizing feedback matrix K DO,j for the DOs j =
                  X
       JN = E          l (ξk , uk , wkex ) + Jf (ξN , wN
                                                       ex
                                                          )  (9)
                    k=0                                                 1, ..., nDO .
TABLE I: Initial Vehicle Configuration
               TV4                TV5
                                                       TV3                                    EV   TV1   TV2       TV3   TV4    TV5
                                    EV
                                                                           x-pos. (m)         0    -25    25       40    -30    -10
                   TV1                             TV2                     x-vel. (m s−1 )    27   17     27       27    27     22

               Fig. 1: Initial scenario configuration.
                                                                        As a special case of (1), the EV dynamics are represented
   The tractable S+SC MPC OCP for multiple DOs is then                using the linear, discrete-time point mass model
given by                                                                                      EV
                                                                                             ξk+1 = AξkEV + BuEV
                                                                                                              k                       (18)
                  −1
                N
                                                      !
                                                                                                                         >
                X
    ∗                              ex              ex
  J = min E          l (ξk , uk , wk ) + Jf (ξN , wN )  (17a)         with the EV states ξk = [xk , vx,k , yk , vy,k ] and inputs
         U                                                                               >
                    k=0                                               uk = [ux,k , uy,k ] where
 s.t. ξk+1 = f (ξk , uk )                                     (17b)
                                                                                                      0.5∆t2
                                                                                                                       
       DO,j
                                                                            1 ∆t 0 0                                0
     ξ k+1,i   =   ADO,j ξ DO,j
                           k,i    +B    DO,j
                                               uDO,j
                                                k       sji   (17c)          0 1 0 0                ∆t             0 
                                                                       A=   0 0 1 ∆t , B =  0
                                                                                                                         . (19)
     uk ∈ U,                              k = 0, ..., N − 1   (17d)                                                0.5∆t2 
     ξk ∈ X ,                             k = 1, ..., N       (17e)           0 0 0 1                      0          ∆t
                      
     djk,i ξk , ξ DO,j
                  k,i
                            j
                         ≥ γk,i ,         k = 1, ..., N       (17f)      The TV dynamics are assumed to be subject to un-
            r                                                         certainties. In the case of vehicles, tasks are maneuvers.
                                           >
      j
     γk,i =        2∇dDO,j e,j DO,j
                      k,i Σk ∇dk,i  erf −1 (1 − 2β ex ) (17g)         Therefore, we consider maneuver uncertainty and maneuver
                                                                      execution uncertainty. The TV dynamics are in the form
with i = 1, ..., Kj where Kj is determined according to (8)           of (2) with ADO , B DO , states, and inputs similar to (19)
for each DO, given the DOs j = 1, ..., nDO .                          as well as GDO = diag (0.05, 0.067, 0.013, 0.03) account-
   In (17f), an individual approximated chance constraint             ing for diverse TV uncertainty in longitudinal and lateral
is generated for each sample si , depending on Kj . While             direction. The covariance matrix of the normally distributed
this approach is reasonable for a small number of samples,            TV maneuver execution uncertainty wkex ∼ N (0, Σex       k ) is an
it becomes computationally expensive for a larger Kj . A              identity matrix Σexk = diag(1, 1, 1, 1). Furthermore, additive
possible alternative for application is to combine similar            measurement noise νk ∼ N (0, Σνk ) is considered for xTV         k
individual task in order to reduce the number of total                and ykTV with Σνk = diag (0.16, 0.01). The TVs have multiple
constraints. This approach is illustrated in the simulation           maneuver options with associated maneuver probabilities.
example in Section IV.                                                The possible maneuvers consist of lane changes to left (LCL)
   If it is required to guarantee safety or recursive feasibility,    and right (LCR), lane keeping (LK), accelerating (AC),
the proposed S+SC MPC method may be extended by the                   braking (BR), and insignificant acceleration (IA), as well
safety framework for SMPC approaches proposed in [16].                as a combination of the lateral and longitudinal maneuvers,
                                                                      resulting in a total of nine possible maneuvers.
                      IV. S IMULATION S TUDY                             The road consists of three lanes with lane width llane =
   To evaluate the effectiveness of the S+SC MPC algorithm            3.5 m, where the center of the left lane represents y = 0.
presented in Section III-C, a highway scenario involving              All vehicles are aveh = 6 m in length and bveh = 2 m in
five target vehicles (TVs) is simulated, using the Control            width. The initial lateral position of all vehicles coincides
Toolbox [17]. Here, the CA and DOs become ego vehicle                 with the lateral center of the vehicles’ respective lanes with
(EV) and TVs, respectively. The initial vehicle configuration         zero lateral velocity. The initial longitudinal positions and
is depicted in Figure 1.                                              velocities of all vehicles are summarized in Table I. The TV
                                                                                                       TV
                                                                      reference state is chosen as ξref,k          TV
                                                                                                           = [0, vx,ref,k    TV
                                                                                                                          , yref,k , 0]> ,
   We first present the results of the simulation study with                   TV           TV
the proposed S+SC MPC algorithm, and then, for compar-                where vx,ref,k and yref,k may vary over time depending on
ison, we investigate the stand-alone algorithms SMPC and              the scenario. The feedback controller for the TVs is
SCMPC. Eventually, we investigate applying S+SC MPC to
                                                                                                                     
                                                                                      DO      0 −1.0       0       0
varying scenario configurations.                                                   K =                                  .            (20)
                                                                                              0     0     −0.8 −2.2

A. Simulation Setup                                                      To prevent collisions, a region around the TV is inadmis-
                                                                      sible for the EV. This is referred to as the safety constraint,
   All simulations are run on an Intel i5-2500K CPU @
                                                                      where the admissible area is the safe set Ξsafe
                                                                                                                    k . In line with
3.30GHz with 15.6GB RAM. Each simulation consists of
                                                                      [10], we impose a safety constraint modeled as an ellipse.
niter = 100 MPC iterations, which is equivalent to a scenario
                                                                      Its definition adheres to
duration of 20 s with ∆t = 0.2 s. In the following, SI units
are assumed for variables and parameters expressed without                                         2           2
                                                                                             (∆xk )   (∆yk )
units.                                                                               dk =        2
                                                                                                    +        − 1 ≥ 0,                 (21)
                                                                                               a        b2
2                   2
                                                                                kuk kR , Jf = k∆ξN kS , with the cost function weights
                                      TVBR,LCL   TVIA,LCL    TVAC,LCL                                                             2
                                        k+2        k+2         k+2
                                                                                Q, S ∈ R4×4 , and R ∈ R2×2 , as well as kzkZ = z > Zz
   TVIA,LK                            TVBR,LK    TVIA,LK     TVAC,LK
     k                                  k+2        k+2         k+2              and ∆ξk = ξk − ξref,k with reference ξref,k . For positional
                                      TVBR,LCR
                                        k+2      TVIA,LCR
                                                   k+2       TVAC,LCR
                                                               k+2
                                                                                reference tracking in y-direction, the EV reference is set to
                                                                                its current lane center, while vy,ref,k = 0 and vx,ref,k =
     k                 k+1                           k+2
                          MPC time step                                         27 m s−1 . For the cost function, the first element of ∆ξk
                                                                                is neglected, since no reference for xk is imposed. Here,
Fig. 2: Qualitative depiction of the combined safety con-                       Q = S = diag(0, 3, 0.5, 0.1), R = diag(1, 0.1) are selected.
straint ellipse. Safety ellipses for step k + 1 omitted.
                                                                                   While the maneuver probabilities are scenario specific and
                                                                                different task uncertainty risk parameters β ta are evaluated,
where we decompose the distance between the EV and TV                           the task execution risk parameter is chosen to be β ex = 0.8.
                                                             TV
into a longitudinal and a lateral component ∆xk = xEV  k −xk                    In case the original MPC problem is infeasible, a recovery
                EV     TV
and ∆yk = yk − y k .                                                            MPC OCP is solved with slack variables to soften the
   The ellipse center coincides with the TV center. Therefore,                  constraints, as described in [10]. For the recovery OCP,
(21) is fulfilled if the EV center lies outside the inner space                 the slack variable weight in the cost function is λ = 50
or on the edge of the ellipse, i.e., dk ≥ 0. The parameters                     and the task execution risk parameter is changed to a more
a = 30 and b = 2 represent the semi-major and semi-minor                        conservative value βλex = 0.995, to prioritize safety. In case
axis of the ellipse, respectively. The values of a and b are                    the recovery problem fails, the solver selects the last feasible
chosen conservatively, i.e., the area covered by the safety                     point as the solution to the OCP.
ellipse is larger than the vehicle shape.                                          Apart from the safety constraint, the EV plans its motion
   To reduce the number of constraints for sampled TV                           subject to the constraints −1.75 ≤ yk ≤ 8.75, −5 ≤ ux,k ≤
maneuvers, we first introduce a method to adapt the safety                      5, −0.5 ≤ uy,k ≤ 0.5, −1 ≤ ∆ux,k ≤ 1, −0.2 ≤ ∆uy,k ≤
constraint ellipse (21). As an example, we assume that all                      0.2 with ∆ux,k = ux,k − ux,k−1 , ∆uy,k = uy,k − uy,k−1 .
possible maneuvers are sampled. Then, as mentioned as
a possibility in Section III-C, we combine the individual                       B. Simulation Results
constraint ellipses of all sampled maneuvers at each time
step, as shown in Figure 2. If less maneuvers are sampled,                         In the following, the S+SC MPC algorithm is evaluated
the aggregated ellipse only covers the sampled maneuvers.                       in the presented scenario. As mentioned, the maneuver risk
   The result is the aggregated ellipse                                         parameter β ta is varied, resulting in a varying sample size K.
                                                                                Monte Carlo simulations are conducted 150 times for each
                                2                2
                    (∆x̃k )     (∆ỹk )                                         risk parameter value.
              d˜k =      2    +         − 1 ≥ 0,                        (22a)
                       ãk        b̃2k                                             Each simulation consists of two parts. For the first 20
             ∆x̃k = xk − x̃TV                                                   steps, the EV follows a conservative behavior with β ta =
                            k ,                                         (22b)
                                                                                0.999, representing a behavior prediction initialization phase.
             ∆ỹk = yk − ỹkTV ,                                        (22c)   The EV assumes that the TV probabilities for lane changes
                       xTV,IA   +   xTV,BR   +   xTV,AC                         or changes in acceleration are pLC = 0.80 and pAC = pBR =
             x̃TV
               k =
                        k            k            k
                                                             ,          (22d)
                                      3                                         0.40. In case a lane changes is possible to the left or right,
                       ykTV,LK + ykTV,LCL + ykTV,LCR                            pLC is assigned equally. In the second part from step 21 to
             ỹkTV =                                        (22e)               step 100, it is assumed that the EV has adapted its behavior
                                     3
                                                                               prediction. Therefore, the probabilities of TV maneuvers
with center x̃TV       TV
                 k , ỹk    . The longitudinal and lateral posi-                change to pLC = 0.20 and pAC = pBR = 0.10. For the
tion of the TV corresponding to the respective maneuvers                        second part of the simulation, different risk parameters β ta ∈
are indicated by the variables xTV,M    k      and ykTV,M , M ∈                 {0.99, 0.95, 0.89, 0.83} are evaluated. Within the actual sim-
{IA, BR, AC, LK, LCL, LCR}, respectively. The combined                          ulation, all TVs maintain their respective lanes, except TV4,
ellipse exhibits the adjusted semi-major and semi-minor axes                    which moves to the center lane. The reference velocities
                                                                                                       TV1
                                                                                in x-direction are vx,ref    = 22 m s−1 , vx,ref
                                                                                                                              TV2
                                                                                                                                   = 22 m s−1 ,
                                                     2            
                                                                                vx,ref = 17 m s , vx,ref = 17 m s , vx,ref = 27 m s−1 .
                                                                                  TV3           −1    TV4           −1    TV5
  ãk = a + 0.5 xTV,
                 k
                     AC
                        − xTV,
                           k
                               BR
                                  +                         b̃k − b ,   (23a)
                                                 llane                             The result of an individual example with β ta = 0.95 is
  b̃k = b + 0.5 ykTV, LCL − ykTV, LCR .                                 (23b)   illustrated in Figure 3. While there initially is a gap between
                                                                                TV3 and TV5, the EV does not plan to overtake, as a
By generating the aggregated safety ellipse, the number of                      potential lane change of either TV3 or TV5 would result in an
necessary constraints is reduced. As seen in Figure 2, the                      inevitable collision. Therefore, the EV slows down such that
aggregated ellipse does not necessarily cover all individual                    TV5 passes TV3 first. Subsequently, the EV safely moves to
safety ellipses perfectly, which is still reasonable as the                     the left lane to overtake TV3.
individual safety ellipses are designed larger than necessary.                     Even though SMPC, in general, allows a small probability
   For the MPC OCP (17), a prediction horizon N = 12 is                         of constraint violation, in regular scenarios collisions are
                                                         2
selected. The cost function terms are set to l = k∆ξk kQ +                      avoided as the repetitively updated SMPC inputs allow to
TABLE III: SMPC and SCMPC Simulation Results
                                                                                             SMPC                  SCMPC
                                                                           risk parameter     0.8      0.99     0.95   0.89       0.83
                                                                           collisions         79        49       43       45       41
                                                                           cost J100         3.22e4   6.77e4   6.27e4   6.88e4   7.08e4
                                                                           infeasible OCP
                                                                                              31.2     54.1     53.6     55.7     54.7
                                                                           steps
                                                                           infeasible rec.
                                                                                              21.8     33.8     33.6     34.1     34.3
                                                                           OCP steps

Fig. 3: Vehicle motion for simulation steps 21 (top) and 45
(bottom). The EV is shown in red, TVs in blue. Fading boxes               [7], is analyzed with β ex = 0.8. The advantage of S+SC
represent past states.                                                    MPC is that the mixed uncertainty structure is exploited.
                                                                          Applying only SMPC, in order to account for maneuver and
         TABLE II: S+SC MPC Simulation Results                            execution uncertainty, multiple possible maneuvers would
                                                                          need to be approximated by a Gaussian uncertainty. However,
    risk parameter β ta         0.99    0.95     0.89      0.83
                                                                          this would result in a major increase of the safety ellipse,
    collisions                   0       0        0         0             covering the entire road width, rendering overtaking other
    cost J100               3.64e4     3.40e4   3.59e4    3.76e4          TVs impossible. Therefore, in the SMPC simulation, the
    infeasible OCP                                                        SMPC algorithm only accounts for maneuver execution
                                26.3    25.2     24.2      26.6
    steps                                                                 uncertainty.
    infeasible rec.
                                2.2     3.2      5.2       7.4
                                                                             A total of 79 collisions occurred. While the cost is slightly
    OCP steps                                                             lower compared to S+SC MPC, significantly more steps with
                                                                          infeasible OCPs occur, especially for the recovery problem.
constantly adjust. For example, it may not be possible to sat-               In the SCMPC simulation, inspired by [11], the maneuver
isfy the chance constraint for a late prediction step within the          execution uncertainty is approximated by samples. To com-
SMPC horizon, due to an unexpected uncertainty realization.               pare a similar situation as in the SMPC simulation, no task
The OCP is therefore infeasible. However, a collision may                 uncertainty is considered here. Again, a significant number
still be prevented in the next steps, depending on the future             of simulation runs result in collisions, while the cost also
uncertainty realizations. Here, we designed a challenging                 increases compared to S+SC MPC. The steps with infeasible
situation for the EV, as lane changes are considered to be                OCPs appear more often than in the S+SC MPC simulation
probable for all TVs and must be accounted for. The results               runs. The computation times for SMPC and SCMPC are
of the Monte Carlo simulations are shown in Table II.                     similar to S+SC MPC.
   Summarizing the simulation results, the first important
observation is that no collisions occurred. While the safety              D. Varying Vehicle Settings
ellipse is slightly violated in some simulation runs, the safety             So far, only one vehicle setting is considered. Therefore,
ellipse is chosen large enough that no collisions followed.               we additionally ran 150 simulations with randomly chosen
   The performance is evaluated by computing the cost at                  TV settings for each simulation run (similar initial EV state
each time step, based on the actual states and inputs, with               as before). The TVs get assigned random initial positions
                          99
                          X                                               xTV
                                                                            0 ∈ [−150, 150] and are placed on one of the three lanes,
                                        2             2
                 J100 =         k∆ξk+1 kQ + kuk kR .               (24)   i.e., y0 ∈ {0, 3.5, 7}. The constant longitudinal velocity for
                          k=0                                             each TV is randomly chosen according to vxTV ∈ [17, 27] with
The cost remains on a similar level for all risk parameters,              vyTV = 0. It is ensured that all vehicles positioned on similar
where the best choice in this scenario is β ta = 0.95. Lower              lanes have enough longitudinal distance ∆x ≥ 50, and
risk increases conservatism, while high risk results in less              velocities are selected such that TV collisions are avoided.
smooth control inputs, again increasing the cost.                         The proposed S+SC MPC method successfully handled all
   As mentioned before, the potential lane changes of all TVs             150 simulation runs and no collisions occurred.
pose a challenging situation for the EV, resulting in steps                  Overall, S+SC MPC allows exploiting the uncertainty
where the OCP becomes infeasible. However, the steps with                 structure of the simulation setting, achieving adequate perfor-
successfully solved recovery OCPs are significantly more                  mance and avoiding collisions. While the results presented
likely, especially for a low accepted level of risk. The average          here are promising, it is to note that the benefits of the
computation time is 214 ms.                                               proposed method depend on the application setting and to
                                                                          which degree the uncertainty structure may be exploited.
C. Comparison to SMPC and SCMPC
   We now compare the results of S+SC MPC to only                                                V. C ONCLUSION
applying SMPC or SCMPC. the results are shown in Ta-                        The proposed S+SC MPC method allows considering the
ble III. First, an analytic SMPC algorithm, inspired by                   specific uncertainty structure found in many applications,
where both task uncertainty and task execution uncertainty
are present. As SCMPC is suitable for non-Gaussian task
uncertainty and SMPC copes well with Gaussian execution
uncertainty, the combination shows promising results.
   While in this work the S+SC MPC method is applied to
a vehicle scenario, the framework is designed in a general
way, such that it is applicable also to other applications,
e.g., human-robot collaboration. In this robotics setting, a
robotic arm may have the option of moving to one of several
items, while the exact motion towards the specific item may
vary. Without specifically focusing on agents, the proposed
framework may also be applicable to process control or
finance.
                             R EFERENCES
 [1] W. Langson, I. Chryssochoos, S.V. Raković, and D.Q. Mayne. Robust
     model predictive control using tubes. Automatica, 40(1):125 – 133,
     2004.
 [2] J.B. Rawlings, D.Q. Mayne, and M. Diehl. Model Predictive Control:
     Theory, Computation, and Design. Nob Hill Publishing, 2017.
 [3] A. Mesbah. Stochastic model predictive control: An overview and
     perspectives for future research. IEEE Control Systems, 36(6):30–44,
     Dec 2016.
 [4] M. Farina, L. Giulioni, and R. Scattolini. Stochastic linear model
     predictive control with chance constraints – a review. Journal of
     Process Control, 44(Supplement C):53 – 67, 2016.
 [5] A.T. Schwarm and M. Nikolaou. Chance-constrained model predictive
     control. AIChE Journal, 45(8):1743–1752, 1999.
 [6] B. Kouvaritakis, M. Cannon, S.V. Rakovic, and Q. Cheng. Explicit use
     of probabilistic distributions in linear predictive control. Automatica,
     46(10):1719 – 1724, 2010.
 [7] A. Carvalho, Y. Gao, S. Lefevre, and F. Borrelli. Stochastic predictive
     control of autonomous vehicles in uncertain environments. In 12th
     International Symposium on Advanced Vehicle Control, Tokyo, Japan,
     2014.
 [8] L. Blackmore, M. Ono, A. Bektassov, and B.C. Williams. A proba-
     bilistic particle-control approximation of chance-constrained stochastic
     predictive control. Trans. Rob., 26(3):502–517, June 2010.
 [9] G. Schildbach, L. Fagiano, C. Frei, and M. Morari. The scenario
     approach for stochastic model predictive control with bounds on
     closed-loop constraint violations. Automatica, 50(12):3009 – 3018,
     2014.
[10] T. Brüdigam, M. Olbrich, M. Leibold, and D. Wollherr. Combining
     stochastic and scenario model predictive control to handle target
     vehicle uncertainty in autonomous driving. In 21st IEEE International
     Conference on Intelligent Transportation Systems (ITSC), 2018.
[11] G. Schildbach and F. Borrelli. Scenario model predictive control for
     lane change assistance on highways. In 2015 IEEE Intelligent Vehicles
     Symposium (IV), pages 611–616, Seoul, South Korea, June 2015.
[12] G. Cesari, G. Schildbach, A. Carvalho, and F. Borrelli. Scenario model
     predictive control for lane change assistance and autonomous driving
     on highways. IEEE Intelligent Transportation Systems Magazine,
     9(3):23–35, Fall 2017.
[13] T. Brüdigam, F. di Luzio, L. Pallottino, D. Wollherr, and M. Leibold.
     Grid-based stochastic model predictive control for trajectory planning
     in uncertain environments. In 23rd IEEE International Conference on
     Intelligent Transportation Systems (ITSC), 2020.
[14] A. Muraleedharan, A. Tran, H. Okuda, and T. Suzuki. Grid-based
     stochastic model predictive control for trajectory planning in uncertain
     environments. In IFAC World Congress 2020, Berlin, Germany, 2020.
[15] J. Schulman, J. Ho, A. Lee, I. Awwal, H. Bradlow, and P. Abbeel.
     Finding locally optimal, collision-free trajectories with sequential
     convex optimization. In Robotics: Science and Systems 2013, 2013.
[16] T. Brüdigam, M. Olbrich, D. Wollherr, and M. Leibold. Stochastic
     model predictive control with a safety guarantee for automated driving.
     IEEE Transactions on Intelligent Vehicles, pages 1–1, 2021.
[17] M. Giftthaler, M. Neunert, M. Stäuble, and J. Buchli. The Control
     Toolbox - an open-source C++ library for robotics, optimal and
     model predictive control. In 2018 IEEE International Conference
     on Simulation, Modeling, and Programming for Autonomous Robots
     (SIMPAR), pages 123–129, May 2018.
You can also read