Sliding mode control

From Wikipedia, the free encyclopedia
Jump to: navigation, search

In control system, sliding mode control, or SMC, is a nonlinear control method that alters the dynamics of a nonlinear system by application of a discontinuous control signal that forces the system to "slide" along a cross-section of the system's normal behavior. The state-feedback control law is not a continuous function of time. Instead, it can switch from one continuous structure to another based on the current position in the state space. Hence, sliding mode control is a variable structure control method. The multiple control structures are designed so that trajectories always move toward an adjacent region with a different control structure, and so the ultimate trajectory will not exist entirely within one control structure. Instead, it will slide along the boundaries of the control structures. The motion of the system as it slides along these boundaries is called a sliding mode[1] and the geometrical locus consisting of the boundaries is called the sliding (hyper)surface. In the context of modern control theory, any variable structure system, like a system under SMC, may be viewed as a special case of a hybrid dynamical system as the system both flows through a continuous state space but also moves through different discrete control modes.

Introduction[edit]

Figure 1: Phase plane trajectory of a system being stabilized by a sliding mode controller. After the initial reaching phase, the system states "slides" along the line . The particular surface is chosen because it has desirable reduced-order dynamics when constrained to it. In this case, the surface corresponds to the first-order LTI system , which has an exponentially stable origin.

, and the sliding mode along the surface commences after the finite time when system trajectories have reached the surface. In the theoretical description of sliding modes, the system stays confined to the sliding surface and need only be viewed as sliding along the surface. However, real implementations of sliding mode control approximate this theoretical behavior with a high-frequency and generally non-deterministic switching control signal that causes the system to "chatter" in a tight neighborhood of the sliding surface. In fact, although the system is nonlinear in general, the idealized (i.e., non-chattering) behavior of the system in Figure 1 when confined to the surface is an LTI system with an exponentially stable origin.

Intuitively, sliding mode control uses practically infinite gain to force the trajectories of a dynamic system to slide along the restricted sliding mode subspace. Trajectories from this reduced-order sliding mode have desirable properties (e.g., the system naturally slides along it until it comes to rest at a desired equilibrium). The main strength of sliding mode control is its robustness. Because the control can be as simple as a switching between two states (e.g., "on"/"off" or "forward"/"reverse"), it need not be precise and will not be sensitive to parameter variations that enter into the control channel. Additionally, because the control law is not a continuous function, the sliding mode can be reached in finite time (i.e., better than asymptotic behavior). Under certain common conditions, optimality requires the use of bang–bang control; hence, sliding mode control describes the optimal controller for a broad set of dynamic systems.

One application of sliding mode controller is the control of electric drives operated by switching power converters.[2]:"Introduction" Because of the discontinuous operating mode of those converters, a discontinuous sliding mode controller is a natural implementation choice over continuous controllers that may need to be applied by means of pulse-width modulation or a similar technique[nb 1] of applying a continuous signal to an output that can only take discrete states. Sliding mode control has many applications in robotics. In particular, this control algorithm has been used for tracking control of unmanned surface vessels in simulated rough seas with high degree of success.[3][4]

Sliding mode control must be applied with more care than other forms of nonlinear control that have more moderate control action. In particular, because actuators have delays and other imperfections, the hard sliding-mode-control action can lead to chatter, energy loss, plant damage, and excitation of unmodeled dynamics.[5]:554–556 Continuous control design methods are not as susceptible to these problems and can be made to mimic sliding-mode controllers.[5]:556–563

Control scheme[edit]

Consider a nonlinear dynamical system described by

where

is an -dimensional state vector and

is an -dimensional input vector that will be used for state feedback. The functions and are assumed to be continuous and sufficiently smooth so that the Picard–Lindelöf theorem can be used to guarantee that solution to Equation (1) exists and is unique.

A common task is to design a state-feedback control law (i.e., a mapping from current state at time to the input ) to stabilize the dynamical system in Equation (1) around the origin . That is, under the control law, whenever the system is started away from the origin, it will return to it. For example, the component of the state vector may represent the difference some output is away from a known signal (e.g., a desirable sinusoidal signal); if the control can ensure that quickly returns to , then the output will track the desired sinusoid. In sliding-mode control, the designer knows that the system behaves desirably (e.g., it has a stable equilibrium) provided that it is constrained to a subspace of its configuration space. Sliding mode control forces the system trajectories into this subspace and then holds them there so that they slide along it. This reduced-order subspace is referred to as a sliding (hyper)surface, and when closed-loop feedback forces trajectories to slide along it, it is referred to as a sliding mode of the closed-loop system. Trajectories along this subspace can be likened to trajectories along eigenvectors (i.e., modes) of LTI systems; however, the sliding mode is enforced by creasing the vector field with high-gain feedback. Like a marble rolling along a crack, trajectories are confined to the sliding mode.

The sliding-mode control scheme involves

  1. Selection of a hypersurface or a manifold (i.e., the sliding surface) such that the system trajectory exhibits desirable behavior when confined to this manifold.
  2. Finding feedback gains so that the system trajectory intersects and stays on the manifold.

Because sliding mode control laws are not continuous, it has the ability to drive trajectories to the sliding mode in finite time (i.e., stability of the sliding surface is better than asymptotic). However, once the trajectories reach the sliding surface, the system takes on the character of the sliding mode (e.g., the origin may only have asymptotic stability on this surface).

The sliding-mode designer picks a switching function that represents a kind of "distance" that the states are away from a sliding surface.

  • A state that is outside of this sliding surface has .
  • A state that is on this sliding surface has .

The sliding-mode-control law switches from one state to another based on the sign of this distance. So the sliding-mode control acts like a stiff pressure always pushing in the direction of the sliding mode where . Desirable trajectories will approach the sliding surface, and because the control law is not continuous (i.e., it switches from one state to another as trajectories move across this surface), the surface is reached in finite time. Once a trajectory reaches the surface, it will slide along it and may, for example, move toward the origin. So the switching function is like a topographic map with a contour of constant height along which trajectories are forced to move.

The sliding (hyper)surface is of dimension where is the number of states in and is the number of input signals (i.e., control signals) in . For each control index , there is an sliding surface given by

The vital part of SMC design is to choose a control law so that the sliding mode (i.e., this surface given by ) exists and is reachable along system trajectories. The principle of sliding mode control is to forcibly constrain the system, by suitable control strategy, to stay on the sliding surface on which the system will exhibit desirable features. When the system is constrained by the sliding control to stay on the sliding surface, the system dynamics are governed by reduced-order system obtained from Equation (2).

To force the system states to satisfy , one must:

  1. Ensure that the system is capable of reaching from any initial condition
  2. Having reached , the control action is capable of maintaining the system at

Existence of closed-loop solutions[edit]

Note that because the control law is not continuous, it is certainly not locally Lipschitz continuous, and so existence and uniqueness of solutions to the closed-loop system is not guaranteed by the Picard–Lindelöf theorem. Thus the solutions are to be understood in the Filippov sense.[1][6] Roughly speaking, the resulting closed-loop system moving along is approximated by the smooth dynamics ; however, this smooth behavior may not be truly realizable. Similarly, high-speed pulse-width modulation or delta-sigma modulation produces outputs that only assume two states, but the effective output swings through a continuous range of motion. These complications can be avoided by using a different nonlinear control design method that produces a continuous controller. In some cases, sliding-mode control designs can be approximated by other continuous control designs.[5]

Theoretical foundation[edit]

The following theorems form the foundation of variable structure control.

Theorem 1: Existence of Sliding Mode[edit]

Consider a Lyapunov function candidate

where is the Euclidean norm (i.e., is the distance away from the sliding manifold where ). For the system given by Equation (1) and the sliding surface given by Equation (2), a sufficient condition for the existence of a sliding mode is that

in a neighborhood of the surface given by .

Roughly speaking (i.e., for the scalar control case when ), to achieve , the feedback control law is picked so that and have opposite signs. That is,

  • makes negative when is positive.
  • makes positive when is negative.

Note that

and so the feedback control law has a direct impact on .

Reachability: Attaining sliding manifold in finite time[edit]

To ensure that the sliding mode is attained in finite time, must be more strongly bounded away from zero. That is, if it vanishes too quickly, the attraction to the sliding mode will only be asymptotic. To ensure that the sliding mode is entered in finite time,[7]

where and are constants.

Explanation by comparison lemma

This condition ensures that for the neighborhood of the sliding mode ,

So, for ,

which, by the chain rule (i.e., with ), means

where is the upper right-hand derivative of and the symbol denotes proportionality. So, by comparison to the curve which is represented by differential equation with initial condition , it must be the case that for all . Moreover, because , must reach in finite time, which means that must reach (i.e., the system enters the sliding mode) in finite time.[5] Because is proportional to the Euclidean norm of the switching function , this result implies that the rate of approach to the sliding mode must be firmly bounded away from zero.

Consequences for sliding mode control

In the context of sliding mode control, this condition means that

where is the Euclidean norm. For the case when switching function is scalar valued, the sufficient condition becomes

.

Taking , the scalar sufficient condition becomes

which is equivalent to the condition that

.

That is, the system should always be moving toward the switching surface , and its speed toward the switching surface should have a non-zero lower bound. So, even though may become vanishingly small as approaches the surface, must always be bounded firmly away from zero. To ensure this condition, sliding mode controllers are discontinuous across the manifold; they switch from one non-zero value to another as trajectories cross the manifold.

Theorem 2: Region of Attraction[edit]

For the system given by Equation (1) and sliding surface given by Equation (2), the subspace for which the surface is reachable is given by

That is, when initial conditions come entirely from this space, the Lyapunov function candidate is a Lyapunov function and trajectories are sure to move toward the sliding mode surface where . Moreover, if the reachability conditions from Theorem 1 are satisfied, the sliding mode will enter the region where is more strongly bounded away from zero in finite time. Hence, the sliding mode will be attained in finite time.

Theorem 3: Sliding Motion[edit]

Let

be nonsingular. That is, the system has a kind of controllability that ensures that there is always a control that can move a trajectory to move closer to the sliding mode. Then, once the sliding mode where is achieved, the system will stay on that sliding mode. Along sliding mode trajectories, is constant, and so sliding mode trajectories are described by the differential equation

.

If an -equilibrium is stable with respect to this differential equation, then the system will slide along the sliding mode surface toward the equilibrium.

The equivalent control law on the sliding mode can be found by solving

for the equivalent control law . That is,

and so the equivalent control

That is, even though the actual control is not continuous, the rapid switching across the sliding mode where forces the system to act as if it were driven by this continuous control.

Likewise, the system trajectories on the sliding mode behave as if

The resulting system matches the sliding mode differential equation

and so as long as the sliding mode surface where is stable (in the sense of Lyapunov), the system can be assumed to follow the simpler condition after some initial transient during the period while the system finds the sliding mode. The same motion is approximately maintained provided the equality only approximately holds.

It follows from these theorems that the sliding motion is invariant (i.e., insensitive) to sufficiently small disturbances entering the system through the control channel. That is, as long as the control is large enough to ensure that and is uniformly bounded away from zero, the sliding mode will be maintained as if there was no disturbance. The invariance property of sliding mode control to certain disturbances and model uncertainties is its most attractive feature; it is strongly robust.

As discussed in an example below, a sliding mode control law can keep the constraint

in order to asymptotically stabilize any system of the form

when has a finite upper bound. In this case, the sliding mode is where

(i.e., where ). That is, when the system is constrained this way, it behaves like a simple stable linear system, and so it has a globally exponentially stable equilibrium at the origin.

Control design examples[edit]

  • Consider a plant described by Equation (1) with single input (i.e., ). The switching function is picked to be the linear combination
where the weight for all . The sliding surface is the simplex where . When trajectories are forced to slide along this surface,
and so
which is a reduced-order system (i.e., the new system is of order because the system is constrained to this -dimensional sliding mode simplex). This surface may have favorable properties (e.g., when the plant dynamics are forced to slide along this surface, they move toward the origin ). Taking the derivative of the Lyapunov function in Equation (3), we have
To ensure is a negative-definite function (i.e., for Lyapunov stability of the surface ), the feedback control law must be chosen so that
Hence, the product because it is the product of a negative and a positive number. Note that
The control law is chosen so that
where
  • is some control (e.g., possibly extreme, like "on" or "forward") that ensures Equation (5) (i.e., ) is negative at
  • is some control (e.g., possibly extreme, like "off" or "reverse") that ensures Equation (5) (i.e., ) is positive at
The resulting trajectory should move toward the sliding surface where . Because real systems have delay, sliding mode trajectories often chatter back and forth along this sliding surface (i.e., the true trajectory may not smoothly follow , but it will always return to the sliding mode after leaving it).
which can be expressed in a 2-dimensional state space (with and ) as
Also assume that (i.e., has a finite upper bound that is known). For this system, choose the switching function
By the previous example, we must choose the feedback control law so that . Here,
  • When (i.e., when ), to make , the control law should be picked so that
  • When (i.e., when ), to make , the control law should be picked so that
However, by the triangle inequality,
and by the assumption about ,
So the system can be feedback stabilized (to return to the sliding mode) by means of the control law
which can be expressed in closed form as
Assuming that the system trajectories are forced to move so that , then
So once the system reaches the sliding mode, the system's 2-dimensional dynamics behave like this 1-dimensional system, which has a globally exponentially stable equilibrium at .

Automated design solutions[edit]

Although various theories exist for sliding mode control system design, there is a lack of a highly effective design methodology due to practical difficulties encountered in analytical and numerical methods. A reusable computing paradigm such as a genetic algorithm can, however, be utilized to transform a 'unsolvable problem' of optimal design into a practically solvable 'non-deterministic polynomial problem'. This results in computer-automated designs for sliding model control. [8]


Sliding mode observer[edit]

Sliding mode control can be used in the design of state observers. These non-linear high-gain observers have the ability to bring coordinates of the estimator error dynamics to zero in finite time. Additionally, switched-mode observers have attractive measurement noise resilience that is similar to a Kalman filter.[9][10] For simplicity, the example here uses a traditional sliding mode modification of a Luenberger observer for an LTI system. In these sliding mode observers, the order of the observer dynamics are reduced by one when the system enters the sliding mode. In this particular example, the estimator error for a single estimated state is brought to zero in finite time, and after that time the other estimator errors decay exponentially to zero. However, as first described by Drakunov,[11] a sliding mode observer for non-linear systems can be built that brings the estimation error for all estimated states to zero in a finite (and arbitrarily small) time.

Here, consider the LTI system

where state vector , is a vector of inputs, and output is a scalar equal to the first state of the state vector. Let

where

  • is a scalar representing the influence of the first state on itself,
  • is a column vector representing the influence of the other states on the first state,
  • is a matrix representing the influence of the other states on themselves, and
  • is a row vector corresponding to the influence of the first state on the other states.

The goal is to design a high-gain state observer that estimates the state vector using only information from the measurement . Hence, let the vector be the estimates of the states. The observer takes the form

where is a nonlinear function of the error between estimated state and the output , and is an observer gain vector that serves a similar purpose as in the typical linear Luenberger observer. Likewise, let

where is a column vector. Additionally, let be the state estimator error. That is, . The error dynamics are then

where is the estimator error for the first state estimate. The nonlinear control law can be designed to enforce the sliding manifold

so that estimate tracks the real state after some finite time (i.e., ). Hence, the sliding mode control switching function

To attain the sliding manifold, and must always have opposite signs (i.e., for essentially all ). However,

where is the collection of the estimator errors for all of the unmeasured states. To ensure that , let

where

That is, positive constant must be greater than a scaled version of the maximum possible estimator errors for the system (i.e., the initial errors, which are assumed to be bounded so that can be picked large enough; al). If is sufficiently large, it can be assumed that the system achieves (i.e., ). Because is constant (i.e., 0) along this manifold, as well. Hence, the discontinuous control may be replaced with the equivalent continuous control where

So

This equivalent control represents the contribution from the other states to the trajectory of the output state . In particular, the row acts like an output vector for the error subsystem

So, to ensure the estimator error for the unmeasured states converges to zero, the vector must be chosen so that the matrix is Hurwitz (i.e., the real part of each of its eigenvalues must be negative). Hence, provided that it is observable, this system can be stabilized in exactly the same way as a typical linear state observer when is viewed as the output matrix (i.e., ""). That is, the equivalent control provides measurement information about the unmeasured states that can continually move their estimates asymptotically closer to them. Meanwhile, the discontinuous control forces the estimate of the measured state to have zero error in finite time. Additionally, white zero-mean symmetric measurement noise (e.g., Gaussian noise) only affects the switching frequency of the control , and hence the noise will have little effect on the equivalent sliding mode control . Hence, the sliding mode observer has Kalman filter–like features.[10]

The final version of the observer is thus

where

  • ,
  • , and
  • .

That is, by augmenting the control vector with the switching function , the sliding mode observer can be implemented as an LTI system. That is, the discontinuous signal is viewed as a control input to the 2-input LTI system.

For simplicity, this example assumes that the sliding mode observer has access to a measurement of a single state (i.e., output ). However, a similar procedure can be used to design a sliding mode observer for a vector of weighted combinations of states (i.e., when output uses a generic matrix ). In each case, the sliding mode will be the manifold where the estimated output follows the measured output with zero error (i.e., the manifold where ).

See also[edit]

Notes[edit]

  1. ^ Other pulse-type modulation techniques include delta-sigma modulation.

References[edit]

  1. ^ a b Zinober, A.S.I., ed. (1990). Deterministic control of uncertain systems. London: Peter Peregrinus Press. ISBN 978-0-86341-170-0. 
  2. ^ Utkin, Vadim I. (1993). "Sliding Mode Control Design Principles and Applications to Electric Drives". IEEE Transactions on Industrial Electronics. IEEE. 40 (1): 23–36. doi:10.1109/41.184818. 
  3. ^ "Autonomous Navigation and Obstacle Avoidance of Unmanned Vessels in Simulated Rough Sea States - Villanova University"
  4. ^ Mahini; et al. (2013). "An experimental setup for autonomous operation of surface vessels in rough seas". Robotica. 31 (5): 703–715. doi:10.1017/s0263574712000720. 
  5. ^ a b c d Khalil, H.K. (2002). Nonlinear Systems (3rd ed.). Upper Saddle River, NJ: Prentice Hall. ISBN 0-13-067389-7. 
  6. ^ Filippov, A.F. (1988). Differential Equations with Discontinuous Right-hand Sides. Kluwer. ISBN 978-90-277-2699-5. 
  7. ^ Perruquetti, W.; Barbot, J.P. (2002). Sliding Mode Control in Engineering. Marcel Dekker Hardcover. ISBN 0-8247-0671-4. 
  8. ^ Li, Yun; et al. (1996). "Genetic algorithm automated approach to the design of sliding mode control systems" (PDF). International Journal of Control. 64 (3): 721–739. doi:10.1080/00207179608921865. 
  9. ^ Utkin, Vadim; Guldner, Jürgen; Shi, Jingxin (1999). Sliding Mode Control in Electromechanical Systems. Philadelphia, PA: Taylor & Francis, Inc. ISBN 0-7484-0116-4. 
  10. ^ a b Drakunov, S.V. (1983). "An adaptive quasioptimal filter with discontinuous parameters". Automation and Remote Control. 44 (9): 1167–1175. 
  11. ^ Drakunov, S.V. (1992). Sliding-Mode Observers Based on Equivalent Control Method. Proceedings of the 31st IEEE Conference on Decision and Control (CDC),. pp. 2368–2370. ISBN 0-7803-0872-7. 

Further reading[edit]