Massachusetts Institute of Technology Department of Electrical Engineering and Computer Science 6.243j (Fall 2003): DYNAMICS OF NONLINEAR SYSTEMS by A. Megretski Lecture 13: Feedback Linearization 1 Using control authority to transform nonlinear models into linear ones is one of the most commonly used ideas of practical nonlinear control design. Generally, the trick helps one to recognize “simple” nonlinear feedback design tasks. 13.1 Motivation and objectives In this section, we give a motivating example and state technical objectives of theory of feedback linearization. 13.1.1 Example: fully actuated mechanical systems Equations of rather general mechanical systems can be written in the form M (q(t))?q(t) + F (q(t),q˙(t)) = u(t), (13.1) where q(t) ? R k is the position vector, u(t) is the vector of actuation forces and torques, F : R k £ R k ≤? R k is a given vector-valued function, and M : R k ≤? R k£k is a given function taking positive deflnite symmetric matrix values (the inertia matrix). When u = u(t) is flxed (for example, when u(t) = u 0 cos(t) is a harmonic excitation), analysis of (13.1) is usually an extremely di–cult task. However, when u(t) is an unrestricted control efiort to be chosen, a simple change of control variable u(t) = M (q(t))(v(t) + F (q(t),q˙(t))) (13.2) transforms (13.1) into a linear double integrator model q?(t) = v(t). (13.3) 1 Version of October 29, 2003 2 The transformation from (13.1) to (13.3) is a typical example of feedback linearization, which uses a strong control authority to simplify system equations. For example, when (13.1) is an underactuated model, i.e. when u(t) is restricted to a given subspace in R k , the transformation in (13.2) is not valid. Similarly, if u(t) must satisfy an a-priori bound, conversion from v to u according to (13.2) is not always possible. In addition, feedback linearization relies on access to accurate information, in the current example – precise knowledge of functions M, F and precise measurement of coor- dinates q(t) and velocities ˙q(t). While in some cases (including the setup of (13.1)) one can extend the beneflts of feedback linearization to approximately known and imperfectly observed models, information ow constraints remain a serious obstacle when applying feedback linearization. 13.1.2 Output feedback linearization Output feedback linearization can be viewed as a way of simplifying a nonlinear ODE control system model of the form x˙(t) = f(x(t)) + g(x(t))u(t), (13.4) y(t) = h(x(t)), (13.5) where x(t) ? U is the state vector ranging over a given open subset X 0 of R n , u(t) ? R m is the control vector, y(t) ? R m is the output vector, f : X 0 ≤? R n , h : X 0 ≤? R m , and g : X 0 ≤? R n£m are given smooth functions. Note that in this setup y(t) has same dimension as u(t). The simpliflcation is to be achieved by flnding a feedback transformation v(t) = fl(x(t)) + fi(x(t))u(t), (13.6) and a state transformation z(t) = [z l (t); z 0 (t)] = ?(x(t)), (13.7) where ? : X 0 ≤? R n , fl : X 0 ≤? R m , fi : X 0 ≤? R m£m are continuously difierentiable functions, such that the Jacobian of ? is not singular on X 0 , and the relation between v(t), y(t) and z(t) subject to (13.6), (13.7) has the form z˙ l (t) = Az l (t) + Bv(t), y(t) = Cz l (t), (13.8) z˙ 0 (t) = a 0 (z l (t), z 0 (t)), (13.9) where A, B, C are constant matrices of dimensions k-by-k, k-by-m, and m-by-k respec- tively, such that the pair (A, B) is controllable and the pair (C, A) is observable, and a 0 : R k £ R n?k ≤? R n?k is a continuously difierentiable function. 3 More precisely, it is required that for every solution x : [t 0 ,t 1 ] ≤? X 0 , u : [t 0 ,t 1 ] ≤? R m , y : [t 0 ,t 1 ] ≤? R m of (13.4), (13.5) equalities (13.8), (13.9) must be satisfled for z(t),v(t) deflned by (13.6) and (13.7). As long as accurate measurements of the full state x(t) of the original system are available, X 0 = R n , and the behavior of y(t) and u(t) is the only issue of interest, the output feedback linearization reduces the control problem to a linear one. However, in a ddition to sensor limitations, X 0 is rarely the whole R n , and the state x(t) is typically required to remain bounded (or even to converge to a desired steady state value). Thus, it is frequently impossible to ignore equation (13.9), which is usually refered to as the zero dynamics of (13.4),(13.5). In the best scenario (the so-called “minimum phase systems”), the response of (13.9) to all expected initial conditions and reference signals y(t) can be proven to be bounded and generating a response x(t) conflned to X 0 . In general, the area X 0 on which feedback linearization is possible does not cover of states of interest, the zero dynamics is not as stable as desired, and hence the beneflts of output feedback linearization are limited. 13.1.3 Full state feedback linearization Formally, full state feedback linearization applies to nonlinear ODE control system model of the form (13.4), without a need for a particular output y(t) to be specifled. As in the previous subsection, the simpliflcation is to be achieved by flnding a feedback transformation (13.6) and a state transformation z(t) = ?(x(t)) (13.10) with a non-singular Jacobian. It is required that for every solution x : [t 0 ,t 1 ] ≤? X 0 , u : [t 0 ,t 1 ] ≤? R m of (13.4) equality z˙(t) = Az(t) + Bv(t) (13.11) must be satisfled for z(t),v(t) deflned by (13.6) and (13.10). It appears that the beneflts of having a full state linearization are substantially greater than those delivered by an output feedback linearization. Unfortunately, among systems of order higher than two, the full state feedback linearizable ones form a set of “zero mea- sure”, in a certain sense. In other words, unlike in the case of output feedback lineariza- tion, which is possible, at least locally, “almost always”, full state feedback linearizability requires certain equality constraints to be satisfled for the original system data, and hence does not take place in a generic setup. 13.2 Feedback linearization with scalar control This section contains basic results on feedback linearization of single-input systems (the case when m = 1 in (13.4)). 4 13.2.1 Relative degree and I/O feedback linearization Assume that functions h,f,g in (13.4),(13.5) are at least q + 1 times continuously difier- entiable. We say that system (13.4),(13.5) has relative degree q on X 0 if ∈h 1 (? x) = 0,...,∈h q?1 (? x) = 0, ∈h q (? x) → x ? X 0 ,x)g(? x)g(? x)g(? = 0 ? ? where h i : X 0 ≤? R are deflned by h 1 = h, h i+1 = (∈h i )f (i = 1,...,q). By applying the deflnition to the LTI case f(x) = Ax, g(x) = B, h(x) = Cx one can see that an LTI system with a non-zero transfer function always has a relative degree, which equals the difierence between the degrees of numerator and denominator of its transfer function. It turns out that systems with well deflned relative degree are exactly those for which input/output feedback linearization is possible. Theorem 13.1 Assuming that h,f,g are continuously difierentiable n + 1 times, the following conditions are equivalent: (a) system (13.4),(13.5) has relative degree q; (b) system (13.4),(13.5) is input/output feedback linearizable. Moreover if conditions (a) is satisfled then x) with k = 1,...,q are linearly independent for every x ? X 0 (i) the gradients ∈h k (? ? (which, in particular, implies that q ? n); (ii) vectors g k (?x) deflned by g 1 = g, g k+1 = [f,g k ] (k = 1,...,q ? 1) satisfy ∈h i (? x) = ∈h i+j?1 (? x) ? ?x)g k (? x)g(? x ? X 0 for i+ j ? q + 1; (iii) feedback linearization is possible with k = q, x) = ∈h q (? x), fl(? x)f(?fi(? x)g(? x) = ∈h q (? x), ? ? h 1 (?x) ?? h 2 (?x) ? ? z? l = ? l (?x) = ? . ? . . ? . ? h q (?x) 5 Note that, unlike the Frobenius theorem, Theorem 13.1 is not local: it provides feed- back linearization on every open set X 0 on which the relative degree is well deflned. Also, in the case of linear models, where f(x) = Ax and g(x) = B, it is always possible to get the zero dynamics depending on y only, i.e. to ensure that a 0 (z l ,z 0 ) = ?a 0 (Cz l ,z 0 ). This, however, is not always possible in the nonlinear case. For example, for system ? ? ? ? x 1 x 2 d ? x 2 ? = ? u ? , y = x 1 dt 2 x 3 x 1 + x 2 there exists no function p : X 0 ≤? R deflned on a non-empty open subset of R 3 such that ∈p(x)f(x) = b(x 1 ,p(x)), ∈p(x)g(x) = 0, ∈p(x) →= 0 ? x ? X 0 . Indeed, otherwise the system with new output y new = p(x) would have relative degree 3, which by Theorem 13.1 implies that (∈p)g 1 = (∈p)g 2 = 0, and hence by the Frobenius theorem the vector flelds ? ? ? ? 0 1 g 1 (x) = ? 1 ? , g 2 (x) = ? 0 ? 0 2x 2 would deflne an involutive distribution, which they do not. 13.2.2 Involutivity and full state feedback linearization It follows from Theorem 13.1 that system (13.4), (13.5) which has maximal possible relative degree n is full state feedback linearizable. The theorem also states that, given smooth functions f,g, existence of h deflning a system with relative degree n implies linear x),..., n (? ?independence of vectors g 1 (? x) for all x ? X 0 , and involutivity of the regular distribution deflned by vector flelds g 1 ,...,g n?1 . The converse is also true, which allows one to state the following theorem. Theorem 13.2 Let f : X 0 ≤? R n and g : X 0 ≤? R n be n + 1 times continuously difierentiable functions deflned on an open subset X 0 of R n . Let g k with k = 1,...,n be deflned as in Theorem 13.1. x),..., n (?(a) If system (13.4) is full state feedback linearizable on X 0 then vectors g 1 (? x) ?form a basis in R n for all x ? X 0 , and the distribution deflned by vector flelds g 1 ,...,g n?1 in involutive on X 0 . (b) If for some ? x),..., n (?x 0 ? X 0 vectors g 1 (? x) form a basis in R n , and the distribution deflned by vector flelds g 1 ,...,g n?1 in involutive in a neigborhood of ?x 0 , there exists ? ? an open subset X 0 of X 0 such that ?x 0 ? X 0 and system (13.4) is full state feedback ? linearizable on X 0 .