Massachusetts Institute of Technology
Department of Electrical Engineering and Computer Science
6.243j (Fall 2003): DYNAMICS OF NONLINEAR SYSTEMS
by A. Megretski
Lecture 10: Singular Perturbations and Averaging
1
This lecture presents results which describe local behavior of parameter-dependent ODE
models in cases when dependence on a parameter is not continuous in the usual sense.
10.1 Singularly perturbed ODE
In this section we consider parameter-dependent systems of equations
‰
x˙(t) = f(x(t),y(t),t),
(10.1)
?y˙ = g(x(t),y(t),t),
where ? → [0,?
0
] is a small positive parameter. When ? > 0, (10.1) is an ODE model.
For ? = 0, (10.1) is a combination of algebraic and difierential equations. Models such
as (10.1), where y represents a set of less relevant, fast changing parameters, are fre-
quently studied in physics and mechanics. One can say that singular perturbations is the
“classical” approach to dealing with uncertainty, complexity, and nonlinearity.
10.1.1 The Tikhonov’s Theorem
A typical question asked about the singularly perturbed system (10.1) is whether its
solutions with ? > 0 converge to the solutions of (10.1) with ? = 0 as ? ? 0. A su–-
cient condition for such convergence is that the Jacobian of g with respect to its second
argument should be a Hurwitz matrix in the region of interest.
Theorem 10.1 Let x
0
: [t
0
,t
1
] ∞? R
n
, y
0
: [t
0
,t
1
] ∞? R
m
be continuous functions
satisfying equations
x˙
0
(t) = f(x
0
(t),y
0
(t),t), 0 = g(x
0
(t),y
0
(t),t),
1
Version of October 15, 2003
2
where f : R
n
£ R
m
£ R ∞? R
n
and g : R
n
£ R
m
£ R ∞? R
m
are continuous functions.
Assume that f,g are continuously difierentiable with respect to their flrst two arguments
in a neigborhood of the trajectory x
0
(t),y
0
(t), and that the derivative
A(t) = g
2
?
(x
0
(t),y
0
(t),t)
is a Hurwitz matrix for all t → [t
0
,t
1
]. Then for every t
2
→ (t
0
,t
1
) there exists d > 0 and
C > 0 such that inequalities |x
0
(t) ? x(t)|? C? for all t → [t
0
,t
1
] and |y
0
(t) ? y(t)|? C?
for all t → [t
2
,t
1
] for all solutions of (10.1) with |x(t
0
) ? x
0
(t
0
)|? ?, |y(t
0
) ? y
0
(t
0
)|? d,
and ? → (0,d).
The theorem was originally proven by A. Tikhonov in 1930-s. It expresses a simple
principle, which suggests that, for small ? > 0, x = x(t) can be considered a constant
when predicting the behavior of y. From this viewpoint, for a given t
?
→ (t
0
,t
1
), one can
expect that
y(t
?
+ ??) … y
1
(?),
where y
1
: [0,?) is the solution of the “fast motion” ODE
y˙
1
(?) = g(x
0
(t
?
),y
1
(?)), y
1
(0) = y(t
?
).
Since y
0
(t
?
) is an equilibrium of the ODE, and the standard linearization around this
equilibrium yields
–
˙
(?) … A(t
?
)–(?)
where –(?) = y
1
(?) ? y
0
(t
?
), one can expect that y
1
(?) ? y
0
(t
?
) exponentially as ? ? ?
whenever A(t
?
) is a Hurwitz matrix and |y(t
?
) ? y
0
(t
?
)| is small enough. Hence, when ? > 0
is small enough, one can expect that y(t) … y
0
(t).
10.1.2 Proof of Theorem 10.1
First, let us show that the interval [t
0
,t
1
] can be subdivided into subintervals ¢
k
=
[?
k?1
,?
k
], where k → {1,2,...,N} and t
0
= ?
0
< ?
1
< ¢¢¢ < ?
N
= t
1
in such a way that
for every k there exists a symmetric matrix P
k
= P
k
?
> 0 for which
P
k
A(t) + A(t)
?
P
k
< ?I ? t → [?
k?1
,?
k
].
Indeed, since A(t) is a Hurwitz matrix for every t → [t
0
,t
1
], there exists P(t) = P(t)
?
> 0
such that
P(t)A(t) + A(t)
?
P(t) < ?I.
Since A depends continuously on t, there exists an open interval ¢(t) such that t → ¢(t)
and
P(t)A(?) + A(?)
?
P(t) < ?I ? ? → ¢(t).
3
Now the open intervals ¢(t) with t → [t
0
,t
1
] cover the whole closed bounded interval
[t
0
,t
1
], and taking a flnite number of t
?
k
, k = 1,...,m such that [t
0
,t
1
] is completely
covered by ¢(t
?
k
) yields the desired partition subdivision of [t
0
,t
1
].
Second, note that, due to the continuous difierentiability of f,g, for every ? > 0 there
exist C,r > 0 such that
? ? ? ?
|f(x
0
(t) + –
x
,y
0
(t) + –
y
,t) ?f(x
0
(t),y
0
(t),t)|? C(|–
x
|+ |–
y
|)
and
? ? ? ? ?
|g(x
0
(t) + –
x
,y
0
(t) + –
y
,t) ?A(t)–
y
|? C|–
x
|+ ?|–
y
|
?
for all t → R, –
?
x
→ R
n
, –
y
→ R
m
satisfying
? ?
t → [t
0
,t
1
], |–
x
?x
0
(t)|? r, |–
y
?y
0
(t)|? r.
For t → ¢
k
let
|–
y
|
k
= (–
y
?
P
k
–
y
)
1/2
.
Then, for
–
x
(t) = x(t) ?x
0
(t), –
y
(t) = y(t) ?y
0
(t),
we have
|–
˙
x
|? C
1
(|–
x
|+ |–
y
|
k
),
?|–
˙
y
|
k
??q|–
y
|
k
+ C
1
|–
x
|+ ?C
1
(10.2)
as long as –
x
,–
y
are su–ciently small, where C
1
,q are positive constants which do not
depend on k. Combining these two derivative bounds yields
d
(|–
x
|+ (?C
1
/q)|–
y
|) ? C
2
|–
x
|+ ?C
2
dt
for some constant C
2
independent of k. Hence
|–
x
(?
k?1
+ ?)|? e
C
3
?
(|–
x
(?
k?1
)|+ (?C
1
/q)|–
y
(?
k?1
)|) + C
3
?
for ? → [0,?
k
??
k?1
]. With the aid of this bound for the growth of |–
x
|, inequality (10.2)
yields a bound for |–
y
|
k
:
|–
y
(?
k?1
+ ?)|? exp(?q?/?)|–
y
(?
k?1
)|+ C
4
(|–
x
(?
k?1
)|+ (?C
1
/q)|–
y
(?
k?1
)|) + C
4
?,
which in turn yields the result of Theorem 10.1.
4
10.2 Averaging
Another case of “potentially discontinuous” dependence on parameters is covered by the
following “averaging” result.
Theorem 10.2 Let f : R
n
£ R £ R ∞? R
n
be a continuous function which is ?-periodic
with respect to its second argument t, and continuously difierentiable with respect to its
flrst argument. Let ?x
0
→ R
n
be such that f(?x
0
, t, ?) = 0 for all t, ?. For ?x → R
n
deflne
?
?
?
f(?x,?) = f(?x, t, ?).
0
?
If df/dx|
x=0,?=0
is a Hurwitz matrix, then, for su–ciently small ? > 0, the equilibrium
x · 0 of the system
x˙(t) = ?f(x, t, ?) (10.3)
is exponentially stable.
Though the parameter dependence in Theorem 10.2 is continuous, the question asked
is about the behavior at t = ?, which makes system behavior for ? = 0 not a valid
indicator of what will occur for ? > 0 being su–ciently small. (Indeed, for ? = 0 the
equilibrium ?x
0
is not asymptotically stable.)
To prove Theorem 10.2, consider the function S : R
n
£ R ∞? R
n
which maps x(0), ?
to x(?) = S(x(0), ?), where x(¢) is a solution of (10.3). It is su–cient to show that the
x,?) of S
˙
with respect to its flrst argument, evaluated at ? x
0
derivative (Jacobian) S
˙
(? x = ?
and ? > 0 su–ciently small, is a Schur matrix. Note flrst that, according to the rules on
difierentiating with respect to initial conditions, S
˙
(?x
0
, ?) = ¢(?,?), where
d¢(t, ?) df
= ? (0,t, ?)¢(t, ?), ¢(0,?) = I.
dt dx
?
Consider D(t, ?) deflned by
?
d¢(t, ?) df
? ?
= ? (0,t,0)¢(t,?), ¢(0, ?) = I.
dt dx
?
Let –(t) be the derivative of ¢(t,?) with respect to ? at ? = 0. According to the rule for
difierentiating solutions of ODE with respect to parameters,
?
t
df
–(t) = (0, t
1
,0)dt
1
.
dx
0
Hence
?
–(?) = df/dx|
x=0,?=0
5
is by assumption a Hurwitz matrix. On the other hand,
?
¢(?,?) ? ¢(?,?) = o(?).
Combining this with
?
¢(?,?) = I + –(?)? + o(?)
yields
¢(?,?) = I + –(?)? + o(?).
Since –(?) is a Hurwitz matrix, this implies that all eigenvalues of ¢(?,?) have absolute
value strictly less than one for all su–ciently small ? > 0.