LPV Techniques For Control of An Inverted Pendulum
LPV Techniques For Control of An Inverted Pendulum
LPV Techniques For Control of An Inverted Pendulum
Authorized licensed use limited to: UNIVERSITY PUTRA MALAYSIA. Downloaded on May 27,2024 at 10:00:14 UTC from IEEE Xplore. Restrictions apply.
Note that the second of these constraints is especially difficult where
to handle, as it concerns the internal properties of the controller
and cannot be directly treated through the properties of the
closed-loop system. We shall see, however, that a suitable multi-
objective extension of the polytopic technique provides an effec-
tive means to overcome this difficulty. Another way to handle and
this implementation constraint would have been to tune the
weighting functions until adequate closed-loop plant and open-
loop controller specifications are met. However, this procedure
revealed itself to be intractable in this application and in most in-
stances led to high order controllers, as it requires complicated
weighting functions.
Another important issue is to evaluate the benefits of LPV
synthesis techniques in regard to classical robust control tech-
niques such as H _ andp syntheses. It turns out in this application 4
M,:= -m& , R = 2m2!,!,
that though the H _ and p controllers are capable of providing 3
some stability guarantees they are, as expected, outperformed by
LPV controllers at the performance level.
This article is organized as follows. The next section de-
scribes the modeling of the ADIP and introduces the problem
specifications. A brief review of the LPV synthesis techniques
used for the ADIP is given in the following section. The full
design procedure up to the nonlinear simulations and real ex-
periment results are then presented, followed by some con-
cluding remarks.
All LMI-related computations in the application were per-
formed using the LMI Control Toolbox [23], p controllers were
designed using the p-Analysis and Synthesis Toolbox [22], the
nonlinear simulations were obtained using MATLABI~IMULINK In this applicatiion,the first joint is actuated and the second joint
facilities and LPV controllers were implemented using the REAL- is free. From (2)-(9), we can derive the following equations.
TIMEWORKSHOP.
(MI -M,)cp, + Rcos(cp, -cp,)cp,
LPV Modeling of the ADIP +Rsin(cp, -cp,)$ + ( m , + 2m,)!,gsin(cp,)
In this section, the LPV synthesis model for the ADIP is de-
veloped and the design specification? are introduced. Consider -m,t,gsin(cp,)=%, , (10)
the two-link arm depicted in Fig. 2. It is well known [l], [ 2 ] that
the motion equation is described as
M ( q ) q + C ( q ,q ) + G ( q )= 7, (2)
The main control objective is to maintain the second arm in a ver-
tical position like an inverted pendulum using the rotation of the
(first) actuated arm. In the following, the first arm and the second
arm are called arm and pendulum, respectively. In the physical
experiment corresponding to Fig. 2 the arm is actuated by a mo-
tor driven by a velocity-control power amplifier. The physical
quantities are given as follows:
,
! = 0.13 m, !, = 0.15 m, m, = 0.05 kg,
m2 = 0.03 kg, g = 9.8 m l s 2 . (12)
As the velocity ip, of the first arm can follow the command in-
put voltage U to the amplifier because of the lightness of the sec-
ond arm, we cain assume that the dynamics from the input voltage
to the velocity ip, is almost equally given by
d .
-cp
1 .
=--cp,+-LL
ICu
dt I T , c
This means thait (10) can be simplified to (13) with reasonable ac-
curacy.
Fig. 2. For modeling of A D P On the other hand, (1 1) becomes
February 1999 45
Authorized licensed use limited to: UNIVERSITY PUTRA MALAYSIA. Downloaded on May 27,2024 at 10:00:14 UTC from IEEE Xplore. Restrictions apply.
4 we get
2& cos('pl - cpz I@,+ - 1z@, = g sin(9,) + 21 sin(cp, - cp,)$.
3
(14)
Then, defining
Gathering the equation = r (p with (13) and (19) the fol-
rx :=2t,sin(cp,), ry:=2&,cos(cp,), (15) y.1
lowing simple LPV model is obtained
and using (14), another description for the ADIP is as follows:
0 1 0 0
0 0 0 0
0 0 0 0
The pendulum has two kinds of equilibrium states: 0 0 0 - +
unstable equilibrium state: cp; = 0 (tip pointing upwards)
stable equilibrium state: cpi = n (tip pointing downwards).
P1
In this application, we will only consider the difficult situation
where the tip is pointing upwards. An immediate linearization of
(1 6) around 'pi = 0 then leads to
4
i--+ -1, cp, = ( g + I;)(P,.
3
This yields upper (<) and lower (7,) for the range of variation
-
of ry:
-
<I := [2e I cos(@,
ry E [ry, COS(O)].
(22)
From (22), ry is normalized as
I - ,
Authorized licensed use limited to: UNIVERSITY PUTRA MALAYSIA. Downloaded on May 27,2024 at 10:00:14 UTC from IEEE Xplore. Restrictions apply.
where 0, denotes the new normalized scheduling variables, the difficulty of this problem comes from the fact that the ADIP be-
following LPV model of LFT type is obtained: comes uncontrollable as the pendulum gets closer to the horizontal
position, hence the need for a gain-scheduled controller. Apart
0 1 0 0 - ,
from the maximization of the range where stabilization holds, we
must also proviide performance in terms of settling-timeand over-
shoot in response to reference signals. This will be detailed in a
later section. Also the controller should exhibit adequate roll-off
in the high-frequency range for noise attenuation.
with
,
and it is easily verified that p and p2are polytopic coordinates,
thatis,p, >Oandp, >Oandp, +p2 =l.
February 1999 47
Authorized licensed use limited to: UNIVERSITY PUTRA MALAYSIA. Downloaded on May 27,2024 at 10:00:14 UTC from IEEE Xplore. Restrictions apply.
x = Ax + B,w, + B,w + B,u,
z, = C,x + Doewe+ D,,w + D,,u,
z = C , x + D,,w, + D,,w + D,,u,
y = C,x + D,,w, + D,,w + D Z 2 u ,
w , = o ( t ) z, ,
(31)
where @(t)is a time-varying parameter matrix
and is usually assumed to have a block-diagonal
structure in the form
o ( t ) = diag(...,@ , ( t ,...,
) e,(t)I ,... ), (32)
and is normalized such that
o ( t ) ' o ( t ) 5 I , t 2 0. (33)
Blocks denoted 0, and 8, I are generally re-
ferred to as full and repeated-scalar blocks ac-
cording to the p analysis and synthesis literature
[ 3 ] ,[4]. Note that straightforward computations
[;]=I[;
lead to the state-space representation
A B, ;;j+[;;jw
$1 -DeeO(t))-l[CeDe1 0821
:1.1
hence the plant with inputs w and u and outputs z
,
(34)
x = A ( p ( t ) ) x+ B , ( p ( t ) ) w+ B*(p(t))u I
{ p pv,1, t 2 0 ,
p ( t ) ~ P ~ : = c o,,,,..., (36)
where the notation CO{ .} stands for the convex
hull of the set { .}.
Clearly, the state-space data of the plant (35)
range over a matrix polytope and thus it trivially
Fig. 7. H , Control results (y = 2.5114, (plc = 45") holds that
Authorized licensed use limited to: UNIVERSITY PUTRA MALAYSIA. Downloaded on May 27,2024 at 10:00:14 UTC from IEEE Xplore. Restrictions apply.
i 1
B,(P(t)) B,(p(t))
A(P(t))
Ci(P(t)) Dii(P(t)) Dl,(p(t)) E
:=col[i: :::i], Bl I
D,,,
DZI, Q2i
i =1,2,...,r } ,
(37)
where
i‘~~q
czi
t;:;
D2I1 D22,
-4l0 5 10 15 20
I
:=
1
W , : ) B,(P,,,)B,(P,>,)
CAP”, 1 D,,(P”,1 Dl,(P”, 1
C,(PY, 1 D*,(P”,1 D,,(P”, 1
i =1, ...,r .
3
(38)
Time [sec]
February 1999 49
Authorized licensed use limited to: UNIVERSITY PUTRA MALAYSIA. Downloaded on May 27,2024 at 10:00:14 UTC from IEEE Xplore. Restrictions apply.
tailed description of the LMI characterizations involved. LMI re-
gions are self-conjugate regions of the complex plane which are
useful to specify transient constraints for the closed-loop dynam- where CO, is a design parameter. The family of test or reference
signals r for the arm rotation in the range (21) which are used
ics. Interesting examples of such regions include vertical and
hereafter can be described as follows.
horizontal strips, circles, conic sectors, etc., and their intersec-
tions thereof which again are LMI regions (see Fig. 6 for an illus-
tration). Moreover, any self-conjugate region of the complex
plane can be approximated to any desired accuracy with an LMI
region.
Another practical advantage of LMI region constraints is that
they can be exploited to indirectly enforce constraints on the con-
troller dynamics and thus on the controller bandwidth. This is of
primary importance in any realistic design. We stress here that ,
where q,,c,x:= 21 sin(cp,,), z = 0 5 and cp,,(5 Cp,) is a target value.
such an approach is only indirect and might require preliminary A simple drawing of such signals is depicted in Fig. 5. Then the
adjustments and trials to translate original constraints on the con- specifications for the control system are the following:
troller into constraints on the closed-loop dynamics. These ad-
justments are useful to avoid unnecessary restrictions on the (Sl) the closed-loop system is internally stable.
design problem. From a theoretical viewpoint, achieving con-
troller bandwidth specifications remains a delicate and unsolved (S2) the L,-induced gain of the operator connecting w = Y to
question, except for some classes of fixed-structure controllers
such as PID and the like. In the context of this paper, it is enough
to have in mind that LMI region constraints provide a sensible
though not direct approach to this problem. A more abstract jus-
z= [ is bounded by y, where CO,
is a design parameter.
Authorized licensed use limited to: UNIVERSITY PUTRA MALAYSIA. Downloaded on May 27,2024 at 10:00:14 UTC from IEEE Xplore. Restrictions apply.
range cpI E [-(PI, (PI]. Specification (S2) translates
performance tracking and high-frequency gain at-
tenuation objectives. The specification (S3) ex-
press that stabilization but also performance and
roll-off requirements must be achieved on the larg-
est range of possible dynamics of the ADIP. The
specification (S4)is directly dictated from physi-
cal hardware limitations.
0 5 10 15 20 0 5 10 15 20
nal U , and the (2,2)-subplot shows 0,. We first ob- Time [sec] Time [sec]
serve the consistency of the responses of the
simulation and the experiment, which to some ex-
tent validates the simplified model we have used
for synthesis. One can also see that the control sys- Fig. 10. LFT Control results (y = 0.4733, cp,, = 60").
February 1999 51
Authorized licensed use limited to: UNIVERSITY PUTRA MALAYSIA. Downloaded on May 27,2024 at 10:00:14 UTC from IEEE Xplore. Restrictions apply.
tem has very poor performance using this controller even on the Design of an LFT Controller
somewhat reduced target cp,, = 45". An LPV control system with LFT structure is depicted in Fig. 9.
An LFT-LPV controller
p Controller Based on an Uncertain Model
We decided to improvethese preliminary results usingp synthesis
by explicitly taking account of the changing dynamics of the ADP,
where <PI = 65". We have used first-orderp scaling and the parame-
ters were tuned to o, = 0.02,03, = 0.8 and areg = 0.1, rYe8= 50,
Oreg = 45". With the H _ synthesis steps enforcing closed-loop pole
constraints, we obtained some controllers with poles is designed using the LFT description of the ADIP, where
-
(-61.497 f 25.596j , - 47.080, - 33.753, 'pl = 65 '. The parameters w , and o,were set to 0.02 and 05, re-
spectively. The underlying LTI dynamics of the LFT controller
- 0.8993, - 0.3723, - 0.2206,0},
are easily obtained by instantiating the LFT controller at some
which are clearly satisfactory with respect to the implementation frozen values of 8,. For the extreme values of 8, the following
constraints. In our example, usual p synthesis algorithms would dynamics are obtained.
lead to unacceptable controller dynamics. As before, the nonlin- e, = 1:{-365.56, - 359.74,- 291.81,
ear simulations and the hardware experiment are shown in Fig. 8. - 24.064,-1.3786,O) ,
As expected, this second controller provides better perfor-
mance on the target (p,, = 45". However, the very same controller e, = -1:{-365.56,- 359.72,- 291.84,
but with the target 9,' increased to 60" does not even provide sta- -42.888, -1.4367,O).
bility and we did not find any simple way to guarantee the same
level of performance on this larger range. Such dynamics are again satisfactory in regard to implementa-
Summing up our results, the H , controller is able to stabilize tion constraints and have been derived by minimizing, through
the range 45"but provides very poor performance. The p control- an LMI formulation, the norm of the A matrix of the LPV con-
ler provides adequate performance on the range 45" but no longer troller in the construction procedure. The nonlinear simulations
works for ranges up to 60", illustratinga fundamental tradeoff be- and the hardware experiments are shown in Fig. 11.As expected,
tween performance and the size of the operating range. In many both performance and the the size of the operating range have
applications this tradeoff is very limiting and can only be negoti- been enhanced as compared to previous LTI controllers.
ated outside the set of fixed controller structures by exploiting
gain-scheduling strategies.This is considered in the next section. Design of a Polytopic Controller
An LPV control system with polytopic structure is depicted in
LPV Syntheses Fig. 11. A similar design is now conducted using the polytopic
In this section, the synthesis techniques discussed in previ- description of the ADIP, where (p, = 65 ", leading to polytopic
ously are exploited to improve both performance and the operat- LPV controllers
ing range of the ADIP.
-35.159, -2.1349,O).
Authorized licensed use limited to: UNIVERSITY PUTRA MALAYSIA. Downloaded on May 27,2024 at 10:00:14 UTC from IEEE Xplore. Restrictions apply.
Simulation
Concluding Remarks
We can summarize the results of the four kinds
of synthesis methods as shown in Table 1.
This shows that each synthesis method can the-
oretically stabilize the operating range [-?&,‘pI]
but experimentally achieves only [-(p,,, cp,,]. As
for they-synthesis, we observe some gap between
the two range, that is, Cp, =65” and cp,, =45”.
This comes from the model approximation used
in the syntheses. In Table 1, T s indicates the set-
tling times in the experiments.
A quick view of the merits and drawbacks of
each technique is presented in Table 2.
0 5 10 15 20 Finally, we have presented a comprehensiveap-
Time [sec] Time [sec]
plication of LPV control techniques to the control of
18,1<1 an m-driven inverted pendulum. The particular
1.5
I interest of this application lies in the fact that all in-
gredients of the design problem have to be taken
into account, from the specificationsup to the con-
straints inherent to real-world implementations. In
this context, it has been shown that currently avail-
able synthesis methodologies, such as p and LPV
techniques, may fail to provide acceptable answers.
A major obstacle is undoubtly the implementation
-2 ,-.......~~.-........~....,....~...,,.,.,. constraint that puts hard limitations on the control-
-3
0 5 10 15 20
-1.5
0 5 10 15
I
20
ler dynamics. These limitations are generally diffi-
cult to handle within the usual formulation of LPV
Time [sec] Time [sec] control techniques.
It has been shown that a suitable extension of
these techniques including LMI region con-
straints on the closed-loop dynamics can over-
come this difficulty. When implementable, it has
been observed that LPV controllers outperform
1OOr I 1 1 fixed p controllers both in robustness and perfor-
mance. These observations were confirmed by
simulations but more importantly by a number of
records on the physical experiment.
References
[l] M.W. Spong and M. Vidyasagar, Robot Dynamics and
Control,Wiley, 1989.
1001 1 -4 1
...
f [2]C. Canudas de Wit, B. Siciliano, and G. Bastin (eds.),
0 5 10 15 20 0 5 10 15 20 The Theory of Robot Control, Springer, 1996.
Time [sec] Time [sec]
[3] M.K.H. Fan, A.L. Tits, and J.C. Doyle, “Robustness in
18,1<1
1.5t I the Presence of Mixed Parametric Uncertainty and Un-
modelized Dynamics,” IEEE Trans. Automat. Contl:, vol.
1 AC-36, no. 1, pp. 25-38, 1991.
February 2999 53
Authorized licensed use limited to: UNIVERSITY PUTRA MALAYSIA. Downloaded on May 27,2024 at 10:00:14 UTC from IEEE Xplore. Restrictions apply.
[ 191M. Chilali and P. Gahinet, “ H _ Design with Pole Placement Constraints:
Table 1. Performance indexes for LTI and LPV methods
an LMI Approach,” IEEE Trans. Autom. Contl:, vol. 41, 3, pp. 358-367,
Synthesis 1996.
[20] G. Garcia, D. Arzelier, J. Daafonz, and J. Bernussou, “An [LMI} For-
2.57
mulation for Robust Disk Pole Assignment via Output Feedback,” IFAC
0.79 45” 2.5s World Congress, San Fransisco, 1996
LFT-LPV 0.47 65O 60” 2s [21] T. Iwasaki and R.E. Skelton, “All Controllers for the General H _ Control
Problem: LMI Existence Conditions and State Space Formulas,” Auto-
Poly-LPV 0.42 65 O 60” 2s matica, vol. 30, no. 8, pp. 1307-1317, 1994.
[22] G.J. Balas, J.C. Doyle, K. Glover, A. Packard, and R. Smith, ‘b-Analy-
sis and Synthesis Toolbox,” The Mathworks, 1993.
1231 P. Gahinet, A. Nemirovski, A.J. Laub, and M. Chilali, “LMI Control
1 Table 2. Merits and drawbacks in LTI and LPV methods 1 Toolbox,” The Mathworks, 1995.
I less performance
needs pole constraint
niques,’’ http://momiji98.ces.kyutech.ac.jp/publication.html,
1998.
Authorized licensed use limited to: UNIVERSITY PUTRA MALAYSIA. Downloaded on May 27,2024 at 10:00:14 UTC from IEEE Xplore. Restrictions apply.