Research Article  Open Access
Iraj Hassanzadeh, Saleh Mobayen, "Controller Design for Rotary Inverted Pendulum System Using Evolutionary Algorithms", Mathematical Problems in Engineering, vol. 2011, Article ID 572424, 17 pages, 2011. https://doi.org/10.1155/2011/572424
Controller Design for Rotary Inverted Pendulum System Using Evolutionary Algorithms
Abstract
This paper presents evolutionary approaches for designing rotational inverted pendulum (RIP) controller including genetic algorithms (GA), particle swarm optimization (PSO), and ant colony optimization (ACO) methods. The goal is to balance the pendulum in the inverted position. Simulation and experimental results demonstrate the robustness and effectiveness of the proposed controllers with regard to parameter variations, noise effects, and load disturbances. The proposed methods can be considered as promising ways for control of various similar nonlinear systems.
1. Introduction
During the past decades, many modern control methodologies such as nonlinear control, optimal control, adaptive control, and variable structure control have been widely proposed for control approaches [1–3]. However, these methods are theoretically complex and difficult to implement. Also, proportionalintegralderivative (PID) controller is a wellknown method for industrial control processes. The later approach has been broadly employed in industries because of its simple structure and robust performance in a wide range of operating conditions [3]. Unfortunately, it has been difficult to tune up PID controller gains accurately because many industrial plants are often very complex consisting of issues such as higher order, time delays, and nonlinearities [4, 5]. Ziegler and Nichols proposed the first method utilizing the classical tuning rules. Though, it is hard to determine optimal PID controller parameters with ZieglerNichols formula in general [1, 3]. To overcome these difficulties, various methods have been proposed. The ability of using numerical methods for efficiently and accurately characterizing the quality of a particular design has excited control engineers to apply stochastic global optimizers.
Over the past years, several evolutionary algorithms have been proposed to search for optimal PID controllers. Among them, GA has received great attention and PSO has been successfully applied to various fields [6–10]. Also, ACO is a relatively recent approach to solve optimization problems by simulating the behavior of ant colonies and modeling the behavior of ants, which are known to be able to find the shortest path from their nest to a food source [11]. The ACO method proposed in our paper has the following advantages: applicability to any kind of optimization problems, combinatorial or continuous, easy implementation, high rate of successful optimizations, and low run time.
In this paper, we compare the efficiency of three intelligent algorithms, that is, GA, PSO, and ACO methods. These evolutionary algorithms are used to adjust the PID controller parameters in order to ensure adequate servo and regulatory behavior of the closedloop system. Also, we formulate the problem of designing PID controllers as an optimization dilemma which adjusts five performance indexes, that is, maximum overshoot, rise time, settling time, and steadystate error of the response and system control energy.
2. Rotational Inverted Pendulum System
The rotational inverted pendulum system is a wellknown test platform for evaluating various control algorithms. Also, it has some significant reallife applications such as position control, aerospace vehicles control, and robotics [12, 13]. The system consists of a rotational arm and a pendulum where the rotational arm is actuated by a motor with the objective of balancing the pendulum in the inverted position. A schematic diagram of the RIP system is represented in Figure 1, where u, l_{p}, m_{p}, α, r, θ, and J_{b} are the motor input, the pendulum length, the pendulum mass, the pendulum angle, the arm length, the arm angle and effective mass moment of inertia, respectively.
The plane of the pendulum is orthogonal to the radial arm. Figure 2 indicates the RIP system built in robotics research lab in our department. Also, the block diagram of the whole system is shown in Figure 3.
In this section, the dynamic equations of the RIP system considering backlash and friction effects are presented. The RIP dynamics are governed by [12, 14]
The above nonlinear model can be found in the following equations:
The parameters of nonlinear model of the system are represented in Table 1.

Using (2.1)–(2.3), the RIP system is easily simulated using Simulink and Matlab. The Simulink block diagram of RIP system shown in Figures 4 and 5 illustrates the step response without controllers indicating the whole system is unstable. The controller parameters generated by heuristic algorithms are employed iteratively in relevant simulation blocks, and the cost function is calculated in the manner presented in the next section.
(a)
(b)
(c)
3. GA, PSO, and ACO: An Overview
In the following, brief reviews of GA, PSO and ACO principle are illustrated.
3.1. GA
Considering Darwin's original ideas, life in all its diverse forms is evolved by natural selection and adaptation processes controlled by the survivability of the fittest species. GA is an evolutionary optimizer that takes a sample of possible individuals and employs selection, crossover, and mutation as the primary operators for optimization [15, 16].
3.2. PSO
Considering the social behavior of swarm of fish, bees, and other animals, the concept of the particle swarm optimization (PSO) is developed. The PSO is a robust stochastic evolutionary computation method based on the movement of swarms looking for the most fertile feeding location [16].
From the above statements, it is obvious that the theoretical bases of the two optimization methods rest upon two completely different structures. The GA is based on genetic encoding and natural selection, and the PSO method is based on social swarm behavior. PSO is based on the principle that all solutions can be represented as particles in a swarm. Each particle has a position and velocity vector, and each position coordinate represents a parameter value. Similar to GA, PSO requires a fitness evaluation function that takes the particle’s position and assigns a fitness value to it. and are the personal best () position and global best () position of the ith particle. Each particle is initialized with a random position and velocity. The velocity of each particle is accelerated toward the global best and its own personal best based on the following equation:
Here rand() and Rand() are two random numbers in the range [], and are the acceleration constants, and is the inertia weight factor. The parameter helps the particles converge to , rather than oscillating around it. Suitable selection of provides a balance between global and local explorations. In general, w is set according to the following equation [16, 17]:
The positions are updated based on their movement over a discrete time interval () as follows, with usually set to 1:
Then the fitness at each position is reevaluated. If any fitness is greater than , then the new position becomes and the particles are accelerated toward that point. If the particle’s fitness value is greater than , then is replaced by the current position.
3.3. ACO
ACO is a relatively recent approach to solve optimization problems by simulating the behavior of ant colonies and modeling the behavior of ants, which are known to be able to find the shortest path from their nest to a food source [18]. ACO is an optimization technique that has been recently developed and recognized as effective for combinatorial optimization problems.
All ants start their tours from source node and end up their tours in destination node. In each node, an ant chooses its path probabilistically, and the probability of choosing an edge is proportional to the pheromone on the edge, that is, roulette wheel selection.
All edges have an initial amount of pheromone, . After completion of all tours, first pheromone values on all edges are lowered, reflecting evaporation: where is the pheromone evaporation rate. Then all ants deposit pheromone on all edges they have crossed in their tours: where is the tour of the th ant, is its cost, and is a positive constant which allows to adjust maximum pheromone deposit.
The algorithms parameters have been chosen based on trial and error as follows. For GA method, population size = 50, crossover rate = 0.5, mutation rate = 0.01, and maximum generations = 20; for PSO algorithm, number of particles = 50; acceleration constants , and maximum iteration = 20; for ACO algorithm, , , and . Each algorithm is implemented in Matlab. All of the programs are run on a 2.1 GHz Core 2 Duo processor with 2 GB of memory. Each of the optimization methods is tested in 50 independent runs involving 50 different initial trial solutions.
4. Problem Formulation and Controller Design
Considered performance index includes the overshoot , rise time , settling time , steadystate error , and control energy . We find the appropriate parameters for the controllers minimizing the performance indexes. The proposed cost function is considered as follows [3, 19]: where β is the weighing factor. In this paper, β is set to 1 and 1.5 to investigate different possible solutions. The algorithms stages for searching proper parameters of PID controller are as follows.
First, specify the lower and upper bounds of controller parameters and initialize the particles of the population randomly. Each particle, that is, (controller parameters) is sent to Matlab Simulink. Then, the values of the performance criteria in the time domain, namely, , , , , and are calculated iteratively. After that, cost function is evaluated for each particle according to these performance criteria. If the cost for local best solution is less than the cost of the current global best solution, the global solution is replaced with local solution. At the end of each iteration, the program checks the stop criterion. If the number of iterations reaches the maximum designated by the user, the latest global best solution is recorded and the algorithm is brought to an end.
5. Simulation Results
The lower and upper bounds of the three controller parameters are shown in Table 2.

In order to examine the dynamic behaviors and convergence characteristics of the proposed methods, two statistical indexes, namely, the mean value () and the standard deviation () of cost values of all individuals during the computation processes, are used. The mean value displays the accuracy of the algorithm, and the standard deviation measures the convergence speed of the algorithm. The formulas for calculating these values are as follows, respectively [19, 20]: where is the cost value of the individual and is the population size.
Several simulations are performed to investigate and compare controllers’ convergence characteristics. As it can be seen in simulations in Figures 6 and 7, though all controllers can obtain stable mean cost value using the same cost function and simulation conditions, the ACOPID controller has better cost value and mean value, showing that it can achieve better accuracy. Simultaneously, we can also find from Figures 8 and 9 that the convergence tendency of the standard deviation of cost values in the ACOPID controller is much faster than other ones. This can prove that the ACO method has better convergence efficiency.
Also, our simulation results demonstrate that ACO method is faster than GA and slower than PSO and the run time in 20 iterations for ACO is 2408.17 sec in comparison with 1805.42 sec for PSO and 3865.903 sec for GA.
5.1. Servo Behavior
In the following, the optimization procedure has been applied to the RIP system in reference tracking (servo behavior). For analysis of this behavior, the reference signal () for pendulum angle is first given equal to zero, and then it is determined equal to . Figures 10 and 11 show the best results of the arm and pendulum angles and control energy value for different values of β. As it can be seen, ACObased controller makes fine responses, indicating the superiority over other controllers.
(a)
(b)
(c)
(a)
(b)
(c)
The simulation results of the best solution for various values of β in 50 runs are summarized in Table 3. As it can be observed from Table 3, the overshoot is decreased using ACOPID in comparison with GAPID and PSOPID. This improvement is 25.693% () and 19.335% (β = 1.5) in comparison with PSOPID and 55.071% () and 37.936% (β = 1.5) in comparison with GAPID. Also, it is inferred that rise time of the step response in all simulations is quite similar and settling time of step response using GA is better than other controllers. According to total cost value, ACOPID controlled systems have less cost values.

Now, the servo performance is considered for the reference signal () of pendulum angle equal to . Figures 12 and 13 show the pendulum angles (servo response) for and different values of β.
5.2. Regulatory Behavior
The control systems are always subject to external disturbances and internal noise which affect the system dynamics. If the nature of the disturbance is identified, it can be modeled mathematically. However, in practice, the nature of the disturbances is not clear and we may not be able to simulate them easily. In RIP system, the disturbance and noise effects can be applied by adding an additional load to the end of pendulum and adding noise to the position sensor, respectively.
The simulations are done subject to a bandlimited white noise (noise power = 0,000523 and sampling time = 0.1 sec) and 10% parameter value changes. Simulation results shown in Figures 14 and 15 illustrate the robustness and effectiveness of the proposed controllers subject to the noise and disturbance.
(a)
(b)
(c)
(a)
(b)
(c)
Servo and regulatory results motivate to consider the proposed procedure as a suitable tool for controller parameters design and also stimulate investigating the possibility of further research on design and development of other practical control systems.
6. Experimental Results
We have performed experiments on the RIP system set at the University of Tabriz in the robotics research lab. The applied card in this project is PCI6602 which creates the connection between computer and system and has A/D and D/A converters. Also the arm and pendulum links angles are measured using two E40S Autonics company encoders. The experimental results of the proposed methods on RIP system are shown in Figures 16 and 17. In these figures, the time interval [] which belongs to the swingup period is deleted to focus on controllers' performance in the balance mode. The experimental results are very consistent with the simulation results shown in Figures 10 and 11 which not only prove the performance of proposed methods but also verify the availability of the system model. The fact that the simulated and real controlled responses are practically identical validates the identified system model.
(a)
(b)
(c)
(a)
(b)
(c)
In order to study the stability of designed control system using ACO algorithm subject to parameters variations, we perform the following experiment. In this experiment, the adding effect of a 25 g body mass with the length of 13 cm and in the presence of disturbances is validated. Figure 18 shows the results of this practical test. Then, the proposed method is a very powerful technique in completely eliminating the effects of the disturbances and providing satisfactory servo behavior.
(a)
(b)
(c)
7. Conclusion
In this paper, we present three evolutionary algorithms for designing of intelligent controllers of the RIP system. Each of the algorithms is tested in 50 independent runs involving 50 different initial solutions. The rotational inverted pendulum system is considered as a case study. Through the simulation results, the proposed controllers perform efficient search for proper PID parameters. To evaluate the controller performance, we tested the ability of the closedloop system to follow set point changes (servo behavior) and the ability of the closedloop system to reject disturbances (regulatory behavior). The work demonstrates that all methods can solve searching and tuning the controller parameters efficiently. The proposed methods could be considered as promising ways for nonlinear control systems in general. One of the important features of the system is using of xPCTarget toolbox and inputoutput card in Simulink environment which utilizes hardware in the loop (HIL), telelab implementation and fastprototyping properties. The topic of our future researches is to employ other cognitive methods in order to achieve better results for designing controller and improving the performance in real time. Also, implementation, of heuristic algorithms for designing adaptive controllers will be our future challenging task. Furthermore, teleoperation control of RIP system using haptic device would be another challenge.
Acknowledgments
This paper was financially supported by the Research Affairs at University of Tabriz. The authors would like to appreciate Mr. Abbas Harifi for his guidance and assistance.
References
 N. B. Nichols and J. G. Ziegler, “Optimum settings for automatic controllers,” ASME Transactions, vol. 64, pp. 759–768, 1942. View at: Google Scholar
 R. A. Krohling and J. P. Rey, “Design of optimal disturbance rejection PID controllers using genetic algorithms,” IEEE Transactions on Evolutionary Computation, vol. 5, no. 1, pp. 78–82, 2001. View at: Publisher Site  Google Scholar
 Z. L. Gaing, “A particle swarm optimization approach for optimum design of PID controller in AVR system,” IEEE Transactions on Energy Conversion, vol. 19, no. 2, pp. 384–391, 2004. View at: Publisher Site  Google Scholar
 A. Visioli, “Tuning of PID controllers with fuzzy logic,” IEE Proceedings: Control Theory and Applications, vol. 148, no. 1, p. 18, 2001. View at: Publisher Site  Google Scholar
 G. Zhou and J. D. Birdwell, “Fuzzy logicbased PID autotuner design using simulated annealing,” in Proceedings of the IEEE/IFAC Joint Symposium on ComputerAided Control System Design, pp. 67–72, Tucson, Ariz, USA, March 1994. View at: Google Scholar
 D. H. Kim, “Hybrid GABF based intelligent PID controller tuning for AVR system,” Applied Soft Computing Journal, vol. 11, no. 1, pp. 11–22, 2011. View at: Publisher Site  Google Scholar
 M. W. Iruthayarajan and S. Baskar, “Evolutionary algorithms based design of multivariable PID controller,” Expert Systems with Applications, vol. 36, no. 5, pp. 9159–9167, 2009. View at: Publisher Site  Google Scholar
 Y. Y. Fu, C. J. Wu, T. L. Chien, and C. N. Ko, “Integration of PSO and GA for optimum design of fuzzy PID controllers in a pendubot system,” Artificial Life and Robotics, vol. 13, no. 1, pp. 223–227, 2008. View at: Publisher Site  Google Scholar
 V. Mukherjee and S. P. Ghoshal, “Intelligent particle swarm optimized fuzzy PID controller for AVR system,” Electric Power Systems Research, vol. 77, no. 12, pp. 1689–1698, 2007. View at: Publisher Site  Google Scholar
 W. D. Chang, “PID control for chaotic synchronization using particle swarm optimization,” Chaos, Solitons and Fractals, vol. 39, no. 2, pp. 910–917, 2009. View at: Publisher Site  Google Scholar
 L. D. S. Coelho and D. L. D. A. Bernert, “A modified ant colony optimization algorithm based on differential evolution for chaotic synchronization,” Expert Systems with Applications, vol. 37, no. 6, pp. 4198–4203, 2010. View at: Publisher Site  Google Scholar
 N. Muškinja and B. Tovornik, “Swinging up and stabilization of a real inverted pendulum,” IEEE Transactions on Industrial Electronics, vol. 53, no. 2, pp. 631–639, 2006. View at: Publisher Site  Google Scholar
 H. Mei and Z. He, “Study on stability control for single link rotary inverted pendulum,” in Proceedings of the International Conference on Mechanic Automation and Control Engineering (MACE '10), pp. 6127–6130, Wuhan, China, June 2010. View at: Publisher Site  Google Scholar
 Q. Yan, “Output tracking of undergraduated rotary inverted pendulum by nonlinear controller,” in Proceedings of the 42nd IEEE Conference on Decision and Control, vol. 3, pp. 2395–2400, Maui, Hawaii, USA, 2003. View at: Google Scholar
 Y. RahmatSamii, “Genetic algorithm and particle swarm optimization in engineering electromagnetics,” in Proceedings of the 17th International Conference on Applied Electromagnetics and Communications, Dubrovnik, Croatia, 2003. View at: Google Scholar
 J. Kennedy and R. Eberhart, “Particle swarm optimization,” in Proceedings of the International Conference on Neural Networks, vol. 4, pp. 1942–1948, Perth, WA , Australia, November 1995. View at: Google Scholar
 I. Hassanzadeh and S. Mobayen, “PSObased controller design for rotary inverted pendulum system,” Journal of Applied Sciences, vol. 8, no. 16, pp. 2907–2912, 2008. View at: Publisher Site  Google Scholar
 M. Dorigo and T. Stützle, Ant Colony Optimization, MIT Press, Cambridge, 2004.
 S. Mobayen, Identification and control of rotational inverted pendulum using intelligent methods, M.S. thesis, University of Tabriz, Tabriz, Iran, 2008.
 R. L. Haupt and S. E. Haupt, Practical Genetic Algorithms, John Wiley & Sons, New York, NY, USA, 2nd edition, 1998.
Copyright
Copyright © 2011 Iraj Hassanzadeh and Saleh Mobayen. This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.