Abstract

System identification is a complex optimization problem that has recently attracted attention in the fields of science and engineering. In particular, the use of infinite impulse response (IIR) models for identification is preferred over their equivalent FIR (finite impulse response) models, since the former yield more accurate models of physical plants for real-world applications. However, IIR structures tend to produce multimodal error surfaces whose cost functions are significantly difficult to minimize. Evolutionary computation techniques (ECT) are used to estimate the solution to complex optimization problems. They are often designed to meet the requirements of particular problems because no single optimization algorithm can solve all problems competitively. Therefore, when new algorithms are proposed, their relative efficacies must be appropriately evaluated. Several comparisons among ECT have been reported in the literature. Nevertheless, they suffer from one limitation: their conclusions are based on the performance of popular evolutionary approaches over a set of synthetic functions with exact solutions and well-known behaviors, without considering the application context or including recent developments. This study presents the comparison of various evolutionary computation optimization techniques applied to IIR model identification. Results over several models are presented and statistically validated.

1. Introduction

System identification is a complex optimization problem that has recently attracted attention in the fields of science and engineering. It is important in the disciplines of control systems [1], communication [2], signal processing [3], and image processing [4].

In a system identification configuration, an optimization algorithm attempts to iteratively determine the adaptive model parameters to obtain an optimal model for an unknown plant by minimizing some error function between the output of the candidate model and the output of the plant. The optimal model or solution is attained when such an error function is effectively reduced. The adequacy of the estimated model depends on the adaptive model structure, the optimization algorithm, and also the characteristics and quality of the input-output data [5].

Systems or plants can be better modeled through infinite impulse response (IIR) models because they emulate physical plants more accurately than their equivalent FIR (finite impulse response) models [6]. In addition, IIR models are typically capable of meeting performance specifications using fewer model parameters. However, IIR structures tend to produce multimodal error surfaces whose cost functions are significantly difficult to minimize [7]. Hence, in order to identify IIR models, a practical, efficient, and robust global optimization algorithm is necessary to minimize the multimodal error function.

Traditionally, the least mean square (LMS) technique and its variants [8] have been extensively used as optimization tools for IIR model identification. The wide acceptance of such gradient based optimization techniques is due to their low complexity and simplicity of implementation. However, the error surface for the IIR model is mostly multimodal with respect to the filter coefficients. This may lead traditional gradient-descent approaches into local optima [9].

The difficulties associated with the use of gradient based optimization methods for solving several engineering problems have contributed to the development of alternative solutions. Evolutionary computation techniques (ECT) such as particle swarm optimization (PSO) [10], artificial bee colony (ABC) [11], the electromagnetism-like method (EM) [12], cuckoo search (CS) [13], and the flower pollination algorithm (FPA) [14] have received much attention regarding their potential as global optimization methods in real-world applications. Inspired by the evolution process and the survival of the fittest in the biological world, ECT are search methods that differ from traditional optimization methods. They are based on a collective learning process within a population of candidate solutions. The population in ECT is usually arbitrarily initialized, and each iteration (also called a generation) evolves towards better and better solution regions by means of randomized processes where several operators are applied to each candidate solution. ECT have been applied to many engineering optimization problems and have proven to be effective for solving some specific problems, including multimodal optimization, dynamic optimization, noisy optimization, and multiobjective optimization [15–17]. Hence, they are becoming increasingly popular tools to solve various hard optimization problems.

As an alternative to gradient based techniques, the problem of IIR modeling has also been handled through evolutionary computation techniques. In general, they have been demonstrated to yield better results than gradient-based algorithms with respect to accuracy and robustness [9]. Such approaches have produced several robust IIR identification systems by using different evolutionary computation techniques such as PSO [18], ABC [19], EM [20], and CS [21], whose results have been individually reported.

ECT are often designed to meet the requirements of particular problems because no single optimization algorithm can solve all problems competitively [22]. Therefore, when new alternative algorithms are proposed, their relative efficiency must be appropriately evaluated. Many efforts [23–25] have also been devoted to comparing ECT to each other. Typically, such comparisons have been based on synthetic numerical benchmark problems, with most studies verifying whether one algorithm outperforms others over a given set of benchmark functions while overlooking any statistical test. However, few comparative studies of various ECT considering the application context are available in the literature. Therefore, it is very important to discuss and compare the performance of ECT methods from an application point of view.

This paper presents the comparison of various evolutionary computation optimization techniques applied to IIR model identification. In the comparison, special attention is paid to recently developed algorithms such as the cuckoo search (CS) and the flower pollination algorithm (FPA), also including popular approaches such as particle swarm optimization (PSO), artificial bee colony (ABC) optimization, and the electromagnetism-like (EM) optimization algorithm. Results over several models with different ranges of complexity are presented and validated within a statistically significant framework.

The rest of this paper is organized as follows: Section 2 presents a review of the evolutionary computation techniques employed in the comparison, whereas Section 3 discusses the IIR system identification problem. Section 4 presents all experimental results, and some concluding remarks are drawn in Section 5.

2. Evolutionary Computation Techniques (ECT)

In the real world, many optimization problems can be considered as black-box challenges. Often, little information about the optimization problem itself is available beyond what emerges from function evaluations. In the worst case, nothing is known about the characteristics of the fitness function, for example, whether it is unimodal or multimodal.

On the other hand, ECT are used to estimate the solution to complex optimization problems since they adapt easily to black-box formulations and extremely ill-behaved functions. ECT are based on a collective learning process within a population of candidate solutions. The population in ECT is usually arbitrarily initialized, while each iteration (also called a generation) evolves towards better solution regions by means of randomized processes, with several operators being applied to each candidate solution. ECT have been applied to many engineering optimization problems, ensuring an effective solution for some specific problems, including multimodal optimization, dynamic optimization, noisy optimization, multiobjective optimization, and others [15–17].

Therefore, ECT are becoming increasingly popular tools to solve various hard optimization problems. This section presents a brief description of the five evolutionary computation techniques employed in our comparative study: particle swarm optimization (PSO), artificial bee colony (ABC) optimization, the electromagnetism-like (EM) algorithm, cuckoo search (CS), and the flower pollination algorithm (FPA).

2.1. Particle Swarm Optimization (PSO)

PSO, proposed by Kennedy and Eberhart in 1995 [10], is a population-based stochastic optimization technique inspired by the social behavior of bird flocking or fish schooling. The algorithm searches for the optimum using a group or swarm formed by possible solutions of the problem, which are called particles. From the implementation point of view, in the PSO operation, a population of $N$ particles (individuals) evolves from the initial point ($k = 0$) to a total gen number of iterations ($k = \mathrm{gen}$). Each particle $\mathbf{x}_i^k$ ($i \in \{1, 2, \ldots, N\}$) represents a $d$-dimensional vector where each dimension corresponds to a decision variable of the optimization problem at hand. The quality of each particle (candidate solution) is evaluated by using an objective function $f(\mathbf{x}_i^k)$ whose final result represents the fitness value of $\mathbf{x}_i^k$. During the evolution process, the best global position $\mathbf{g}^k$ seen so far is stored along with the best position $\mathbf{p}_i^k$ reached by each particle. Considering a minimization problem, such positions are computed as follows:
$$\mathbf{p}_i^{k} = \arg\min_{m \in \{0,1,\ldots,k\}} f\!\left(\mathbf{x}_i^{m}\right), \qquad \mathbf{g}^{k} = \arg\min_{i \in \{1,\ldots,N\}} f\!\left(\mathbf{p}_i^{k}\right). \tag{1}$$
In this work, the modified PSO version proposed by Lin et al. in [26] has been implemented. Under such an approach, the new position of each particle is calculated by using the following equations:
$$\mathbf{v}_i^{k+1} = w\,\mathbf{v}_i^{k} + c_1 r_1 \left(\mathbf{p}_i^{k} - \mathbf{x}_i^{k}\right) + c_2 r_2 \left(\mathbf{g}^{k} - \mathbf{x}_i^{k}\right), \qquad \mathbf{x}_i^{k+1} = \mathbf{x}_i^{k} + \mathbf{v}_i^{k+1}, \tag{2}$$
where $w$ is called the inertia weight and controls the impact of the current velocity on the updated velocity $\mathbf{v}_i^{k+1}$; $c_1$ and $c_2$ are the positive acceleration coefficients that rule the movement of each particle towards the positions $\mathbf{p}_i^{k}$ and $\mathbf{g}^{k}$, respectively; and $r_1$ and $r_2$ are uniformly distributed random numbers chosen within the interval $[0,1]$.
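To make the update rule in (2) concrete, the following minimal Python sketch applies it to the whole swarm at once; the array shapes and the handling of w, c1, and c2 are illustrative assumptions rather than the exact implementation of [26].

```python
import numpy as np

def pso_step(x, v, pbest, gbest, w=0.7, c1=2.0, c2=2.0):
    """One inertia-weight PSO velocity/position update (minimal sketch).

    x, v, pbest : arrays of shape (N, d); gbest : array of shape (d,).
    w, c1, c2 are assumed example values, not the paper's exact settings.
    """
    N, d = x.shape
    r1, r2 = np.random.rand(N, d), np.random.rand(N, d)
    v_new = w * v + c1 * r1 * (pbest - x) + c2 * r2 * (gbest - x)
    x_new = x + v_new
    return x_new, v_new
```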

2.2. Artificial Bee Colony (ABC)

The artificial bee colony (ABC) algorithm, proposed by Karaboga [11], is an ECT inspired by the intelligent foraging behavior of a honeybee swarm. In the ABC operation, a population of $N$ food locations (individuals) is evolved from the initial point ($k = 0$) to a total gen number of iterations ($k = \mathrm{gen}$). Each food location $\mathbf{x}_i^k$ represents a $d$-dimensional vector where each dimension corresponds to a decision variable of the optimization problem to be solved. After initialization, an objective function evaluates each food location to assess whether it represents an acceptable solution (nectar amount) or not. Guided by the values of such an objective function, the candidate solutions are evolved through different ABC operations (honeybee types). In the main operator, each food location $\mathbf{x}_i$ generates a new food source $\mathbf{v}_i$ in the neighborhood of its present position as follows:
$$\mathbf{v}_i = \mathbf{x}_i + \phi\left(\mathbf{x}_i - \mathbf{x}_r\right), \tag{3}$$
where $\mathbf{x}_r$ is a randomly chosen food location, satisfying the condition $r \neq i$. The scale factor $\phi$ is a random number between $[-1, 1]$. Once a new solution $\mathbf{v}_i$ is generated, a fitness value representing the profitability associated with a particular solution is calculated. The fitness value for a minimization problem can be assigned to a candidate solution by the following expression:
$$fit_i = \begin{cases} \dfrac{1}{1 + f(\mathbf{v}_i)} & \text{if } f(\mathbf{v}_i) \ge 0, \\ 1 + \left|f(\mathbf{v}_i)\right| & \text{if } f(\mathbf{v}_i) < 0, \end{cases} \tag{4}$$
where $f(\cdot)$ represents the objective function to be minimized. Once the fitness values are calculated, a greedy selection process is applied between $\mathbf{v}_i$ and $\mathbf{x}_i$. If $\mathbf{v}_i$ is better than $\mathbf{x}_i$, then the candidate solution $\mathbf{x}_i$ is replaced by $\mathbf{v}_i$; otherwise, $\mathbf{x}_i$ remains.
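A minimal sketch of the neighborhood operator (3), the fitness mapping (4), and the greedy selection is shown below; it perturbs a single randomly chosen dimension, as in the canonical ABC formulation, and the helper names are illustrative only.

```python
import numpy as np

def abc_neighbour(foods, i, f):
    """Generate and greedily accept a neighbour food source (minimal sketch).

    foods : (N, d) array of food locations; f : objective to minimise.
    The single-dimension perturbation follows the canonical ABC and is an
    assumption about the exact variant compared in the paper.
    """
    N, d = foods.shape
    r = np.random.choice([j for j in range(N) if j != i])  # random partner, r != i
    j = np.random.randint(d)                               # perturb one dimension
    phi = np.random.uniform(-1, 1)
    v = foods[i].copy()
    v[j] = foods[i, j] + phi * (foods[i, j] - foods[r, j])
    # fitness mapping of (4): 1/(1+f) if f >= 0, else 1+|f|
    fit = lambda val: 1.0 / (1.0 + val) if val >= 0 else 1.0 + abs(val)
    if fit(f(v)) > fit(f(foods[i])):                       # greedy selection
        foods[i] = v
    return foods
```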

2.3. Electromagnetism-Like (EM) Algorithm

The EM algorithm, proposed by İlker et al. [12], is a simple population-based search algorithm inspired by the electromagnetism phenomenon. In EM, individuals emulate charged particles which interact with each other based on the electromagnetism laws of repulsion and attraction. The method utilizes $N$ $d$-dimensional points $\mathbf{x}_i^k$, $i = 1, 2, \ldots, N$, where each point $\mathbf{x}_i^k$ is a $d$-dimensional vector containing the parameter values to be optimized, whereas $k$ denotes the iteration (or generation) number. The initial population $\mathbf{X}^0$ (being $k = 0$) is taken from uniformly distributed samples of the search space. We denote the population set at the $k$th generation by $\mathbf{X}^k$, because members of $\mathbf{X}^k$ change with $k$. After the initialization of $\mathbf{X}^0$, EM continues its iterative process until a stopping condition (e.g., the maximum number of generations, gen) is met. An iteration of EM consists of three steps. In the first step, each point in $\mathbf{X}^k$ moves to a different location by using the attraction-repulsion mechanism of the electromagnetism theory. In the second step, points moved by the electromagnetism principle are further moved locally by a local search procedure. Finally, in the third step, in order to generate the new population $\mathbf{X}^{k+1}$, a greedy selection process selects the best points between those produced by the local search procedure and the originals. Both the attraction-repulsion mechanism and the local search in EM are responsible for driving the members of $\mathbf{X}^k$ to the proximity of the global optimum.
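The sketch below illustrates only the attraction-repulsion step: charges are assigned so that better points carry larger charges, and each point accumulates attracting forces from better points and repelling forces from worse ones. The charge and force expressions follow the standard EM formulation and are an assumption about the exact variant used here; the local-search step is omitted.

```python
import numpy as np

def em_forces(X, fvals):
    """Charges and total forces of the EM attraction-repulsion mechanism
    (minimal sketch). X : (N, d) points, fvals : their objective values
    (minimisation)."""
    N, d = X.shape
    best = fvals.min()
    # better (lower) objective value -> larger charge
    q = np.exp(-d * (fvals - best) / (np.sum(fvals - best) + 1e-12))
    F = np.zeros_like(X)
    for i in range(N):
        for j in range(N):
            if i == j:
                continue
            diff = X[j] - X[i]
            dist2 = np.dot(diff, diff) + 1e-12
            if fvals[j] < fvals[i]:      # better point attracts
                F[i] += diff * q[i] * q[j] / dist2
            else:                        # worse point repels
                F[i] -= diff * q[i] * q[j] / dist2
    return F
```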

2.4. Cuckoo Search (CS) Method

CS is one of the latest nature-inspired algorithms, developed by Yang and Deb [13]. CS is based on the brood parasitism of some cuckoo species. In addition, this algorithm is enhanced by the so-called Lévy flights [27], rather than by simple isotropic random walks. From the implementation point of view, in the CS operation, a population of $N$ eggs (individuals) is evolved from the initial point ($k = 0$) to a total gen number of iterations ($k = \mathrm{gen}$). Each egg $\mathbf{e}_i^k$ ($i \in \{1, 2, \ldots, N\}$) represents a $d$-dimensional vector where each dimension corresponds to a decision variable of the optimization problem to be solved. The quality of each egg (candidate solution) is evaluated by using an objective function $f(\mathbf{e}_i^k)$ whose final result represents the fitness value of $\mathbf{e}_i^k$. Three different operators define the evolution process of CS: (A) the Lévy flight, (B) the replacing of nests by constructing new solutions, and (C) the elitist selection strategy.

(A) The Lévy Flight. One of the most powerful features of cuckoo search is the use of Lévy flights to generate new candidate solutions (eggs). Under this approach, a new candidate solution $\mathbf{e}_i^{\mathrm{new}}$ is produced by perturbing the current solution $\mathbf{e}_i^{k}$ with a change of position $\Delta\mathbf{e}_i$. In order to obtain $\Delta\mathbf{e}_i$, a random step $\mathbf{s}$ is generated by a symmetric Lévy distribution. For producing $\mathbf{s}$, Mantegna's algorithm [28] is employed as follows:
$$\mathbf{s} = \frac{\mathbf{u}}{\left|\mathbf{v}\right|^{1/\beta}}, \tag{5}$$
where $\mathbf{u}$ and $\mathbf{v}$ are $d$-dimensional vectors and $\beta = 3/2$. Each element of $\mathbf{u}$ and $\mathbf{v}$ is calculated by considering the following normal distributions:
$$u \sim N\!\left(0, \sigma_u^2\right), \quad v \sim N\!\left(0, \sigma_v^2\right), \quad \sigma_u = \left(\frac{\Gamma(1+\beta)\sin(\pi\beta/2)}{\Gamma\!\left(\frac{1+\beta}{2}\right)\beta\,2^{(\beta-1)/2}}\right)^{1/\beta}, \quad \sigma_v = 1, \tag{6}$$
where $\Gamma(\cdot)$ represents the Gamma function. Once $\mathbf{s}$ has been calculated, the required change of position $\Delta\mathbf{e}_i$ is computed as follows:
$$\Delta\mathbf{e}_i = 0.01 \cdot \mathbf{s} \odot \left(\mathbf{e}_i^{k} - \mathbf{e}^{\mathrm{best}}\right), \tag{7}$$
where the product $\odot$ denotes entrywise multiplication, whereas $\mathbf{e}^{\mathrm{best}}$ is the best solution (egg) seen so far in terms of its fitness value. Finally, the new candidate solution $\mathbf{e}_i^{\mathrm{new}}$ is calculated by using
$$\mathbf{e}_i^{\mathrm{new}} = \mathbf{e}_i^{k} + \Delta\mathbf{e}_i. \tag{8}$$
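For reference, a compact sketch of Mantegna's algorithm for drawing the step $\mathbf{s}$ of (5)-(6) is given below; beta defaults to 1.5, the value commonly used with CS, and is an assumption here.

```python
import numpy as np
from math import gamma, sin, pi

def levy_step(d, beta=1.5):
    """Draw a d-dimensional Levy step via Mantegna's algorithm (minimal sketch)."""
    sigma_u = (gamma(1 + beta) * sin(pi * beta / 2) /
               (gamma((1 + beta) / 2) * beta * 2 ** ((beta - 1) / 2))) ** (1 / beta)
    u = np.random.normal(0.0, sigma_u, d)   # u ~ N(0, sigma_u^2)
    v = np.random.normal(0.0, 1.0, d)       # v ~ N(0, 1)
    return u / np.abs(v) ** (1 / beta)
```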

(B) Replacing Some Nests by Constructing New Solutions. Under this operation, a set of individuals (eggs) is probabilistically selected and replaced with a new value. Each individual $\mathbf{e}_i^{k}$ can be selected with a probability $p_a$. In order to implement this operation, a uniform random number $r_1$ is generated within the range $[0,1]$; if $r_1$ is less than $p_a$, the individual $\mathbf{e}_i^{k}$ is selected and modified, whereas otherwise it remains with no change. This operation can be summarized by the following model:
$$\mathbf{e}_i^{\mathrm{new}} = \begin{cases} \mathbf{e}_i^{k} + \mathrm{rand}\cdot\left(\mathbf{e}_{d_1}^{k} - \mathbf{e}_{d_2}^{k}\right) & \text{with probability } p_a, \\ \mathbf{e}_i^{k} & \text{with probability } 1 - p_a, \end{cases} \tag{9}$$
where rand is a normally distributed random number, whereas $d_1$ and $d_2$ are random integers from 1 to $N$.

(C) The Elitist Selection Strategy. After producing $\mathbf{e}_i^{\mathrm{new}}$ either by operator A or by operator B, it must be compared with its past value $\mathbf{e}_i^{k}$. If the fitness value of $\mathbf{e}_i^{\mathrm{new}}$ is better than that of $\mathbf{e}_i^{k}$, then $\mathbf{e}_i^{\mathrm{new}}$ is accepted as the final solution; otherwise, $\mathbf{e}_i^{k}$ is retained. This procedure can be summarized by the following statement:
$$\mathbf{e}_i^{k+1} = \begin{cases} \mathbf{e}_i^{\mathrm{new}} & \text{if } f\!\left(\mathbf{e}_i^{\mathrm{new}}\right) < f\!\left(\mathbf{e}_i^{k}\right), \\ \mathbf{e}_i^{k} & \text{otherwise}. \end{cases} \tag{10}$$
The elitist selection strategy implies that only high-quality eggs (the best solutions, near to the optimal value), which are the most similar to the host bird's eggs, have the opportunity to develop (next generation) and become mature cuckoos.
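Operators B and C can be combined as in the sketch below, where each nest is replaced with probability $p_a$ by the random-difference perturbation of (9) and the replacement is kept only if it improves the objective, as in (10); $p_a = 0.25$ and the array layout are assumptions for illustration.

```python
import numpy as np

def cs_abandon_and_select(nests, fvals, f, pa=0.25):
    """CS operators B and C (minimal sketch): probabilistic nest replacement
    followed by elitist (greedy) selection.

    nests : (N, d) array of eggs, fvals : their objective values,
    f : objective function to minimise; pa is an assumed default.
    """
    N, d = nests.shape
    # operator B: with probability pa, perturb each nest by a random difference
    mask = np.random.rand(N, 1) < pa
    r = np.random.randn(N, 1)                      # normally distributed scale
    d1 = np.random.randint(0, N, N)
    d2 = np.random.randint(0, N, N)
    candidates = np.where(mask, nests + r * (nests[d1] - nests[d2]), nests)
    # operator C: keep a candidate only if it improves the objective (elitism)
    for i in range(N):
        fc = f(candidates[i])
        if fc < fvals[i]:
            nests[i], fvals[i] = candidates[i], fc
    return nests, fvals
```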

2.5. Flower Pollination Algorithm (FPA)

The flower pollination algorithm (FPA), proposed by Yang [14], is an ECT inspired by the pollination process of flowers. In FPA, individuals emulate a set of flowers or pollen gametes which behave according to biological laws of the pollination process. From the implementation point of view, in the FPA operation, a population of $N$ flower positions (individuals) is evolved from the initial point ($k = 0$) to a total gen number of iterations ($k = \mathrm{gen}$). Each flower $\mathbf{f}_i^k$ represents a $d$-dimensional vector where each dimension corresponds to a decision variable of the optimization problem to be solved. In FPA, a new population is produced by considering two operators: local and global pollination. A probabilistic global pollination factor $p$ is associated with such operators. In order to decide which operator should be applied to each current flower position $\mathbf{f}_i^k$, a uniform random number $r_1$ is generated within the range $[0,1]$. If $r_1$ is less than $p$, the global pollination operator is applied to $\mathbf{f}_i^k$. Otherwise, the local pollination operator is considered.

Global Pollination Operator. Under this operator, the original position $\mathbf{f}_i^{k}$ is displaced to a new position $\mathbf{f}_i^{k+1}$ according to the following model:
$$\mathbf{f}_i^{k+1} = \mathbf{f}_i^{k} + \mathbf{s} \odot \left(\mathbf{g} - \mathbf{f}_i^{k}\right), \tag{11}$$
where $\mathbf{g}$ is the global best position seen so far, whereas $\mathbf{s}$ controls the length of the displacement. The value of $\mathbf{s}$ is generated by a symmetric Lévy distribution according to (5)-(6).

Local Pollination Operator. In the local pollination operator, the current position $\mathbf{f}_i^{k}$ is perturbed to a new position $\mathbf{f}_i^{k+1}$ as follows:
$$\mathbf{f}_i^{k+1} = \mathbf{f}_i^{k} + \epsilon\left(\mathbf{f}_j^{k} - \mathbf{f}_m^{k}\right), \tag{12}$$
where $\mathbf{f}_j^{k}$ and $\mathbf{f}_m^{k}$ are two randomly chosen flower positions, satisfying the condition $j \neq m \neq i$. The scale factor $\epsilon$ is a random number between $[0,1]$.
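A sketch of one FPA position update, covering both (11) and (12), is shown below; the Lévy step is passed in as a callable (for instance the levy_step helper sketched in the CS subsection), and p = 0.8 is an assumed default.

```python
import numpy as np

def fpa_move(flowers, i, gbest, levy, p=0.8):
    """One FPA position update (minimal sketch): global pollination via a
    Levy-distributed step with probability p, local pollination otherwise.

    flowers : (N, d) array, gbest : (d,) global best position,
    levy : callable returning a d-dimensional Levy step.
    """
    N, d = flowers.shape
    if np.random.rand() < p:
        # global pollination: Levy step towards the global best, as in (11)
        return flowers[i] + levy(d) * (gbest - flowers[i])
    # local pollination: mix two randomly chosen flowers j, m != i, as in (12)
    j, m = np.random.choice([q for q in range(N) if q != i], 2, replace=False)
    eps = np.random.rand()
    return flowers[i] + eps * (flowers[j] - flowers[m])
```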

3. IIR Model Identification (Problem Formulation)

System identification is the mathematical representation of an unknown system by using only input-output data. In a system identification configuration, an optimization algorithm attempts to iteratively determine the adaptive model parameters to get an optimal model for the unknown plant based on minimizing some error function between the output of the candidate model and the actual output of the plant.

The use of infinite impulse response (IIR) models for identification is preferred over their equivalent FIR (finite impulse response) models since the former produce more accurate models of physical plants for real world applications [6]. In addition, IIR models are typically capable of meeting performance specifications using fewer model parameters. Figure 1 represents an IIR identification model of any arbitrary system.

An IIR system can be represented by the following transfer function:
$$H(z) = \frac{b_0 + b_1 z^{-1} + \cdots + b_M z^{-M}}{1 + a_1 z^{-1} + \cdots + a_N z^{-N}}, \tag{13}$$
where $M$ and $N$ define the number of numerator and denominator coefficients of the transfer function, respectively, and $a_i$ and $b_j$ are the pole and zero parameters of the IIR model $H(z)$. Equation (13) can be written as a difference equation of the form
$$y(k) = \sum_{j=0}^{M} b_j\, x(k-j) - \sum_{i=1}^{N} a_i\, y(k-i), \tag{14}$$
where $x(k)$ and $y(k)$ represent the $k$th input and output of the system, respectively. Therefore, the set of unknown parameters that models the IIR system is represented by $\theta = \{a_1, \ldots, a_N, b_0, \ldots, b_M\}$. Considering that the number of unknown parameters of $\theta$ is $N + M + 1$, the search space $\Theta$ of feasible values for $\theta$ is $\mathbb{R}^{N+M+1}$.

According to the block diagram of Figure 1, the output of the plant is $d(k)$ whereas the output of the IIR filter is $y(k)$. The output difference between the actual system and its model yields the error $e(k) = d(k) - y(k)$. Hence, the problem of IIR model identification can be considered as a minimization problem of the function $J(\theta)$ stated as follows:
$$J(\theta) = \frac{1}{W} \sum_{k=1}^{W} \left(d(k) - y(k)\right)^2, \tag{15}$$
where $W$ is the number of samples used in the simulation.

The aim is to minimize the cost function $J(\theta)$ by adjusting $\theta$. The optimal model or solution $\theta^{*}$ is attained when the error function reaches its minimum value, as follows:
$$\theta^{*} = \arg\min_{\theta \in \Theta} J(\theta). \tag{16}$$
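To make the cost (15) concrete, the following sketch simulates the difference equation (14) for a given parameter vector and returns the mean-squared error against the plant output; the ordering of the coefficients inside theta is an assumption for illustration.

```python
import numpy as np

def iir_output(theta, x, M, N):
    """Simulate the IIR difference equation (14) (minimal sketch).

    theta is assumed to stack the feedback coefficients a_1..a_N followed by
    the feedforward coefficients b_0..b_M; x is the input sequence."""
    a = theta[:N]               # a_1 .. a_N
    b = theta[N:N + M + 1]      # b_0 .. b_M
    y = np.zeros(len(x), dtype=float)
    for k in range(len(x)):
        ff = sum(b[j] * x[k - j] for j in range(M + 1) if k - j >= 0)
        fb = sum(a[i - 1] * y[k - i] for i in range(1, N + 1) if k - i >= 0)
        y[k] = ff - fb
    return y

def cost_J(theta, x, d, M, N):
    """Mean-squared error J(theta) of (15) between the plant output d(k)
    and the model output y(k) over W samples."""
    y = iir_output(theta, x, M, N)
    return np.mean((d - y) ** 2)
```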

4. Experimental Results

In the comparison study, a comprehensive set of experiments has been used to test the performance of each evolutionary computation technique. The set considers IIR models of different orders. Such an experimental set has been carefully selected to assure compatibility with similar works reported in the literature [18–21]. In the comparison, five ECT have been considered: PSO, ABC, EM, CS, and FPA.

The parameter settings for each evolutionary computation algorithm used in the comparison are described as follows.
(1) PSO: the parameters are set as suggested in [18]; besides, the inertia weight factor decreases linearly from 0.9 to 0.2.
(2) ABC: the algorithm has been implemented using the guidelines provided by its own reference [19].
(3) EM: the particle number is set to 50, with the remaining parameters configured according to [12, 20]; such values represent the best possible configuration.
(4) CS: the parameters and the number of generations are set according to [13, 21].
(5) FPA: the probabilistic global pollination factor is set to 0.8. Under such a value, the algorithm presents the best performance according to [14].

For all algorithms, the population size has been set to 25, whereas the maximum iteration number has been configured to 3000 generations.

The results are divided into two sections. In the first, the performance of each ECT on each identification experiment is presented. In the second, the results are analyzed from a statistical point of view by using the Wilcoxon test.

4.1. IIR Model Identification Results

The results are reported considering three experiments that include a second-order plant with a first-order IIR model; a second-order plant with a second-order IIR model; and finally, a high-order plant with a high-order model. Each case is discussed below.

(1) A Plant with a Second-Order System and a First-Order IIR Model (First Experiment). In this experiment, each algorithm is applied to identify a second-order plant through a first-order IIR model. Under such conditions, the unknown plant holds a second-order transfer function whereas the IIR model holds a first-order one. In the simulations, a white sequence of 100 samples has been considered for the input $x(k)$. Since a reduced-order model is employed to identify a plant of superior order, $J(\theta)$ is multimodal [19]. The error surface is shown in Figure 2.

The performance evaluation over 30 different executions is reported in Table 1, considering the following indexes: the best parameter values (ABP), the average value (AV), and the standard deviation (SD). The best parameter values (ABP) report the best model parameters obtained during the 30 executions, while the average value (AV) indicates the average minimum value of $J(\theta)$ over the same number of executions. Finally, the standard deviation (SD) reports the dispersion from the average value over the 30 executions.

According to Table 1, the CS algorithm provides better results than PSO, ABC, and EM. In particular, the results show that CS maintains a considerable precision (the lowest AV value) and more robustness (the smallest SD value). Nevertheless, the CS performance is similar to that of the FPA algorithm. On the other hand, the worst performance is reached by the PSO algorithm. Such a fact corresponds to its difficulty (premature convergence) in overcoming local minima in multimodal functions.

(2) A Plant with a Second-Order System and a Second-Order IIR Model (Second Experiment). In the second experiment, the performance of each algorithm is evaluated on the identification of a second-order plant through a second-order IIR model; hence, both the unknown plant and the IIR model hold second-order transfer functions. For the simulations, the input $x(k)$ that is applied simultaneously to the system and to the IIR model has been configured as a white sequence with 100 samples. Since the order of the model is equal to the order of the to-be-identified system, only one global minimum exists in $J(\theta)$ [19]. The results of this experiment over 30 different executions are reported in Table 2.

The results in Table 2 show that PSO, ABC, EM, CS, and FPA reach similar performance values. The evidence shows that evolutionary algorithms maintain a similar average performance when they face unimodal low-dimensional functions [29, 30]. In particular, the test indicates that the small difference in performance is directly related to the better exploitation mechanisms included in CS and FPA.

(3) A Superior-Order Plant and a High-Order Model (Third Experiment). Finally, the performance of each algorithm is evaluated on the identification of a superior-order plant through a high-order IIR model. In this case, the unknown plant holds a sixth-order transfer function whereas the IIR model holds a fourth-order one; consequently, the error surface is multimodal, just as in the first experiment. A white sequence with 100 samples has been used as input. The results of this experiment over 30 different executions are reported in Tables 3 and 4. Table 3 presents the best parameter values (ABP) whereas Table 4 shows the average value (AV) and its standard deviation (SD).

According to the AV and SD indexes in Table 4, the CS algorithm finds better results than PSO, ABC, EM, and FPA. The results show that CS presents better precision (AV value) and robustness (SD value). These results also indicate that CS, FPA, and EM are able to identify the sixth-order plant under different accuracy levels. On the other hand, PSO and ABC obtain suboptimal solutions whose parameters weakly model the unknown system.

4.2. Statistical Analysis

In order to statistically validate the results, a nonparametric statistical significance test known as Wilcoxon's rank sum test for independent samples [31, 32] has been conducted over the average value (AV) data of Tables 1, 2, and 4, with a 5% significance level. The test has been conducted considering 30 different executions for each algorithm. Table 5 reports the p values produced by Wilcoxon's test for the pairwise comparison of the average values of four groups. Such groups are formed by CS versus PSO, CS versus ABC, CS versus EM, and CS versus FPA. As a null hypothesis, it is assumed that there is no significant difference between the averaged values of the two algorithms. The alternative hypothesis considers a significant difference between the AV values of both approaches.
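As an illustration of how such a pairwise test can be run, the sketch below applies SciPy's rank-sum test to two hypothetical arrays of 30 per-execution AV results; the variable names are illustrative only.

```python
from scipy.stats import ranksums

def wilcoxon_pairwise(av_cs, av_other, alpha=0.05):
    """Wilcoxon rank-sum test between two sets of 30 AV values (minimal
    sketch; av_cs and av_other are hypothetical per-execution minima of
    J(theta) for CS and a competitor)."""
    stat, p = ranksums(av_cs, av_other)
    # p < alpha -> reject the null hypothesis of equal performance
    return p, p < alpha
```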

For the cases of PSO, ABC, and EM, all p values reported in Table 5 are less than 0.05 (5% significance level), which is strong evidence against the null hypothesis. Therefore, such evidence indicates that the CS results are statistically significant and have not occurred by coincidence (i.e., due to common noise contained in the process). On the other hand, since the p values for the case of CS versus FPA are greater than 0.05, there is no statistically significant difference between the two. Therefore, it can be concluded that the CS algorithm is better than PSO, ABC, and EM in the application of IIR modeling for system identification. However, CS presents the same performance as FPA, and therefore there is no statistical evidence that CS surpasses the FPA algorithm.

5. Conclusions

This paper presents a comparison study between five evolutionary algorithms for IIR-based model identification. In this research, the identification task is considered as an optimization problem. In the comparison, special attention is paid to recently developed algorithms such as the cuckoo search (CS) and the flower pollination algorithm (FPA), also including popular approaches such as particle swarm optimization (PSO), artificial bee colony (ABC) optimization, and the electromagnetism-like (EM) optimization algorithm.

The comparison has been experimentally evaluated over a test suite of three benchmark experiments with different ranges of complexity. The experimental results have demonstrated that CS outperforms PSO, ABC, and EM in terms of both accuracy (AV values) and robustness (SD values), within a statistically significant framework (Wilcoxon test). However, there is no statistical evidence that CS surpasses the FPA performance.

The remarkable performance of CS and FPA is explained by two different features: (i) operators (such as the Lévy flight) that allow a better exploration of the search space, increasing the capacity to find multiple optima, and (ii) exploitation operators that allow a more precise refinement of previously found solutions.

Conflict of Interests

The authors declare that there is no conflict of interests regarding the publication of this paper.