Abstract

In order to improve the iris classification rate, a novel biogeography-based optimization algorithm (NBBO) based on local search and nonuniform variation was proposed in this paper. Firstly, the linear migration model was replaced by a hyperbolic cotangent model which was closer to the natural law. And, the local search strategy was added to traditional BBO algorithm migration operation to enhance the global search ability of the algorithm. Then, the nonuniform variation was introduced to enhance the algorithm in the later iteration. The algorithm could achieve a stronger iris classifier by lifting weaker similarity classifiers during the training stage. On this base, the convergence condition of NBBO was proposed by using the Markov chain strategy. Finally, simulation results were given to demonstrate the effectiveness and efficiency of the proposed iris classification method.

1. Introduction

Iris classification is especially suitable for recognition with uniqueness, stability, inviolability, and reliability characteristics and has been one of the hottest biological characteristic recognition research spot recently [1]. It has been widely used technology in national defence, financial industry, and entrance guard system [2, 3]. Alfred Wallace and Charles Darwin proposed the theory of biogeography to study the distribution, migration, and extinction of habitats of biological species in the 19th century. Inspired by biogeography, Simon proposed the BBO algorithm in 2008 [4]. BBO is a new swarm intelligence optimization algorithm and solves the optimization problem by simulating the mathematical model of species migration in biogeography. In general, BBO can find a good initial candidate solution and generate an acceptable optimization solution, which has been widely used in image processing [5], scheduling optimization [6, 7], parameter estimation [8], power flow calculation [9], and load analysis [10]. However, BBO has some shortcomings, such as weak search ability and easy to fall into local optimum in the later period of operation. How to use BBO global search ability to improve the iris classification rate has important theoretical and practical value.

In recent years, many researchers have worked on the improvement of the BBO performance. For instance, Wang et al. [11] combined the chaotic mapping strategy with the BBO optimal migration model and proposed a biogeography optimization algorithm based on based on the adaptive population migration mechanism. Feng et al. [12] proposed an improved BBO with random ring topology and Powell’s method, in which the local loop topology was used instead of the global topology. The adaptive Powell method was modified to adapt the evolutionary process to improve the accuracy of the solution. Li et al. [13] designed a hybrid algorithm by combining the artificial bee colony algorithm (ABC) with BBO to obtain ABC’s exploring ability and BBO’s developing ability. Chen et al. [14] combined Cuckoo Search (CS) and bBBO using two search strategies: heterogeneous rhododendron search and biogeography-based discovery. Zhao et al. [15] proposed an optimization method based on two-stage differential biogeography to solve the problem of premature convergence and reduce rotation variance.

In order to improve the iris classification rate, the NBBO based on local search and nonuniform variation was proposed in this paper. Firstly, the hyperbolic cotangent migration model was used to replace the original linear migration model to improve the adaptability of the algorithm. Then, the local search strategy was added to the BBO migration operation in order to enhance the algorithm search ability and improve the convergence speed of the algorithm. The nonuniform variation was used in the operation to increase the search ability of the algorithm and prevent the local optimum, which could achieve a stronger iris classifier by lifting weaker similarity classifiers during the training stage.

The rest of the paper was organized as follows. Section 2 presented the improved biogeography-based optimization algorithm, including migration model, migration operations, and nonuniform variation operation. Section 3 presented NBBO test on different test functions and iris classification samples. Finally, a brief conclusion and future work are given in Section 4.

2. Methods

In nature, biological species live in different areas with obvious boundaries, which are called habitats. Habitat suitability index (HSI) is used to describe the survival degree of species suitable for habitat. HSI is affected by humidity, temperature, habitat area, and vegetation diversity. These factors are called suitability variables (SIV). Habitats with higher HSI can accommodate more biological species, whereas habitats with lower HSI can only accommodate fewer biological species. BBO is a population-based algorithm, which regards each solution as a habitat, fitness as HSI, and each component of solution as a SIV. The optimization problem can be solved by simulating the migration and mutation process in biogeography. The two main operations of BBO are migration operation and mutation operation [16, 17].

Like other intelligent algorithms, BBO is prone to oscillation in the iterative process, resulting in slow and premature convergence, poor local search ability, and inaccuracy. This section presents the NBBO algorithm in terms of migration model and operation. The flow chart of the improved algorithm is shown in Figure 1.

2.1. Migration Model

There are different migration models in biogeography. As shown in formula (1), the original BBO adopts the simplest linear migration model. But, the ecosystem is nonlinear in nature; small changes in a certain part of the system may have complex effects on the whole system. For the actual species, the migration process is more complicated than what is described by the linear model. Among the related improved algorithms, the cosine mobility model proposed by Ma performs best under most of the functions tested [18], and the mobility model close to the natural law is better than the simple linear model. In order to better adapt to the nonlinear migration problem, this paper proposes a hyperbolic cotangent nonlinear migration model to improve the performance of the basic BBO algorithm.where and are related to the number of species and is the maximum number of species. represents the immigration rate which refers to the probability of species moving into the habitat, represents the emigration rate which refers to the probability of species moving out of the habitat, represents the maximum immigration rate, and represents the maximum emigration rate, all of which are functions of the number of habitat populations . When the number of habitat species is 0, the immigration rate is the highest, that is, , the migration rate is the smallest, that is, . When the number of species reaches the maximum number of species that can be accommodated in the habitat, the immigration rate drops to 0, and the migration rate reaches the maximum, that is, . When the number of species is , the immigration rate and the emigration rate of the species are equal, that is, , and the dynamic equilibrium state is reached at this time.

In the hyperbolic tangential mobility model, the trend of mobility changing with species number is like that of cosine model, but the amplitude is more moderate than that of cosine model.

2.2. Local Search Migration Operations

In the original BBO, a generalized search was carried out based on the immigration rate and the emigration rate, a habitat with a higher emigration rate was randomly selected, and the characteristics of the corresponding position (SIV) were moved to the habitat with low HSI. However, in some habitats with high migration rates, the characteristics of the corresponding locations SIV are not necessarily the best for the migrated habitats. If migrated directly, it may result in a decrease in the suitability index of the move into habitat.

Therefore, in order to enhance the global search ability of the algorithm and improve the convergence speed of the algorithm, the selected migration characteristics can be modified by using equation (6), which can reduce the poor migration characteristics from the habitat with higher migration rate to the low migration rate. Similarly, if you move out of the same habitat with a high migration rate each time, as the number of evolutions increases, the population diversity decreases, and the convergence speed will slow down:where represents a space allocated to each individual in the population for storing algorithm parameters. represents the current individual. represents the new individual generated by the local search. and .

2.3. Nonuniform Variation Operation

In the early stage of iteration, the population position is usually far from the optimal solution and needs to be iterated many times to find a feasible search space when the search radius is too small. At the end of iteration, the population position is close to or near the optimal solution. A small range of searches is used as a fine-tuning of the solution vector. It is shown by equation (7), as the traditional mutation operator cannot search the optimal solution efficiently, the nonuniform mutation operator is introduced. The whole population is searched at a large range during the early stage of search. As the number of iterations increases, the search range is gradually narrowed, and the algorithm is effectively balanced to escape from local minimum.where denotes the probability for the number of species; denotes the species number; and denotes the maximum variation rate given; .

The specific operation of nonuniform variation is like uniform variation, but it focuses on searching for small areas near the original individual. Assuming that the variation of the component of the generation ( is the current number of iterations) of Habitat is performed, the variation component is as follows:where and are the upper and lower limits of component , respectively. and represent random numbers in the range of and meet the nonuniform distribution:

This feature allows the mutation operator to search the entire space evenly in the initial stage of the algorithm (when is very small), and the latter part of the algorithm focuses on several local range exact searches. can be defined as follows:where is a random number that fits evenly within the range, is the maximum number of iterations, and is the system parameter that determines the degree of dependency on the number of iterations, select here .

Then, the specific steps of NBBO algorithm are as follows: (Algorithm 1)

Begin
Algorithmic parameters setting:
Population size = 50,
Maximum number of iterations G = 500,
Maximum immigration rate ,
Maximum emigration rate ,
Mutation probability ,
Number of elites retained .
  / Initialization /
Random generation of a set of initial habitats constitutes the initial population.
Calculate the fitness value of each habitat in the population.
/ is the condition for the end of the iteration /
Sort the habitat in descending order according to the fitness value.
The immigration rate and emigration rate of each habitat were calculated according to formula (1).
   / End of initialization/
  / Migration /
 Select according to the immigration probability
/ is dimension /
  
    Select according to the migration probability
   Randomly select a characteristic variable from
   Replace a random characteristic variable in with
  else
   Perform a local search on the characteristic variable
    according to formula (6) can get .
   Replace random characteristic variable in with .
  end if
 end for
end for
  / End of migration /
  / Mutation /
 Calculate the probability of mutation according to and
 Selection of unmutated habitat based on
  
  Replace with a randomly generated by formula (8)
   
    Replace with a randomly generated by (9)
   end if
  end if
 end if
end for
/ End of mutation /
    Recalculate the habitat fitness values
end while
end
2.4. Convergence Analysis

By using Markov chain strategy, we can analyze the convergence of the proposed NBBO algorithms [19, 20]. Let be a population with improved algorithm algebra of . We divide into subsets based on the fitness of the population. is a subset of them; next, we use Markov chain to analyze the global convergence of the improved BBO algorithm.

The evolution process of the NBBO algorithm is mainly composed of selection, migration and mutation operations, which are independent of evolution algebra. Considering the number of individuals of BBO algorithm, the evolution process of the NBBO algorithm satisfies the finite homogeneous Markov chain. Population subset can be equivalent to a state on a finite homogeneous Markov chain, means the probability of being in state , and then and .

The transition probability of state is as follows:

Then, the state transition matrix of the Markov chain iswhere

It can be found by formulas (12) and (13) , . Since the sum of the probabilities of each row in the Markov transition matrix is 1, . It can be concluded that the state transition matrix of Markov chain satisfies the reducible random matrix. According to [21], the definition of a reductive random matrix is as follows:

Let matrix of order be a reducible random matrix which can be obtained by the same row transformation and column transformation, namely,where is the order primitive matrix and and are nonzero matrices.

Then, we have

The above matrix is a stable random matrix, and satisfies the following conditions:

Therefore, state transition matrix is a reductive random matrix, then

In formula (12) and , since the sum of the probabilities of each line in the Markov transition matrix is 1; there must be , then

Formula (19) shows that when the number of iterations , probability , That is, no matter how the initial state can finally converge to the global optimal solution with probability 1, it can be concluded thatwhere is the optimal fitness value of population , represents the global optimal fitness value, and indicates the probability that the optimal individual of the generation is the global optimal value. It is proved that the NBBO algorithm has global convergence.

3. Simulation Experiments

3.1. Test Function Simulation

In order to test the optimization performance of the proposed IMBBO algorithm, 13 standard test functions were selected for simulation experiments [22, 23]. Table 1 shows the test function expression, the range of values for each independent variable, and the theoretical optimal value. We select the optimal value, average value, and standard deviation of the test results as the evaluation criteria to test the algorithm performance.

The experimental function of Table 1 is numerically tested using the proposed NBBO algorithm proposed in this paper, and the basic BBO algorithm, grey wolf optimization algorithm [24] (DE), elephant herding optimization [25] (EHO), salp swarm algorithm [26] (DEBBO), and particle swarm optimization biogeography optimization algorithm [27] and (BBOPSO) for comparison of results. The experimental parameter settings of the algorithm are shown in Table 2.

To study the performance of the algorithm in different dimensions, the dimensions of test function and are , the dimension of test functions is , and the dimension of test functions is . In order to ensure that the comparison between each method is fair, we ensure that the experimental environment of each method is consistent during the experiment: operating system: window 10, memory: 8 GB, programming language: MATLAB 2016, and ensure that the setting of common parameters of the algorithm is consistent, and the parameters of different algorithms are set according to the parameters of specific references. As the algorithm is running randomly, the six algorithms independently run each test function 30 times to avoid unnecessary errors caused by the random operation of the algorithm.

Tables 3 and 4 show the recorded optimal function value, average value, and standard deviation obtained by the algorithm. In the iterative process of the algorithm, we record the best fitness of each evaluation, find the average optimal value of each evaluation and draw a curve, which reflects the convergence trend of the algorithm. The convergence curves of some test functions are shown in Figures 2 and 3.

In order to compare the optimization performance of each algorithm more easily and clearly, the algorithm with the best optimization performance of the optimal value, average value and standard deviation of each test function is boldly marked in Tables 3 and 4. From Tables 3 and 4 and the simulation diagram, we find that the convergence speed of the analytical algorithm can be qualitatively and quantitatively analyzed. For the simple multidimensional unimodal function in Figure 2, the NBBO algorithm has faster convergence speed and solution precision in the later iteration although its advantage is not obvious in the initial stage of iteration. For the more complex multidimensional multimodal function in Figure 3, the other five algorithms have the phenomenon of search stagnation or slow search, i.e., they fall into local optimum and are difficult to jump out. Then, by comparing the optimal value of each test function and the average value of the optimization results, we can see that IMBBO algorithm has achieved a smaller value. Although the optimization results of DE, DEBBO, and BBOPSO algorithms also reach the global optimal value in the optimization of some test functions with different dimensions, it can be seen from the optimization results of single peak function and multi peak function that NBBO algorithm has a great advantage in the optimization ability. Figures 2 and 3 show that the proposed NBBO algorithm can not only improve the convergence of the algorithm but also improve the evolution ability and the ability to jump out of the local optimal solution. It is superior to the other four comparison algorithms in the stability of the optimization result. In summary, when solving the optimization problem of single-peak or multi-peak functions, the proposed NBBO can quickly and effectively find the optimal solution under a short number of iterations, and its optimal solution has higher precision and stability.

Considering the fixed dimension test function has few dimensions to be adjusted, the accuracy and stability of each algorithm are relatively high. For the multidimensional test function, when its dimension D is increased from 10 to 30 or 50, the accuracy and stability of the algorithm have decreased. This is because the objective function is more complicated and the algorithm needs to make more adjustments. However, no matter how the dimension changes, the proposed NBBO algorithm still maintains a stable and high-precision optimization performance for high-dimensional complex functions. In reference [28], derrick proposed that for the evaluation of the performance of the improved evolutionary algorithm, only based on the average value and the standard deviation value to compare the advantages and disadvantages of the algorithm is not rigorous enough, so statistical test should be carried out. The statistical test shows that the improved algorithm has significant improvement advantages over other existing algorithms. Therefore, in order to verify the significant difference between the IMBBO algorithm and other algorithms, a nonparametric estimation method, Wilcoxon rank sum test, is used for statistical analysis. For each test function, the 30 times solution results of IMBBO algorithm are tested with those of BBO, DE, EHO, DEBBO, and BBOPSO algorithms respectively, and H0 is assumed: the solution results of the two algorithms are generally the same; H1: the solution results of the two algorithms are generally different. Set the threshold value of significance level of zero hypothesis test α = 0.05. When the test result , reject the zero hypothesis; the experimental results of the two algorithms have significant difference; when the test result , accept the zero hypothesis; the experimental results of the two algorithms have no significant difference. The specific inspection results are shown in Tables 5 and 6.

Tables 5 and 6 show the test results of test function . Among them, for the probability p value less than the significance level threshold value of 0.05, bold display; “W” is the significance test judgment result, combined with Tables 3 and 4, use “+” to indicate that NBBO is more accurate and significant than other algorithms, use “−” to indicate that NBBO is less accurate and significant than other algorithms, and use “ = ” to indicate that NBBO algorithm. There is no significant difference in the accuracy of other algorithms; “N/A” means that significance judgment cannot be made. From the test results, most of the test values are less than 0.05 and “” is “+,” so zero hypothesis is rejected. Therefore, there are significant differences between the calculation results of IMBBO and BBO, DE, EHO, DEBBO, and BBOPSO, and NBBO is significantly better.

To sum up, the biogeography-based optimization algorithm based on local search and nonuniform variation has good results for most of the benchmark functions, which effectively solves the slow convergence speed of the original BBO algorithm and easy to fall into the local optimization problem in the later stage of the search and greatly improves the convergence speed and search accuracy of the BBO algorithm without adding too many programming steps.

3.2. Iris Classification Based on NBBO Multilayer Perceptron

In order to test the training performance of NBBO multilayer perceptron for iris classification, the mean square error (MSE) of the actual output value of multilayer perceptron and the expected output value is used as an indicator to measure the performance of multilayer perceptron [29]. MSE of all training samples is calculated as follows:where is the number of training samples, is the number of outputs, is the expected output of the input unit under training samples, and is the actual output of the input unit under training samples. The MSE value is smaller is equal to the performance of the multilayer perceptron is better, and vice versa. The HIS of the habitat is calculated by

Figure 4 shows the schematic diagram of a multilayer perceptron based on NBBO algorithm. NBBO algorithm receives the average MSE of all training samples and all expected output samples as the objective function for iterative training and adjusts the link weights and deviations through continuous iterative evolution, so as to provide the optimal connection weight and deviation for iris classification samples after iterative training.

The iris dataset has 150 training samples divided into 3 categories: Setosa, Versicolor, and Virginica, with a total of 4 basic features: sepal length, sepal width, petal length, and petal width. We used the structure of multilayer perceptron as 4-9-3 to classify the problem. Table 7 shows the experimental results of the data set algorithm. Figure 5 shows the average convergence trend and classification accuracy.

Compared with other five iris classification algorithms in Table 7, we can see that the proposed NBBO has the better optimal value, average value, and standard deviation than the other five algorithms. The optimal and average values of the MSE calculated by NBBO are better than other algorithms, which indicates that the optimization performance of the NBBO is the best among the seven algorithms. At the same time, the minimum standard deviation calculated by NBBO also indicates that the NBBO algorithm inherits the strong robustness of the BBO algorithm and is more robust than other algorithms.

Figure 5(a) shows the classification convergence curves of the six algorithms for the iris classification problem, among which the convergence effect of the DE algorithm is the worst of the six algorithms, and the EHO and BBOPSO algorithms fall into the local optimum in the late iteration. BBO algorithm and DEBBO algorithm have similar convergence precision but the convergence speed is not ideal. The proposed NBBO algorithm is optimal in convergence precision and convergence speed. Figure 5(b) shows that DE, EHO, and BBOPSO show significant fluctuations in the classification accuracy of 10 independent operations. The NBBO algorithm has the best stability and average, and its classification accuracy is the highest among the three stable algorithms BBO, DEBBO, and NBBO.

4. Conclusions and Future Work

This paper proposed a novel biogeography-based optimization algorithm based on local migration strategy and nonuniform mutation for iris classification. Firstly, the hyperbolic cotangent transfer model was used to replace the original transfer model to enhance the adaptability of the algorithm. Secondly, the local transfer strategy was used to improve the global search ability of the algorithm. Finally, the nonuniform mutation operation was used to enhance the development ability of the algorithm in the later stage of iteration for the optimal solution in a small range, to improve the ability of the algorithm to exit the local optimum. The simulation results showed that the proposed NBBO has higher stability, better convergence accuracy, and faster convergence speed than other method for iris classification. How to use NBBO algorithm to solve other multiobjective optimization problem compare with multiobjective optimization method [3033] and realize better applications for engineering practice will be the focus of our next work.

Data Availability

The data used to support the findings of this study are available from the corresponding author upon request.

Conflicts of Interest

The authors declare that they have no conflicts of interest.

Acknowledgments

This work was supported by the Natural Science Research Programme of Colleges and Universities of Anhui Province under grant KJ2020ZD39, the Foundation for Talented Young People of Anhui Polytechnic University under grant 2016BJRC008, the Open Research Fund of Anhui Key Laboratory of Detection Technology and Energy Saving Devices, and Anhui Polytechnic University under grant DTESD2020A02.