Abstract

A polygonal approximation method based on the new artificial bee colony (NABC) algorithm is proposed in this paper. In the present method, a solution is represented by a vector, and the objective function is defined as the integral square error between the given curve and its corresponding polygon. The search process, including the employed bee stage, the onlooker bee stage, and the scout bee stage, has been constructed for this specific problem. Most experiments show that the present method when compared with the DE-based method can obtain superior approximation results with less error norm with respect to the original curves.

1. Introduction

Polygonal approximation is a common representation of digital curves [1]. A polygonal representation of a digital curve can significantly reduce the amount of data needed to be processed while at the same time preserving important information about the curve. It can be applied in shape analysis, pattern classification, image understanding, 3D reconstruction, and CAD applications. In recent decades, a number of methods have been proposed for approximation. In general, they can be categorized into three approaches: given the number of breakpoints, , find the approximation with breakpoints such that the approximation error is minimized; given an error , find the approximation with the minimum number of breakpoints that is distant from the given curve by no more than ; and dominant point-detection approaches. In this paper, the present method is categorized into the first approach for polygonal approximation.

Teh and Chin [2] indicated that dominant point detection relies on the determination of a local support region rather than the estimation of discrete curvature. B. K. Ray and K. S. Ray [3] developed a technique to determine the polygon of an object curve by simultaneously maximizing the line length and minimizing the approximation error. Pei and Horng [4] proposed a dynamic programming technique for constructing an approximation using circular arcs. Huang and Sun [5] developed a method based on genetic algorithms for the polygonal approximation of plane curves. Ho and Chen [6] solved the polygonal approximation problem using an efficient evolutionary algorithm with a novel orthogonal array crossover, which is an efficient general-purpose algorithm capable of solving large parameter optimization problems. Yin [7] proposed a polygonal approximation method using an ant colony search algorithm. Sarkar [8] formulated the polygonal approximation task as a minimization problem with ISE as the objective function and then used the differential evolution-based method to solve it. Kolesnikov [9] presented a method based on searching for the shortest path in a feasibility graph for the polygonal approximation of digital curves with a minimum number of approximation segments for a given error bound. Wang et al. [10] proposed an approach, integer particle swarm optimization, for polygonal approximation. Their method found a polygon with the minimum number of vertices whose integral square error is less than a given maximum tolerance.

In recent years, researchers have attempted to apply nature-inspired methods to the problem of pattern recognition and image processing. In this paper, the new artificial bee colony (NABC) algorithm [11] is employed for approximation of digital curves with line segments. The present method can output solutions very close to the optimal one in a relatively short time. The remainder of this paper is organized as follows. In Section 2, the problem is illustrated. In Section 3, the NABC algorithm is described. In Section 4, the NABC algorithm for polygonal approximation is presented. In Section 5, the experimental studies for the comparison of the present method with the DE-based method are shown. Finally, conclusions are given in Section 6.

2. Problem Statement

Given a curve with n clockwise-ordered points, denoting , let and represent the arc and the chord starting at point and continuing to point in the clockwise direction along the original curve . The approximation error of line segment , denoted as , is defined as where is the perpendicular distance of arc point to the chord .

The objective of the polygonal approximation problem is to find a polygon, assuming that the set of its breakpoints is , where , such that the integral square error between the polygon and the given curve is minimized. The integral square error () is defined as follows:

Therefore, there exists () combinations to consider when searching for the breakpoints of the given curve. Generally, this problem is not trivial and always turns into a time-consuming iterative search.

3. The New Artificial Bee Colony (NABC) Algorithm

The artificial bee colony (ABC) algorithm [1214] was proposed by Karaboga for optimizing numerical problems. Karaboga and Ozturk [15] applied it to training feed-forward neural networks to classify nine data sets. Xu and Duan [16] proposed an artificial bee colony algorithm with edge potential function to accomplish the target recognition task for aircraft. Zhang et al. [17] used the ABC algorithm for clustering. Zhu and Kwong [18] proposed the Gbest-guided ABC (GABC) algorithm, which takes advantage of the information of the global best solution to guide the search for new candidate solutions in order to improve the exploitation. A set of numerical benchmark functions were tested, and the results show that the GABC algorithm can outperform the ABC algorithm in most of their experiments. Pan et al. [19] proposed a discrete artificial bee colony (DABC) algorithm to solve the lot-streaming flow shop scheduling problem. Horng [20] proposed the multilevel thresholding algorithm for image segmentation based on the technology of the artificial bee colony algorithm, while Gao et al. [21] proposed a modified ABC algorithm, which is based on each bee searching only around the best solution of the previous iteration in order to improve the exploitation. Draa and Bouaziz [22] used an artificial bee colony algorithm for image contrast enhancement. In their method, the grey levels of the input image are replaced by a new set of grey levels. Talatahari et al. [23] presented a new optimization method based on the artificial bee colony algorithm for solving parameter identification problems. Sarkar et al. [24] presented a perturbed martingale approach to global optimization, and the results show that their method appears to substantively improve upon the performance of the particle swarm optimization method.

The ABC algorithm simulates the intelligent foraging behavior of honey bee swarms. The colony of artificial bees contains three groups of bees: employed bees, onlooker bees, and scout bees. In the ABC algorithm, the first half of the colony consists of employed bees while the second half is constituted of the onlooker bees. A bee associated with a food resource is called an employed bee, while a bee waiting in the dance area to get information concerning food sources is called an onlooker bee. An employed bee whose food source is exhausted by the employed and onlooker bees becomes a scout. The scout bee carries out random searches to discover new sources. The number of food sources, denoted by , is equal to the number of employed bees. The position of a food source represents a possible solution to the optimization problem, while the nectar amount of a food source corresponds to the quality (fitness) of the associated solution. Each solution () is an -dimensional vector. Here, is the number of optimization parameters and is also the number of breakpoints of a polygon in this paper.

In [11], an improved version of ABC, called NABC, was proposed. A set of twelve benchmark functions were used in their experiments. The simulation results showed that the NABC algorithm is significantly better or at least comparable to the original ABC algorithm. In the initialization stage, NABC generates a randomly distributed initial population of solutions (food source positions). The fitness of all solutions is evaluated. Then, a solution pool is constructed by storing the best 100% solutions in the current swarm with .

In the employed bee stage, the original ABC algorithm generates a candidate solution in the neighborhood of its present solution as follows: where is a randomly produced index which is different from , is a random dimension index selected from the set , and is a random number in the range [−1, 1]. In [11], the new ABC/best/1 strategy was proposed in the employed bee stage. The NABC algorithm generates a candidate solution as follows: where is randomly chosen from the solution pool and , are two randomly selected solutions from the current swarm with .

In the onlooker bee stage, an artificial onlooker bee chooses a food source depending on the probability value associated with that food source, , calculated by the following expression: where is the fitness value of the solution . In the scout bee stage, if a solution is abandoned, then the scout bee discovers a new food source to replace .

The difference between the NABC algorithm and the original ABC algorithm is that the NABC algorithm constructs a solution pool and a candidate solution vi is generated by using (4) instead of (3). In the NABC approach, bees can search the neighborhood of different best solutions. This can help avoid fast attraction and is helpful for accelerating the convergence speed.

The flowchart of the NABC algorithm is given in Figure 1. The pseudocode of the NABC algorithm is shown in Algorithm 1.

 (1)  // Initialization stage
  Generate the initial population , =
   = 0, =
  set to 1
 (2) Evaluate the fitness of the population
Repeat
 (3) Update the solution pool
 (4) // Employed bee stage
  For each employed bee
    Produce new solution (food source position) by using (4)
    Calculate the fitness value
    If > then is replaced by and is set to 0 else + 1
  End
 (5) Calculate the probability values by using (5) for the solutions ( = )
 (6) // Onlooker bee stage
  For each onlooker bee
    Select a solution depending on
    Produce new solution (food source position) by using (4)
    Calculate the fitness value
    If > then is replaced by and is set to 0 else + 1
  End
 (7) Memorize the best solution so far
 (8) // Scout bee stage
    If there is an abandoned solution ( > ) for the scout then replace it with
    a new solution which will be randomly produced
     = + 1
Until

4. The Proposed NABC Algorithm for Polygonal Approximation

In this section, the representation of solutions and the fitness function are described, and the search process of the NABC algorithm for accomplishing the task of polygonal approximation is also presented.

4.1. Representation of Solutions

In the initialization stage, the solution () is generated, where , is the real number, and the for all . Then, the value of cycle is set to 0 and the trial number of each solution , , is equal to 0.

4.2. Fitness Function

The solution corresponds to a polygon and the set of its breakpoints is , where = floor, . First, the integral square error (ISE) is computed by using (2). Then, the fitness value can be calculated by the following expression:

4.3. The Search Process
4.3.1. The Employed Bee Stage

In this stage, the ith () employed bee produces a new solution by using (4) and computes the fitness value of the new solution. To ensure that , the elements of each solution vector are sorted in ascending order of magnitude. If the fitness of the new solution is higher than that of the old solution , the solution is replaced by and is set to 0; otherwise, the old solution is kept and is increased by 1.

4.3.2. The Onlooker Bee Stage

First, the probability value is calculated by using (5) for the solution . Each onlooker bee which selects a solution depending on also produces a new solution by using (4). Then, it applies the greedy selection process between and . As described in Section 4.3.1, if the fitness of the new solution is higher than that of the old solution , the solution is replaced by and is set to 0; otherwise, the old solution is kept and is increased by 1.

4.3.3. The Scout Bee Stage

As described above, if the solution is not improved through the employed stage and the onlooker bee stage, the value of solution will be increased by 1. If the of solution is more than the parameter “limit,” the solution is considered to be an abandoned solution; meanwhile, the employed bee will be changed into a scout. The scout randomly produces the new solution to replace . Then, the value of is set 0, and this scout is changed into an employed bee.

4.3.4. Check the Termination Criterion

If the cycle is greater than the maximum cycle number (MCN), then the algorithm is finished and the best solution found so far is output.

5. Experimental Results

The present method is experimentally tested and compared with the DE-based method [8]. The program is coded in C language and run on a PC with a Pentium 4 CPU. The control parameter that is the number of food sources, which is equal to the number of employed bees or onlooker bees (SN), is set to 100, the value of ‘‘limit’’ is set to 50, the maximum cycle number (MCN) is set to 600, and the parameter p for constructing a solution pool is set to 0.06 in the experiments.

The DE algorithm is an evolutionary algorithm for solving numerical problems. The NABC algorithm is a novel swarm intelligent algorithm inspired by the foraging behaviors of a honeybee colony. The two methods have attracted the attention of researchers and have been widely used in solving engineering optimization problems. The main difference between the two methods is that they adopt different strategies to produce new solutions. The two methods start with an initial population of solutions. In the DE algorithm, new solutions are generated through the operations of mutation, crossover, and selection. In the NABC algorithm, as shown in Figure 1, new solutions are generated after performing a solution pool construction, employed bee stage, onlooker bee stage, and scout bee stage.

In order to verify the performance of the present method, four digital curves, namely, a figure-of-eight curve (see Figure 2(a)), a chromosome-shaped curve (see Figure 3(a)), a leaf-shaped curve (see Figure 4(a)), and a semicircle-shaped curve (see Figure 5(a)), are applied to be tested. The simulation conducted twenty independent runs. The best solutions, average solutions, and the standard deviations (S.D.) of the solutions are reported. The simulation results for the figure-of-eight curve, the chromosome-shaped curve, the leaf-shaped curve, and the semicircle-shaped curve are listed in Tables 1, 2, 3, and 4, respectively. For the average solutions, the present method outperforms the DE-based method. For the best solutions, it is shown that the present method produces better polygonal approximations than the DE-based method in most cases. In addition, the low S.D. reveals that the present method has higher stability than the DE-based method.

In Figures 25, Parts (b)–(d) show the best results obtained by the present method with different numbers of breakpoints employed. The computation times for the four digital curves are listed in Table 5. The present method can output solutions immediately because the processing time of all cases is less than 1 second in the experiments.

To demonstrate the applicability of the present method, it is applied to the screwdriver-shaped curve (see Figure 6(a)) and the fish-shaped curve (see Figure 7(a)). The maximum cycle number (MCN) is set to 1,000 for these two relatively larger curves. For the screwdriver-shaped curve, the point number of this curve is 267. Figures 6(b) and 6(c) show the best approximated polygon obtained through the present method with breakpoint numbers of 40 and 45, respectively. The computation times for Figures 6(b) and 6(c) are 2.547 s and 2.656 s, respectively. For the fish-shaped curve, the point number is 537. Figures 7(b) and 7(c) show the best approximated polygon obtained through the present method with breakpoint numbers of 50 and 60, respectively. The computation times for Figures 7(b) and 7(c) are 4.671 s and 4.891 s, respectively. The results of the approximation are tabulated in Table 6. It is seen that the present method can also give good approximation results.

6. Conclusions

A new approach based on the new artificial bee colony (NABC) algorithm has been developed for approximation of digital curves with line segments. Given the number of breakpoints, the algorithm finds an approximation with the minimum integral square error. Consequently, the NABC algorithm is a new population-based method and it has the advantages of finding the global or near-global minimum of a multimodal search space, having fast convergence, being simple and flexible, and using few control parameters. The present method simulates the behavior of honey bees foraging to develop the algorithm to select adequate breakpoints for polygonal approximation.

The method has been tested with four different well-known curves. The present method is efficient in dealing with the application because the processing time of all cases is less than 1 second. Compared with the DE-based method, the present method can obtain better approximation results in most cases for both average solutions and best solutions. For the screwdriver-shaped curve and the fish-shaped curve, the present method can also obtain good approximation results. From these experiments, these results have demonstrated the feasibility of the present method for polygonal approximation of digital curves with line segments.

Conflict of Interests

The author declares that there is no conflict of interests regarding the publication of this paper.