A Method for Estimating View Transformations from Image Correspondences Based on the Harmony Search Algorithm

Cuevas, Erik; Díaz, Margarita

doi:https://doi.org/10.1155/2015/434263

Computational Intelligence and Neuroscience

On this page

Abstract Introduction Experimental Results Conclusions References Copyright Related Articles

Special Issue

Fusion of Computational Intelligence Techniques and Their Practical Applications

View this Special Issue

Research Article | Open Access

Volume 2015 | Article ID 434263 | https://doi.org/10.1155/2015/434263

A Method for Estimating View Transformations from Image Correspondences Based on the Harmony Search Algorithm

Erik Cuevas¹and Margarita Díaz²

Academic Editor: Rahib H. Abiyev

Received30 Sept 2014

Accepted12 Dec 2014

Published03 Aug 2015

Abstract

In this paper, a new method for robustly estimating multiple view relations from point correspondences is presented. The approach combines the popular random sampling consensus (RANSAC) algorithm and the evolutionary method harmony search (HS). With this combination, the proposed method adopts a different sampling strategy than RANSAC to generate putative solutions. Under the new mechanism, at each iteration, new candidate solutions are built taking into account the quality of the models generated by previous candidate solutions, rather than purely random as it is the case of RANSAC. The rules for the generation of candidate solutions (samples) are motivated by the improvisation process that occurs when a musician searches for a better state of harmony. As a result, the proposed approach can substantially reduce the number of iterations still preserving the robust capabilities of RANSAC. The method is generic and its use is illustrated by the estimation of homographies, considering synthetic and real images. Additionally, in order to demonstrate the performance of the proposed approach within a real engineering application, it is employed to solve the problem of position estimation in a humanoid robot. Experimental results validate the efficiency of the proposed method in terms of accuracy, speed, and robustness.

1. Introduction

The goal of estimating geometric relations in images is to find an appropriate global transformation to overlay images of the same scene taken at different viewpoints. It can be applied in image processing when an object moves in front of a static camera and when a static scene is captured by a moving camera or multiple cameras from different viewpoints. This methodology has been widely adopted in many applications, for instance, when series of images can be stitched together to generate a panorama image [1–3]. Also, multiple image superresolution approaches can be applied in the overlapped region calculated according to the estimated geometry [4–6]. The motion of a moving object can also be estimated using its geometric relations [7] and a distributed camera network can be calibrated, where each camera’s position, orientation, and focal length can be calculated based on their correspondences [8–10]. Another example is the robot position that can be controlled or estimated through the estimation of the fundamental matrix/homography [11–13].

In a modelling problem, those data that can be explained by the hypothetical model are known as inliers of this model. Other points, for example, those generated by matching errors, are called outliers. The outliers are caused by external effects not related to the investigated model. Based on different criteria, several robust techniques have been proposed to identify points as inliers or outliers, being the random sampling consensus (RANSAC) algorithm [14] the most well known [15–17].

RANSAC adopts a simple hypothesize-and-evaluation process. Under such approach, a minimal subset of elements (correspondences) is sampled randomly, and a candidate model is hypothesized using this subset. Then, the candidate model is evaluated on the entire dataset separating all elements from the dataset into inliers and outliers, according to their degree of matching (error scale) to the candidate model. These steps are iterated until there is a high probability that an accurate model could be found during iterations. The model with the largest number of inliers is considered as the estimation result.

Although RANSAC algorithm is simple and powerful, it presents two main problems [18, 19]: the high consumption of iterations and the inflexible definition of its objective function. In the RANSAC algorithm, candidate models are generated by selecting data samples. Since such a strategy is completely random, a large number of iterations are required to explore a representative subset of noisy data and to find a reliable model that could contain the maximum number of inliers. In general terms, the number of iterations is strongly affected by the contamination level of the dataset. The other crucial issue is the objective function to evaluate the correctness of a candidate model from contaminated data. In the RANSAC methodology, the best estimation result is the model that maximizes the number of inliers. Therefore, the objective function involves the count, one by one, of the number of inliers associated with a candidate model. Such an objective function is fixed and prone to obtain suboptimal models under different circumstances [19].

Several variants have been proposed in order to enhance the performance of the RANSAC method. One example constitutes the approach MLESAC [20] which searches the best hypothesis by maximizing the likelihood via the RANSAC process by assuming that the inlier data would distribute as a Gaussian function and outliers are distributed randomly. Alternatively, instead of giving the error scale (i.e., the threshold to separate inliers from outliers) a priori, the SIMFIT method [21] proposes its prediction based on an iterative procedure. Other representative works, such as the projection-pursuit method [22] and TSSE (two-step scale estimator) [23], employ the mean shift technique to model the inlier distribution and obtain an inlier scale. Such approaches enables RANSAC to be data-driven; however, the whole process becomes quite time consuming.

Although all the proposed variants allow solving one of the two main RANSAC problems, the other challenge still remains. Such situation comes from the fact that the estimation process is approached as an optimization problem where the search strategy is a random walking algorithm while the objective function is fixed to the number of inliers associated with the candidate model. In order to overcome the typical RANSAC problems, we propose to visualize the RANSAC operation as a generic optimization procedure. Under this point of view, a new efficient search strategy can be added for reducing the number of consumed iterations. Likewise, it can be defined as a new objective function which incorporates other elements that allow an accurate evaluation of the quality of a candidate model.

Two important difficulties in selecting a search strategy for RANSAC are the high multimodality and the complex characteristics of the estimation process produced by the elevated contamination of the dataset. Under such circumstances, classical methods present a bad performance [24, 25], making way for recent new approaches that have been proposed to solve complex and ill-posed engineering problems. These methods include the application of modern optimization techniques such as evolutionary algorithms and metaheuristic techniques [26, 27] which have delivered better solutions over those obtained by classical methods.

The harmony search algorithm (HS) introduced by Geem et al. [28] is one example of these approaches. HS is an optimization algorithm based on the metaphor of the improvisation process that occurs when a musician searches for a better state of harmony. The HS produces a new candidate solution from all existing solutions. In HS, the solution vector is analogous to the harmony in music, and its generation schemes are analogous to musician’s improvisations. With regard to other metaheuristics in the literature, HS imposes fewer mathematical prerequisites; therefore, it can be easily modified for solving several sorts of engineering optimization challenges [29, 30]. Numerical comparisons have established that the convergence of HS is faster than GA [29, 31, 32]. Such a fact has attracted the attention of the evolutionary computation community. It has been effectively applied to solve a wide range of practical optimization problems such as structural optimization [33], parameter estimation of the nonlinear Muskingum model [34], design optimization of water distribution networks [35], vehicle routing [36], image segmentation [37], and circle detection in images [38].

Although HS allows identifying promising regions at the solution space within a reasonable time interval, it underperforms in local searching, in particular for parameter identification applications [39–42]. In order to enhance the fine-tuning (accuracy) properties of HS, the local search parameter (BW) is dynamically adjusted to improve the balance between exploration and exploitation during the search process (see [29]). However, considering that the adjustment follows an exponential function, longer exploitation periods are allowed, affecting the exploring capacity of HS particularly when it is applied to complex objective functions. A better adjustment alternative, which employs the use of a linear model, has been recently proposed in [43]. It presents better searching capacities than the approaches based on exponential functions. For this reason, such an approach is used in our method.

In this paper, a new method is presented for the robust estimation of multiple view relations from point correspondences. The approach combines the RANSAC method with the HS. Upon such combination, the proposed method adopts a different sampling strategy in comparison to RANSAC to generate putative solutions. Under the new mechanism, new candidate solutions are built iteratively by considering the quality of models generated by previous candidate solutions, rather than relying over a pure random selection as it is the case of RANSAC. Likewise, a more accurate objective function is incorporated to accurately evaluate the quality of a candidate model. As a result, the proposed approach can substantially reduce the number of iterations still preserving the robust capabilities of RANSAC. The method is generic and its use is illustrated by the estimation of homographies, considering synthetic and real images. Additionally, in order to demonstrate the performance of the proposed approach in a real engineering application, it is employed to solve the problem of position estimation of a humanoid robot. Experimental results validate the efficiency of the proposed method in terms of accuracy, speed, and robustness.

The paper is organized as follows. Section 2 explains the problem of image matching considering multiple views. Section 3 introduces the fundamentals of the RANSAC method. Section 4 explains the harmony search algorithm while Section 5 presents the proposed approach. Section 6 exhibits the experimental set and its performance results. Section 7 exposes a robotic application of the proposed approach. Finally, Section 8 establishes final conclusions.

2. View Relations from Point Correspondences

The problem of image matching consists in finding a geometric transformation that maps one image of a scene to another image taken from a different point of view. To determine the correspondence among points, it is necessary to find corresponding points on both images. Such point pairs can be obtained as a result of applying an automatic algorithm of detection and matching [44, 45]. The detected points are described by vectors of parameters (descriptors), and frequently these parameters do not allow discriminating one point from another with complete certainty. As a result, an erroneous matching about the correspondence of points located on different parts of different images may emerge.

In this section the geometric relations of points between two views are discussed, considering the case of homography.

Assume that there is a collection of pairs of the corresponding points that are found on two images where and are the positions of points in the first and second images, respectively.

Two perspective images can geometrically be linked through a plane of the scene by a homography (see Figure 1). This projective transformation relates corresponding points of the plane projected into two images by or . The homography across two views can be computed by solving a linear system from a set of four point matches [46]. The quality of the estimated homography is evaluated by considering the distance between the position of the point calculated with the help of the matrix and the actually observed position. Therefore, the mismatch error produced by the -correspondence () is defined as the sum of squared distances from the points to their estimated positions: where and correspond to the errors produced in the first and second images, respectively.

Figure 2 shows the error evaluation process of for a particular example which involves five correspondences distributed in both views. In the example, the correspondence presents a considerable error evaluated as the distances between the points and their positions calculated with the use of the matrix .

3. Random Sampling Consensus (RANSAC) Algorithm

The goal of RANSAC is to estimate the geometric transformation (the homography ) from image correspondences over two views. Potentially there are a significant number of mismatches amongst the correspondences. Correct matches will obey the homography transformation. Therefore, the aim is to obtain a set of inliers consistent with the homography transformation by using a robust technique. In this case outliers are points inconsistent with the homography transformation. In order to solve such a problem, the RANSAC algorithm has proven to be the most successful [15–17].

RANSAC solves the problem of model parameters estimation by finding the best hypothesis among the set of all possible hypotheses generated by the source data. Such source data are typically contaminated by noise. In order to build the hypothesis about the unknown parameters, a sample of the minimum size () required for model estimation is obtained (e.g., a sample of only two points is sufficient to calculate a straight line, , and of four to obtain a homography, ). Under this consideration, the probability of finding an outlier is reduced. Considering that the number of elements contained in a sample is small, the amount of possible samples that can be generated from the complete source data is enormous. Under such circumstances, the exhausting testing of all samples for a reasonable time is impossible. RANSAC faces such problem because it only considers samples which are randomly selected and evaluated. Algorithms of the RANSAC family consist of iterations of the following cycle.(1)Construct a sample consisting of different elements.(2)Build the hypothesis based on the sample .(3)Evaluate the degree of agreement of the hypothesis with the set of all source data .

After the construction and evaluation of all hypotheses, the hypothesis with the best degree of agreement is chosen among them. It is considered as a robust estimate of the model parameters. Such operation can be described as follows: The maximization of the degree of agreement (number of inliers) is equivalent to the minimization of the penalty function whose value depends on the number of outliers. Therefore, the degree of agreement is computed as follows: where is a permissible error, is the number of elements contained in the source data , and is the quadratic error produced by the th data considering the hypothesis . In the context of this paper, corresponds to which represents the error produced by the th correspondence.

The hypothesis with a minimum penalty (i.e., with the maximum degree of agreement) is chosen as the best matching criterion. In the original scheme of RANSAC, the quality of a hypothesis is defined as the number of inliers. For a given value of the permissible error , the point that produces the error is regarded to be an inlier of if its value does not exceed the threshold ; otherwise the point is regarded as an outlier.

In the RANSAC algorithm, the optimal hypothesis is found and the penalty is minimized by using a search strategy of random walking; therefore many attempts are necessary to investigate in sufficient detail the space of possible samples and to find the sample for which the hypothesis has the greatest degree of agreement on the source data. The number of iterations and thus the time spent for the search can be reduced by choosing points according to some directed rules, rather than randomly. Optimization algorithms can be considered as a robust scheme in contrast to the random search [47]. In an optimization algorithm, new candidate solutions are generated in accordance to the information obtained from past candidate solutions.

In this paper, we propose a different approach based on the HS as optimization algorithm. The goal is to demonstrate that the new method, by combining the idea of testing minimum-sized samples with the directed search inspired by the improvisation process that occurs when a musician searches for a better state of harmony, allows performing an efficient search among the correspondences to generate models of higher quality. It is also shown that the number of inliers found by the new method with the use of a fixed number of samples is significantly greater than the number of inliers determined by the family of algorithms based on RANSAC.

4. Harmony Search Algorithm

In the basic HS, each solution is called a “harmony” and is represented by an -dimension real vector. An initial population of harmony vectors are randomly generated and stored within a harmony memory (HM). A new candidate harmony is thus generated from the elements in the HM by using a memory consideration operation either by a random reinitialization or a pitch adjustment operation. Finally, the HM is updated by comparing the new candidate harmony and the worst harmony vector in the HM. The worst harmony vector is replaced by the new candidate vector in case it is better than the worst harmony vector in the HM. The above process is repeated until a certain termination criterion is met. The basic HS algorithm consists of three basic phases: HM initialization, improvisation of new harmony vectors, and updating of the HM. The following discussion addresses details about each stage.

4.1. Initializing the Problem and Algorithm Parameters

In general, the global optimization problem can be summarized as follows: min , , where is the objective function, is the set of design variables, is the number of design variables, and and are the lower and upper bounds for the design variable , respectively. The parameters for HS are the harmony memory size, that is, the number of solution vectors lying on the harmony memory (HM), the harmony-memory consideration rate (HMCR), the pitch adjusting rate (PAR), the distance bandwidth (BW), and the number of improvisations (NI) which represents the total number of iterations. It is obvious that an adequate selection for HS parameters would enhance the algorithm’s ability to search for the global optimum under a high convergence rate.

4.2. Harmony Memory Initialization

In this stage, initial vector components at HM, that is, HMS vectors, are configured. Let represent the th randomly generated harmony vector: for and , where rand(0,1) is a uniform random number between 0 and 1. Then, the HM matrix is filled with the HMS harmony vectors as follows:

4.3. Improvisation of New Harmony Vectors

In this phase, a new harmony vector is built by applying the following three operators: memory consideration, random reinitialization, and pitch adjustment. Generating a new harmony is known as “improvisation.” In the memory consideration step, the value of the first decision variable for the new vector is chosen randomly from any of the values already existing in the current HM, that is, from the set . For this operation, a uniform random number is generated within the range . If is less than HMCR, the decision variable is generated through memory considerations; otherwise, is obtained from a random reinitialization between the search bounds . Values of the other decision variables are also chosen accordingly. Therefore, both operations, memory consideration and random reinitialization, can be modelled as follows: Every component obtained by memory consideration is further examined to determine whether it should be pitch-adjusted. For this operation, the pitch adjusting rate (PAR) is defined as to assign the frequency of the adjustment and the bandwidth factor (BW) to control the local search around the selected elements of the HM. Hence, the pitch adjusting decision is calculated as follows: Pitch adjusting is responsible for generating new potential harmonies by slightly modifying original variable positions. Such operation can be considered similar to the mutation process in evolutionary algorithms. Therefore, the decision variable is either perturbed by a random number between and BW or left unaltered. In order to protect the pitch adjusting operation, it is important to assure that points lying outside the feasible range must be reassigned, that is, truncated to the maximum or minimum value of the interval.

4.4. Updating the Harmony Memory

After a new harmony vector is generated, the harmony memory is updated by the survival of the fit competition between and the worst harmony vector , according to its fitness value, in the HM. Therefore will replace and become a new member of the HM in case the fitness value of is better than the fitness value of .

4.5. Computational Procedure

The computational procedure of the basic HS can be summarized as shown in Procedure 1 [18].

Step 1. Set the parameters HMS, HMCR, PAR, BW and NI.
Step 2. Initialize the HM and calculate the objective function value of each harmony vector.
Step 3. Improvise a new harmony as follows:
for ( = 1 to ) do
if ( < HMCR) then
Select randomly a number where
=
if ( < PAR) then
where
end if
if

end if
if

end if
else
, where
end if
end for
Step 4. Update the HM as if
Step 5. If NI is completed, the best harmony vector according to its fitness value in the HM is
returned; otherwise go back to Step 3.

4.6. Dynamical Linear Adjustment of BW

Every metaheuristic algorithm needs to address the issue of exploration-exploitation of the search space. Exploration is the process of visiting entirely new points of a search space whilst exploitation is the process of refining those points within the neighborhood of previously visited locations in order to improve their solution quality.

In HS, the BW parameter controls the local search around HM elements. A large BW value eases the algorithm’s searching at a larger scope, while a small BW value is appropriate for fine-tuning of best solution vectors.

In the standard HS, the BW value is considered as a constant number. However, in this work, the BW value is dynamically adjusted as to favor exploration at early stages while exploitation is reinforced during final stages of the searching process. The adjustment uses a linear model defined as follows: where is the iteration index, while and are the maximum and minimum BW values, respectively. In contrast to exponential adjustment [26], linear models, as the one used in this paper, allow a better balance between exploration and exploitation (fine-tuning) of the search process [40].

Since all candidate solutions are generated by using the HS operators, there is a low probability to be trapped into local minima [48]. HS can effectively handle challenging multimodal optimization problems [49, 50]. Such fact contrasts to well-known genetic algorithms (GA) [51] and particle swarm optimization (PSO) [52] which usually tends to conduct the whole population towards the best candidate solution [53] producing premature convergence.

5. Method for Geometric Estimation Using HS

The estimation of model parameters in algorithms of the RANSAC family is implied to find an optimal sample of length from a set consisting of elements. In the standard scheme, RANSAC uses a random walking algorithm as a search strategy. The idea of the proposed method considers the use of HS to generate samples based on information about their quality, rather than randomness. The quality of a sample, that is, the fitness of a harmony , is defined as the matching degree of the hypothesis that is constructed based on the correspondence numbers coded within .

Considering that the problem consists in estimating the parameters of through a set of different correspondences, the proposed approach can be described as shown in Algorithm 1.

(1) Configuration
(a) Set the parameters HMS, HMCR, PAR, BW_min, BW_max and NI.
(2) Initial population.
(a) Build the harmony memory (HM) where each individual
consists 4 random non-repeating indices from 1 to .
(b) Compute homography (hypothesis ) by using the indices from .
(c) Calculate the fitness value as the matching quality of the constructed homography
considering the whole available data . Such fitness value is calculated by using a
new objective function defined as:
, (A)
where represents the quadratic error produced by the th correspondence
considering the hypothesis whereas is the penalty associated with the mismatch
magnitude. Such error corresponds to the mismatch generated by the evaluation of .
(3) Iterations .
(a) Generate a new harmony (candidate solution) as follows:

for ( = 1 to ) do
if ( < HMCR) then
Select randomly a number where
=
if ( < PAR) then
where
end if
if

end if
if

end if
else
, where
end if
end for
(b) Compute homography by using the indices from .
(d) Calculate the fitness value as the matching quality of the constructed
homography considering the whole available data . Such fitness value is
calculated by using the objective function described in (A).
(e) Update the HM as if
(4) Estimation result
(a) The best estimation consists of the parameters computed by using the indices from
the best element of HMin terms of its affinity, so that .

The proposed approach combines the RANSAC method with the HS adopting a different sampling strategy in comparison to RANSAC to generate putative solutions. Under the new mechanism, at each iteration, new candidate solutions are built taking into account the quality of the models that have been generated by previous candidate solutions, rather than purely random as it is the case in RANSAC.

Since the approach visualizes the RANSAC operation as a generic optimization procedure, different objective functions can be incorporated to accurately evaluate the quality of a candidate model. Although several objective functions can be tested, this work employs the expression in Equation (A).

In contrast to the traditional RANSAC algorithm, the objective function considers two different aims: the number of inliers and the approximation error. The idea is to find the candidate homography that maximizes the number of inliers and simultaneously minimizes the approximation error. Under such circumstances, the obtained estimation represents the solution that presents the best trade-off between both objectives. As a result, the proposed approach can substantially reduce the number of iterations, still preserving the robust capabilities of RANSAC method.

6. Experimental Results

In this section, a comprehensive set of experiments have been conducted to test the performance of the proposed approach. The results are divided into two different categories: (1) effect of the main HS parameters in the estimation results and (2) comparison results over synthetic and real homographies.

In the experiments, three performance indexes are considered: the number of inliers (NofI), the error (), and the number of function evaluations (NFE). The first two indexes assess the accuracy of the solution whereas the last one measures the computational cost.

The number of inliers (NofI) expresses the amount of elements contained in the set of detected inliers. The error () provides a quality measure of the estimated relation. In case of synthetic data, the error is calculated as where is the inlier point calculated by the estimated relation in the -view, is the inlier ground true point, and is the Euclidian distance between the points. Therefore, evaluates the fit of the estimated relation, computed from the noisy data, against the known ground truth points.

In the case of real data the error is assessed from the standard deviation of the inliers. Thus, is computed as follows: where is the quadratic error produced by the th inlier. In the context of this paper, corresponds to which represents the error produced by the th inlier.

The number of function evaluations (NFE) specifies the total number of transformations that have been evaluated by the algorithm until the best estimation has been reached.

6.1. Effect of the HS Parameters

Several parameters define the performance of HS. However, from all of them, the harmony-memory consideration rate (HMCR) and pitch adjusting rate (PAR) are the most important [55]. To study the impact of these parameters, over the performance of HS in the estimation procedure, different values have been tested on the computation of a synthetic homography. Such a homography was generated, in its first view, by using a rectangular pattern of 8 × 6 elements within a 2-dimensional space of . Then, such points were transformed by a random homography and contaminated by normally distributed noise for constructing their correspondences in the second view. A set of outliers was added by selecting randomly data points within the space limits. In the test, the fraction of outliers is of 75%. In order to illustrate the experimental setup, Figures 3(a) and 3(b) exhibit the first and second views, respectively. Considering the correspondence points, the HS-RANSAC algorithm generates the estimation of . In Figure 3(a), the black squares indicate the position in the first view of a point from the second view as a result of the transformation. Likewise, the black squares in Figure 3(b) exhibit the position in the second view of a point from the first view as a result of the transformation.

(a)

(b)

In the experiment, the maximum number of iterations is set to 950. HMS, , , , and are fixed to 50, 10, 1, 0.001, and 5, respectively. The results report the number of inliers (NofI) and the produced estimation error () of HS-RANSAC, averaged over 30 runs, for the different values of HMCR and PAR. In the experiment, the parameter values are modified considering specific interval. HMCR varies from 0.5 to 0.8 whereas PAR changes from 0.1 to 0.4. The results, shown in Table 1, suggest that a proper combination of different parameter values can improve the performance of HS-RANSAC and the quality of the estimations. The best parameter configuration in the experiment is highlighted in Table 1.

After considering the analysis of Table 1, the parameter values for the proposed estimator are defined in Table 2. Once defined, such values have been kept in all experiments reported in this paper.

6.2. Comparison Results over Synthetic and Real Homographies

We have applied the proposed method to estimate homographies on real and synthetic data in order to compare its performance against other estimation algorithms such as the standard RANSAC [14], the MLESAC [20], the SIMFIT method [21], the projection-pursuit algorithm [22], the TSSE [23], and the PSO algorithm (PSO-RANSAC) [54]. The first five approaches are RANSAC-based estimators whose results are broadly known. In all cases, the algorithms are tuned according to the value set which is originally proposed by their own references. However, the PSO method has been included as a reference, only to validate the performance of the HS as an optimization approach.

In order to conduct a fair comparison between the HS version used in this work and PSO, an enhanced version of PSO has been also chosen with similar characteristics. Therefore, it is used in the comparisons, the PSO version reported in [54]. Such an approach is proposed to mitigate the premature convergence problem of the original PSO method. It incorporates two new elements: (1) a weight factor and (2) a constriction factor . Similar to BW in the HS method, the weight factor is linearly decreased during the algorithm execution to regulate the attraction force towards the best particle seen so far. On the other hand, the constriction factor permits limiting the particle velocities in order to control their trajectories. Under such circumstances, the enhanced PSO version is used in combination with RANSAC considering the following configuration: , , , and whereas the weight factor decreases linearly from 0.9 to 0.2. Additionally, the constriction factor is fixed to 2. Such a configuration presents the best possible performance according to [54].

6.2.1. Homography Estimation with Synthetic Data

This section reports the experimental results corresponding to the estimation of homography matrix considering synthetic data. In the experiments, the same synthetic homography produced in Section 4.1 has been used (see Figure 3). The only difference is that the fraction of the incorporated outliers varies from 0 to 100%.

In the experiment, each algorithm’s execution requires 1000 iterations. Since the proposed HS-RANSAC involves 50 initial evaluations (size of the harmony memory), it requires the execution of only 950 iterations to reach the 1000 evaluations. On the other hand, PSO-RANSAC possesses 10 particles; for this reason 100 generations need to be evolved in order to fulfill the 1000 iterations.

Figure 4 presents the performance for each algorithm. The results present the averaged outcomes obtained throughout 50 different executions. In order to appropriately analyze these results, it is necessary to define the concept of a breakdown point [18]. The breakdown point is identified as the highest outlier ratio from which the algorithm degrades its capacity to find inliers. It can be seen from Figure 4(a) that standard RANSAC has a breakdown point at 40%, the MLESAC at 55%, the SIMFIT method at 70%, the projection-pursuit algorithm at 50%, the TSSE at 45%, and the PSO-RANSAC at 80%. In contrast to such methods, the proposed approach, HS-RANSAC, does not seem to have a prominent breakdown point, since its capacity to detect inliers smoothly degrades. It is also observed that the HS-RANSAC algorithm presents the best performance in terms of the number of inliers (NofI), as it is able to detect most of them. For the estimated , the error (Figure 4(b)) is fairly comparable for all methods until they reach their breakdown points. Nonetheless, the proposed algorithm performed better, being the only algorithm that consistently found the minimum error at all outlier ratios.

(a)

(b)

(c)

In terms of number of function evaluations (NFE), Figure 4(c) shows that the standard RANSAC, the MLESAC, the projection-pursuit algorithm, and the TSSE invest approximately the same number of iterations for reaching their best estimation of . Since such methods use a random walking algorithm as a search strategy, the NFE significantly grows as the number of outliers increases. On the other hand, the PSO-RANSAC and the HS-RANSAC (that use an optimization algorithm as search strategy) maintain a considerably low NFE value with independence of the number of outliers.

From the experiment, it is evident that the use of an optimization approach can considerably reduce the NFE value. However, there is no optimization algorithm suitable to find a good enough estimation considering the high multimodality and complex characteristics of the estimation process which is produced by the elevated contamination of the dataset. Therefore, although the PSO-RANSAC finds its best estimated fundamental matrix investing approximately the same number of evaluations as the HS-RANSAC, such estimated matrix represents only a suboptimal solution. This fact can be observed in Figure 4(b) where it is clear that the PSO-RANSAC algorithm presents higher values in comparison to the HS-RANSAC approach. The reason for this problem points to those operators used by PSO for modifying the individual positions. In PSO, during their evolution, the position of each agent in the next iteration is updated yielding an attraction towards the position of the best particle seen so far. Such behavior shows that the entire population, as the algorithm evolves, concentrates around the best particle, favoring the premature convergence (reaching suboptimal solutions) [53].

6.2.2. Homography Estimation with Real Images

In this section, the experimental results of the estimation of homographies considering real images are reported. To evaluate the estimation performance of the proposed method, Table 3 tabulates the comparative inlier detection performance of the standard RANSAC [14], the MLESAC [20], the SIMFIT method [21], the projection-pursuit algorithm [22], the TSSE [23], the PSO algorithm (PSO-RANSAC) [54], and the proposed HS-RANSAC approach, in terms of the detection rate (DR), the error (), and the number of function evaluations (NFE). The experimental dataset includes 4 images (images A, B, C, and D) which are shown in Figures 5, 6, 7, and 8. Such images contain a determined number of inliers which have been detected and counted by a human expert (, , , and ). Such values act as ground truth for all the experiments. For the comparison, the detection rate (DR) is defined as the ratio between the number of inliers correctly detected by the algorithm (NofI value) and the total number of inliers determined by the expert. The results consider 50 different executions for each algorithm over the four images. Experimental results show that the proposed HS method accomplishes at least a 94.2% of inlier detection accuracy. A close inspection of Table 3 also reveals that the proposed approach is able to achieve the smallest error (), yet requiring a few number of function evaluations (NFE) for most cases.

Figures 5, 6, 7, and 8 also exhibit the results after applying the HS-RANSAC estimator. Such results present the median case obtained throughout 50 runs.

7. Engineering Application: Position Estimation in a Humanoid Robot

Additionally, in order to demonstrate the performance of the proposed approach in a real engineering application, the paper also reports the application of the HS-RANSAC to solve the problem of position estimation of a humanoid robot.

In the last decades, much work has already been accomplished in the area of humanoid robotics [56, 57]. Position determination for humanoid robots is a critical problem, since it is used to control their balance and locomotion. Recently, a notable research [58] has been devoted to achieving better performance in system position for humanoid robots by using sensor fusion methods. In general, integrating information from different sensors increases not only the versatility of the system, but also its cost and complexity. Vision is one of the most studied sensory modalities for position and navigation purposes since it provides rich information of the environment.

The framework of the approach presented in this section, as an application, is a vision system consisting of a fixed camera mounted on a Bioloid© humanoid robot. In the approach, the position of the robot is computed considering the homography estimated by the HS-RANSAC. Therefore, the idea is to calculate the planar motion of the humanoid robot through the estimated homographies. Figure 9 illustrates the process of planar motion calculation.

The homography can be related to camera motion and plane location as follows: where is the distance from the camera to the plane (the height of the humanoid approximately). describes a rotation about the axis and can be expressed as And is a translation vector with the form As the unit normal is , considering the point , the rotation matrix , and the vector (where and are calculated from the homography ), the new planar position can be computed as More details about planar motion based on homography can be found in [59]. The HS-RANSAC algorithm and 11–14 were implemented in a Raspberry Pi. Since the computation must be verified in real time, the number of iterations is fixed to only 150. Figure 10 shows the calculated positions from the homographies estimated during the humanoid locomotion. Such a figure demonstrates that the information of the estimated position adequately reflects the humanoid movement in spite of the reduced number of iterations.

8. Conclusions

In this paper, a new method for robustly estimating homographies from point correspondences based on the evolutionary algorithm has been presented. The approach combines the RANSAC method and the harmony search (HS) algorithm. With the combination, the proposed method adopts an alternative sampling strategy in comparison with RANSAC to build putative solutions. Under the new mechanism, new candidate solutions are generated iteratively by taking into consideration the quality of models produced by previous candidate solutions, instead of relying over a pure random selection as it is the case of RANSAC. On the other hand, a more accurate objective function was incorporated to adequately asses the quality of a candidate model. As a result, the proposed approach can substantially reduce the number of iterations still preserving the robust capabilities of RANSAC.

The proposed approach has been compared to other similar techniques proposed in the literature such as standard RANSAC [14], the MLESAC [17], the SIMFIT method [18], the projection-pursuit algorithm [19], the TSSE [20], and the PSO algorithm (PSO-RANSAC) [52]. The efficiency of the algorithm has been evaluated in terms of the detection rate (DR, NofI), accuracy (), and computational cost (NFE). Experimental results that consider real and synthetic data provide evidence on the remarkable performance of the proposed algorithm in comparison to such methods. Additionally, in order to demonstrate the performance of the proposed approach in a real engineering application, it has been employed to solve the problem of position estimation in a humanoid robot.

Although the experimental results indicate that the proposed method can yield better results on estimating homographies, it should be noticed that the aim of our paper is not intended to beat all the RANSAC methods which have been proposed earlier but to show that the use of evolutionary approaches can effectively serve as an attractive alternative to solve complex optimization problems, yet demanding fewer function evaluations.

Conflict of Interests

The authors declare that there is no conflict of interests regarding the publication of this paper.

Acknowledgment

The present research has been supported under the Grant CONACYT CB 181053. The second author acknowledges the SEP of Mexico for its support (2014-2015).

References

R. Szeliski and H.-Y. Shum, “Creating full view panoramic image mosaics and environment maps,” in Proceedings of the Conference on Computer Graphics (SIGGRAPH '97), pp. 251–258, August 1997.
View at: Google Scholar
Y. He and R. Chung, “Image mosaicking for polyhedral scene and in particular singly visible surfaces,” Pattern Recognition, vol. 41, no. 3, pp. 1200–1213, 2008.
View at: Publisher Site | Google Scholar
M. Brown and D. G. Lowe, “Automatic panoramic image stitching using invariant features,” International Journal of Computer Vision, vol. 74, no. 1, pp. 59–73, 2007.
View at: Publisher Site | Google Scholar
A. Akyol and M. Gökmen, “Super-resolution reconstruction of faces by enhanced global models of shape and texture,” Pattern Recognition, vol. 45, no. 12, pp. 4103–4116, 2012.
View at: Publisher Site | Google Scholar
H. Huang, H. He, X. Fan, and J. Zhang, “Super-resolution of human face image using canonical correlation analysis,” Pattern Recognition, vol. 43, no. 7, pp. 2532–2543, 2010.
View at: Publisher Site | Google Scholar
K. Jia and S. Gong, “Hallucinating multiple occluded face images of different resolutions,” Pattern Recognition Letters, vol. 27, no. 15, pp. 1768–1775, 2006.
View at: Publisher Site | Google Scholar
O. Deniz, G. Bueno, E. Bermejo, and R. Sukthankar, “Fast and accurate global motion compensation,” Pattern Recognition, vol. 44, no. 12, pp. 2887–2901, 2011.
View at: Publisher Site | Google Scholar
E. Montijano and C. Sagues, “Distributed multi-camera visual mapping using topological maps of planar regions,” Pattern Recognition, vol. 44, no. 7, pp. 1528–1539, 2011.
View at: Publisher Site | Google Scholar
J. Su, R. Chung, and L. Jin, “Homography-based partitioning of curved surface for stereo correspondence establishment,” Pattern Recognition Letters, vol. 28, no. 12, pp. 1459–1471, 2007.
View at: Publisher Site | Google Scholar
T. T. Santos and C. H. Morimoto, “Multiple camera people detection and tracking using support integration,” Pattern Recognition Letters, vol. 32, no. 1, pp. 47–55, 2011.
View at: Publisher Site | Google Scholar
H. Zhang and J. P. Ostrowski, “Visual motion planning for mobile robots,” IEEE Transactions on Robotics and Automation, vol. 18, no. 2, pp. 199–208, 2002.
View at: Publisher Site | Google Scholar
C. Sagüés and J. J. Guerrero, “Visual correction for mobile robot homing,” Robotics and Autonomous Systems, vol. 50, no. 1, pp. 41–49, 2005.
View at: Publisher Site | Google Scholar
G. López-Nicolás, J. J. Guerrero, and C. Sagüés, “Visual control of vehicles using two-view geometry,” Mechatronics, vol. 20, no. 2, pp. 315–325, 2010.
View at: Publisher Site | Google Scholar
M. A. Fischler and R. C. Bolles, “Random sample consensus: a paradigm for model fitting with applications to image analysis and automated cartography,” Communications of the Association for Computing Machinery, vol. 24, no. 6, pp. 381–395, 1981.
View at: Publisher Site | Google Scholar | MathSciNet
E. İmre and A. Hilton, “Order statistics of RANSAC and their practical application,” International Journal of Computer Vision, 2014.
View at: Publisher Site | Google Scholar
X. Wu, Q. Zhao, and W. Bu, “A SIFT-based contactless palmprint verification approach using iterative RANSAC and local palmprint descriptors,” Pattern Recognition, vol. 47, no. 10, pp. 3314–3326, 2014.
View at: Publisher Site | Google Scholar
F. Zhou, Y. Cui, Y. Wang, L. Liu, and H. Gao, “Accurate and robust estimation of camera parameters using RANSAC,” Optics and Lasers in Engineering, vol. 51, no. 3, pp. 197–212, 2013.
View at: Publisher Site | Google Scholar
J. Matas and O. Chum, “Randomized RANSAC with $T_{d, d}$ test,” Image and Vision Computing, vol. 22, no. 10, pp. 837–842, 2004.
View at: Publisher Site | Google Scholar
C.-M. Cheng and S.-H. Lai, “A consensus sampling technique for fast and robust model fitting,” Pattern Recognition, vol. 42, no. 7, pp. 1318–1329, 2009.
View at: Publisher Site | Google Scholar
P. H. S. Torr and A. Zisserman, “MLESAC: a new robust estimator with application to estimating image geometry,” Computer Vision and Image Understanding, vol. 78, no. 1, pp. 138–156, 2000.
View at: Publisher Site | Google Scholar
S. B. Heinrich, “Efficient and robust model fitting with unknown noise scale,” Image and Vision Computing, vol. 31, no. 10, pp. 735–747, 2013.
View at: Publisher Site | Google Scholar
R. Subbarao and P. Meer, “Beyond RANSAC: user independent robust regression,” in Proceedings of the IEEE International Conference on Computer Vision and Pattern Recognition Workshop (CVPRW '06), vol. no. 1, p. 101, June 2006.
View at: Publisher Site | Google Scholar
H. Wang and D. Suter, “Robust adaptive-scale parametric model estimation for computer vision,” IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 26, no. 11, pp. 1459–1474, 2004.
View at: Publisher Site | Google Scholar
L. Jun-Hua and L. Ming, “An analysis on convergence and convergence rate estimate of elitist genetic algorithms in noisy environments,” Optik, vol. 124, no. 24, pp. 6780–6785, 2013.
View at: Publisher Site | Google Scholar
E. Mendel, R. A. Krohling, and M. Campos, “Swarm algorithms with chaotic jumps applied to noisy optimization problems,” Information Sciences, vol. 181, no. 20, pp. 4494–4514, 2011.
View at: Publisher Site | Google Scholar | MathSciNet
H. Pan, L. Wang, and B. Liu, “Particle swarm optimization for function optimization in noisy environment,” Applied Mathematics and Computation, vol. 181, no. 2, pp. 908–919, 2006.
View at: Publisher Site | Google Scholar | MathSciNet
H.-G. Beyer, “Evolutionary algorithms in noisy environments: theoretical issues and guidelines for practice,” Computer Methods in Applied Mechanics and Engineering, vol. 186, no. 2–4, pp. 239–267, 2000.
View at: Publisher Site | Google Scholar
Z. W. Geem, J. H. Kim, and G. V. Loganathan, “A new heuristic optimization algorithm: harmony search,” Simulation, vol. 76, no. 2, pp. 60–68, 2001.
View at: Publisher Site | Google Scholar
M. Mahdavi, M. Fesanghary, and E. Damangir, “An improved harmony search algorithm for solving optimization problems,” Applied Mathematics and Computation, vol. 188, no. 2, pp. 1567–1579, 2007.
View at: Publisher Site | Google Scholar | MathSciNet
M. G. Omran and M. Mahdavi, “Global-best harmony search,” Applied Mathematics and Computation, vol. 198, no. 2, pp. 643–656, 2008.
View at: Publisher Site | Google Scholar | MathSciNet
K. S. Lee and Z. W. Geem, “A new meta-heuristic algorithm for continuous engineering optimization: harmony search theory and practice,” Computer Methods in Applied Mechanics and Engineering, vol. 194, no. 36–38, pp. 3902–3933, 2005.
View at: Publisher Site | Google Scholar
K. S. Lee, Z. W. Geem, S.-H. Lee, and K.-W. Bae, “The harmony search heuristic algorithm for discrete structural optimization,” Engineering Optimization, vol. 37, no. 7, pp. 663–684, 2005.
View at: Publisher Site | Google Scholar | MathSciNet
K. S. Lee and Z. W. Geem, “A new structural optimization method based on the harmony search algorithm,” Computers and Structures, vol. 82, no. 9-10, pp. 781–798, 2004.
View at: Publisher Site | Google Scholar
J. H. Kim, Z. W. Geem, and E. S. Kim, “Parameter estimation of the nonlinear Muskingum model using harmony search,” Journal of the American Water Resources Association, vol. 37, no. 5, pp. 1131–1138, 2001.
View at: Publisher Site | Google Scholar
Z. W. Geem, “Optimal cost design of water distribution networks using harmony search,” Engineering Optimization, vol. 38, no. 3, pp. 259–280, 2006.
View at: Publisher Site | Google Scholar
Z. W. Geem, K. S. Lee, and Y. Park, “Application of harmony search to vehicle routing,” American Journal of Applied Sciences, vol. 2, no. 12, pp. 1552–1557, 2005.
View at: Publisher Site | Google Scholar
D. Oliva, E. Cuevas, G. Pajares, D. Zaldivar, and M. Perez-Cisneros, “Multilevel thresholding segmentation based on harmony search optimization,” Journal of Applied Mathematics, vol. 2013, Article ID 575414, 24 pages, 2013.
View at: Publisher Site | Google Scholar | MathSciNet
E. Cuevas, N. Ortega-Sánchez, D. Zaldivar, and M. Pérez-Cisneros, “Circle detection by harmony search optimization,” Journal of Intelligent & Robotic Systems: Theory and Applications, vol. 66, no. 3, pp. 359–376, 2012.
View at: Publisher Site | Google Scholar
A. Kaveh and S. Talatahari, “Particle swarm optimizer, ant colony strategy and harmony search scheme hybridized for optimization of truss structures,” Computers and Structures, vol. 87, no. 5-6, pp. 267–283, 2009.
View at: Publisher Site | Google Scholar
S. Mun and Z. W. Geem, “Determination of individual sound power levels of noise sources using a harmony search algorithm,” International Journal of Industrial Ergonomics, vol. 39, no. 2, pp. 366–370, 2009.
View at: Publisher Site | Google Scholar
S. Mun and Z. W. Geem, “Determination of viscoelastic and damage properties of hot mix asphalt concrete using a harmony search algorithm,” Mechanics of Materials, vol. 41, no. 3, pp. 339–353, 2009.
View at: Publisher Site | Google Scholar
Q.-K. Pan, P. N. Suganthan, J. J. Liang, and M. F. Tasgetiren, “A local-best harmony search algorithm with dynamic sub-harmony memories for lot-streaming flow shop scheduling problem,” Expert Systems with Applications, vol. 38, no. 4, pp. 3252–3259, 2011.
View at: Publisher Site | Google Scholar
Q.-K. Pan, P. N. Suganthan, M. F. Tasgetiren, and J. J. Liang, “A self-adaptive global best harmony search algorithm for continuous optimization problems,” Applied Mathematics and Computation, vol. 216, no. 3, pp. 830–848, 2010.
View at: Publisher Site | Google Scholar | MathSciNet
H. Bay, A. Ess, T. Tuytelaars, and L. V. Gool, “SURF: speeded up robust features,” Computer Vision and Image Understanding, vol. 110, no. 3, pp. 346–359, 2008.
View at: Google Scholar
C. Harris and M. Stephens, “A combined corner and edge detector,” in Proceedings of the 4th Alvey Vision Conference, pp. 147–151, 1988.
View at: Publisher Site | Google Scholar
R. Hartley and A. Zisserman, Multiple View Geometry in Computer Vision, Cambridge University Press, 2nd edition, 2004.
View at: MathSciNet
P. Meer, “Robust techniques in computer vision,” in Emerging Topics in Computer Vision, G. Medioni and S. B. Kang, Eds., pp. 107–190, Prentice Hall, Boston, Mass, USA, 2004.
View at: Google Scholar
E. Valian, S. Tavakoli, and S. Mohanna, “An intelligent global harmony search approach to continuous optimization problems,” Applied Mathematics and Computation, vol. 232, pp. 670–684, 2014.
View at: Publisher Site | Google Scholar | MathSciNet
V. Kumar, J. K. Chhabra, and D. Kumar, “Parameter adaptive harmony search algorithm for unimodal and multimodal optimization problems,” Journal of Computational Science, vol. 5, no. 2, pp. 144–155, 2014.
View at: Publisher Site | Google Scholar | MathSciNet
S. M. Ashrafi and A. B. Dariane, “Performance evaluation of an improved harmony search algorithm for numerical optimization: Melody Search (MS),” Engineering Applications of Artificial Intelligence, vol. 26, no. 4, pp. 1301–1321, 2013.
View at: Publisher Site | Google Scholar
C. Hamzaçebi, “Improving genetic algorithms' performance by local search for continuous function optimization,” Applied Mathematics and Computation, vol. 196, no. 1, pp. 309–317, 2008.
View at: Publisher Site | Google Scholar
J. Kennedy and R. Eberhart, “Particle swarm optimization,” in Proceedings of the IEEE International Conference on Neural Networks, vol. 4, pp. 1942–1948, December 1995.
View at: Google Scholar
B. O. Arani, P. Mirzabeygi, and M. S. Panahi, “An improved PSO algorithm with a territorial diversity-preserving scheme and enhanced exploration-exploitation balance,” Swarm and Evolutionary Computation, vol. 11, pp. 1–15, 2013.
View at: Publisher Site | Google Scholar
M. Clerc and J. Kennedy, “The particle swarm—explosion, stability, and convergence in a multidimensional complex space,” IEEE Transactions on Evolutionary Computation, vol. 6, no. 1, pp. 58–73, 2002.
View at: Publisher Site | Google Scholar
J. Chen, Q.-k. Pan, and J.-q. Li, “Harmony search algorithm with dynamic control parameters,” Applied Mathematics and Computation, vol. 219, no. 2, pp. 592–604, 2012.
View at: Publisher Site | Google Scholar | MathSciNet
A.-L. Lee and J.-H. Kim, “3-Dimensional pose sensor algorithm for humanoid robot,” Control Engineering Practice, vol. 18, no. 10, pp. 1173–1182, 2010.
View at: Publisher Site | Google Scholar
H. Shahbazi, K. Jamshidi, A. H. Monadjemi, and H. Eslami, “Biologically inspired layered learning in humanoid robots,” Knowledge-Based Systems, vol. 57, pp. 8–27, 2014.
View at: Publisher Site | Google Scholar
J. F. Seara and G. Schmidt, “Intelligent gaze control for vision-guided humanoid walking: methodological aspects,” Robotics and Autonomous Systems, vol. 48, no. 4, pp. 231–248, 2004.
View at: Publisher Site | Google Scholar
B. Liang and N. Pears, “Visual navigation using planar homographies,” in Proceedings of the IEEE International Conference on Robotics and Automation (ICRA '02), vol. 1, pp. 205–210, 2002.
View at: Google Scholar

Copyright

Copyright © 2015 Erik Cuevas and Margarita Díaz. This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

PDF Download Citation

Download other formats

Order printed copies

Views

2148

Downloads

1402

Citations