Abstract

Traditional statistical thresholding methods, directly constructing the optimal threshold criterion using the class variance, have certain versatility but lack the specificity of practical application in some cases. To select the optimal threshold for infrared image thresholding, a simple and efficient method based on cloud model is proposed. The method firstly generates the cloud models corresponding to image background and object, respectively, and defines a novel threshold dependence criterion related with the hyper-entropy of these cloud models and then determines the optimal grayscale threshold by the minimization of this criterion. It is indicated by the experiments that, compared with selected methods, using both image thresholding and target detection, the proposed method is suitable for infrared image thresholding since it performs good results and is reasonable and effective.

1. Introduction

Image thresholding converts a gray level image into a binary image, and it is one group of popular and simple methods. Many different techniques have been proposed and developed over the years [14]. Comprehensive overviews and comparative studies of image thresholding can be found in the recent literature [5, 6].

Among these thresholding methods, the most common idea is to optimize some threshold dependent functions, which include the information and properties of the images, as known to all, the statistical image thresholding. The Otsu method, a typical example, has been widely used [7] as one of the best threshold selection methods for general real world images. Based on the Otsu method, many modified methods or other statistical variations had been proposed, such as minimum variance method (Hou for short) [8], standard deviation-based method (Li) [9], and median-based method (Xue) [10]. The method in [11] proposed a cloud model-based framework for range-constrained thresholding and improved four traditional methods, while the method in [12] converted the histogram of image into a series of normal cloud models by cloud transformation, which is as an improvement of Gaussian Mixed Model.

In general, the existing statistical methods have been proven useful and successful in many applications [13]. However, none of them is generally applicable to all images, and different algorithms should be usually not equally suitable for any given particular application. We believe that image thresholding is also an essential part in infrared image tracking system, since target detection is an important problem in infrared image sequences with various cluttered environments, and image thresholding can be usually used to separate candidate targets in the image because of its simplicity and efficiency. Unfortunately, most statistical methods cannot provide satisfied results for infrared image thresholding without considering the practical features, which are not exclusive and lack sufficient attention on specific application prospect. In this sense, the automatic selection of optimum threshold for infrared image is still a challenge.

Almost all of infrared images are with mixture non-Gaussian model, narrow grayscale range, and low-contrast objects, and there are similar statistical properties between classes of objects and background. In addition, small targets to be detected exist in many cases [14]. These features of infrared images are our major concerns. To enforce the weak point of the previous statistical thresholding methods, we propose a cloud model-based approach for infrared image thresholding. Our intentions are twofold: () using cloud models to depict the classes of background and object in a more robust way, () presenting a new statistical threshold criterion related with cloud models to determine the optimal threshold. Different from the existing methods, especially our previous publications [11, 12], the proposed method used and only used cloud model and did not use any existing methods. In other words, cloud model is no longer an assistant tool for existing methods. Cloud model is a cognitive model between a qualitative concept and its quantitative instantiations [1517] and has been used in image thresholding with uncertainty [11, 12, 18]. We have done quantitative and qualitative validation of the proposed approach against several infrared images. Comparison has been made with respect to seven methods, including three traditional state-of-the-art algorithms [1921] and four relative methods [710]. The experimental results, both image thresholding and target detection, demonstrate that our approach is efficient and effective.

The rest of the paper is organized as the following: Section 2 presents an overview of related works. Then, Section 3 proposes a novel cloud model-based algorithm for infrared image thresholding, and the algorithm analysis is also discussed, as well as implementation and computational complexity. Section 4 shows the experimental results, both infrared image thresholding and an application of target detection. Section 5 provides some discussions on the proposal. Finally, the conclusion is drawn in Section 6.

For an image of pixels, each pixel is represented by its grayscale , , and denotes the grayscale level, which is 256 for an 8-bit grayscale image. Then the histogram can be as , , constructed by counting the frequencies of the grey levels. For convenience, we only consider a bi-level thresholding problem for infrared images and suppose dark background is with lower grayscale while bright object is with higher grayscale. Given a threshold , the segmented image would be divided into two classes , where consists of pixels with gray levels , and with gray levels in .

2.1. The Otsu Method

The Otsu method is one of the simple and popular techniques for statistical image thresholding. Otsu’s rule for selecting the optimal threshold can be written aswhere , are the cumulative probability of two classes, that is, background pixels and object pixels , and can be defined asand , are the standard deviations of these classes:in addition, , are the means of these classes:

2.2. The Hou Method

Hou et al. [8] proved that the Otsu method tends to divide an image into object and background of similar sizes and presented an improved method for image thresholding. Hou’s criterion obtains the optimal threshold by minimizing the sum of class variance:

The Hou method overcomes the class probability and the class variance effects using the relative distance and the average distance, but there still exist some disadvantages, such as noise or inhomogeneity.

2.3. The Li Method

Li et al. [9] believed that both Otsu and Hou neglect specific characteristic of practical images and get unsatisfactory segmentation results when applied to those images with similar statistical distributions in both object and background. In other words, for two Gaussian classes with equal variances but distinct sizes or with equal sizes but distinct variances, the Otsu and Hou methods would perform not as perfectly as for two classes with more equal sizes and more equal variances. Aiming at the images with similar distributions in classes of background and object, especially for infrared images, Li improved the weakness of the Otsu and Hou methods and proposed the new criterion related with the minimal standard deviation, which can be rewritten as

2.4. The Xue Method

The above methods in (1), (5), and (6) choose class variance of some form as criterions for threshold determination, while Xue and Titterington [10] argued that when the class distribution is skew or heavy-tailed, or when there are outliers in the sample, the mean absolute deviation from the median is a more robust estimator of location and dispersion than the class variance. Based on the consideration, Xue presented a median-based extension for the Otsu method and stated improving the robustness with the presence of skew or heavy-tailed class-conditional distributions. Xue’s rule can be stated aswhere denote the mean absolute deviations from the median , for two classes, and are defined as

Although Xue’s extension could accomplish more robust performance than that of the original Otsu method, the Xue method seemed not to notice Hou’s motivation originated from the Otsu method, and then it is destined that there are several dissatisfactions if involving some applications, as well as infrared images.

3. The Cloud Model-Based Method

3.1. Preliminaries

Cloud model, proposed by Li et al. [15, 16], is the innovation and development of membership function in fuzzy theory and uses probability and mathematical statistics to analyze the uncertainty [15, 22]. In theory, there are several forms of cloud model, which are successfully used in various applications, including knowledge representation [15, 23], intelligence control [16, 24], intelligent computing [2527], data mining [28], and image segmentation [12, 18]. However, the normal cloud model is commonly used in practice, and the universality of normal distribution and bell membership function are the theoretical foundation for the universality of normal cloud model [16].

Let be a universe set described by precise numbers and let be a qualitative concept related to . Given a number , which randomly realizes the concept , satisfies , where , and the certainty degree of on is as below:then the distribution of on is defined as a normal cloud, and is defined as a cloud drop.

The MATLAB function of the normal cloud generator is included in the supplementary files (available online at http://dx.doi.org/10.1155/2016/1571795) (see Appendix ). The overall property of a concept can be represented by three numerical characters of normal cloud model, expected value , entropy , and hyper-entropy . is the mathematical expectation of the cloud drop distributed in universal set. is the uncertainty measurement of the qualitative concept, and it is determined by both randomness and fuzziness of the concept. is the uncertain measurement of entropy, which is determined by randomness and fuzziness of [16].

It is worth noting that hyper-entropy of a cloud model is a deviation measure from a normal distribution, which is the quantification on how a distribution deviates the Gaussian distribution. For comparison, Wang (Lixin Wang, written personal communication, May 2011) construct a random variable, whose central moments are as close as possible to those of the cloud model. And the mean, the variance, and the third central moment of the constructed variable are equal to cloud model. Therefore, the quantification of the difference between cloud model and Gaussian distribution is achieved in some extents. Accordingly, an accurate quantity on the deviation measure can be obtained, , from the point of view of statistical characteristics, especially the fourth central moment. Hence, the distribution of cloud drops can be regarded as a generalized normal distribution. The details on this property are included in the supplementary files (see Appendix ).

Compared with interval type-2 fuzzy sets widely researched and used [29], cloud model is based on probability and mathematical statistics, its hyper-entropy lets us capture and handle the higher-order uncertainty, and it is equivalent to the secondary grade of Gaussian type-2 fuzzy sets [18], which have been little studied but may be very useful [30].

3.2. The Cloud Model-Based Criterion

Given a threshold , the background pixels can be obtained from the original image . Let the cloud model for the background class be . Considering as the input, three numerical characters would be generated by backward cloud generator [15]. More specifically, the expected value is the grayscale mean of background pixels, and it is formalized aswhere the cardinality of a set is the number of members. Notice that (10) is clearly equivalent to (4).

Next, the entropy is directly related to the first-order absolute central moment from the mean, written as

The derivation on (11) is included in the supplementary files (see Appendix ).

The last parameter, hyper-entropy , can be defined as

Similarly, the corresponding cloud model for object class can be also calculated. We take the original image in Figure 1(a) as a typical example, whose ground-truth image and the grayscale histogram are shown in Figures 1(b) and 1(c). We fix the optimal threshold according to its ground-truth image in Figure 1(b), and then the numerical characters of cloud models for background and object are calculated, respectively, that is, and . Figure 1(d) demonstrates the joint distribution of cloud drops and its certainty degree. Cloud model has depicted the gray level distribution of the sample image, and it is an approximate normal distribution, or a generalized normal distribution, rather than a normal distribution. Furthermore, the grayscale distributions between background and object must not be similar since the shapes of two cloud models are seriously different from Figure 1(d), as well as the ratio of these numerical characters. We will further analyze the difference in another section below.

Once the cloud models are prepared, the remaining is to construct an appropriate criterion for image thresholding. Suppose the cloud model for the background class is and that of the object class is ; our criterion is related to the hyper-entropies , of cloud models , . In an effort to eliminate the above limitation, we propose a novel criterion based on cloud model, which can be formulated as

Then the optimal threshold can be determined as follows:

The segmented result by the proposed method is shown in Figure 2(a), from which one can observe that the cloud model-based method yields the acceptable performance, and the result image is closed with ground-truth image in Figure 1(b). To view the details, the evolution curve of function values is shown in Figure 2(b). With varied grayscale level, the function value increases steadily and achieves the maximum near 150 and then decreases dramatically. At the optimal gray level, the curve is with the minimal value, followed by a slight growth.

It should be noted that the curve reaches max near 150, where the cloud models have little overlap (see Figure 1). In other words, this interval is full of chaos from the perspective of grayscale intensity, and the differences between the hyper-entropy of two cloud models are relatively highest. This means that one class achieves better partition while the other achieves worse. Maybe, these two classes even have similar standard deviation or population size in this condition. Unfortunately, this is not very compatible with the practical features of infrared images. As can be seen from (13) and (14), the new criterion actually attempts to divide an image into two parts with lower and similar hyper-entropy. Therefore, the classes of background and object have the higher intraclass similarity without distractions and certainly the lower interclass similarity. This is intuitively appealing as a good image segment. In ideal cases with two zero hyper-entropies, the two classes will yield bimodal Gaussian distribution, regardless of the population size.

For the image in Figure 1(a), the segmented results by four traditional methods are shown in Figure 3, including Otsu, Hou, Li, and Xue. The result images in Figures 3(a), 3(b), and 3(d) by Otsu, Hou, and Xue almost completely misclassify the object pixels, and the Li method exhibits an acceptable result as shown in Figure 3(c), but there is still suffering from undersegmentation; the background pixels are misclassified into object, as labelled by a yellow rectangle. For reference, each evolution curve of function value is listed in Figure 1. The results by Otsu and Xue are totally wrong. Although the Hou method finds a faulty threshold, it is competitive, and its function value is very closed to the real optimal solution. With the modification of the Hou method, result by Li is more qualified, but the final threshold is still imperfect.

To make an in-depth analysis on features of the sample image, we plot the histograms with -axes on both left and right sides, as shown in Figure 4(a); the grayscale level versus frequency of background with -axis is labelled on the left, while frequency of object is on the right. This image appears as a distinct unimodal grayscale distribution, and the peak ratio between the classes of background and object is very startling, which is about 300 : 1, which is high above tolerable ability of the Otsu, Hou, and Xue methods. Thus, the failure by these methods is a natural consequence. This is also a reason leading to the difference between each pair of parameters of cloud models, as mentioned in previous section.

To observe whether or not the grayscale distributions of two classes satisfy Gaussian form, we use double Gaussian functions to fit the histograms of background and object, respectively, and the fitting error rate is plotted in Figure 4(b). The grayscale distribution of background is more likely a Gaussian form, whose fitting error rate is smaller and acceptable. And the distributions are with difference variances, which is not satisfying the assumption of the Li method. Hence, the just passable result is produced by this method. Besides, this is another reason leading to the difference between cloud models.

Furthermore, the evolution curves of function values by various methods are listed on semi-logarithmic coordinate, as shown in Figure 4(c). For simplicity, we only plot Xue’s curve with omitting the Otsu method, since these two methods achieve similar results and have no essential difference. Additionally, the Li method searches similar standard deviation, while our method searches similar hyper-entropy; then we plot variance difference and hyper-entropy difference for a clear comparison with a close magnitude. A threshold locating in the interval is more preferable. Li’s rule produces , while ours produces obtaining a significant improvement. Of course, the pixels with grayscale value in are not very much. Hence, there seems to be so little difference between the visual results of Li and our method. Therefore, we will verify this point using various images and then further investigate the performance on target detection application, as shown in the following section.

The quantitative comparison is also listed in Table 1, where ME denotes misclassification error and will be explained in Section 4. Our method yields the preferable result with the lowest misclassified pixels and the smallest ME value, since it can provide a more objective representation and accord with actual information of the image. Additionally, the running times by the methods are listed in Table 1. The proposed method, compared with the related methods, is also a competitive one from the perspective of time performance.

3.3. The Cloud Model-Based Algorithm

The proposed algorithm for cloud model-based infrared image thresholding is as shown in Algorithm 1. The method firstly generates the cloud models corresponding to image background and object, respectively, and defines a novel threshold dependence criterion related with the hyper-entropy of these cloud models and then determines the optimal grayscale threshold by the minimization of this criterion.

Input:
 the original image .
Output:
 the optimal threshold and the result image .
()For the original image , obtain the initial information, such as the number of pixels , the grayscale level , and then calculate
 the histogram .
()Initialize the parameters, including the optimal criterion value and the optimal threshold .
; .
()for   to   do
() Obtain the classes of background and object.
  ; .
()if   or   then
()  Eliminate the extreme cases.
   .
()else
()  Calculate the standard deviations , of background and object using (3).
()  if   and   then
()    Generate the cloud models for background and object using (10)–(12).
    ; .
()    Calculate the criterion value according to (13).
    .
()   else
()    Eliminate the extreme cases.
    .
()   end if
()  end if
()  if    then
()   Swap the global optimum for the current best.
   ; .
()  end if
() end for
() Segment the image with the above threshold , denoted as the result image .
  
() return  .
3.4. Implementation and Computational Complexity

In Algorithm 1, there is a for loop with iterations, and each execution costs the time . Theoretically, the time complexity of the proposed algorithm is . However, the proposed algorithm can be easily implemented by modifying the traditional methods, including Otsu and Hou. The efficient implementations for traditional methods are also applied equally to the cloud model-based technique. By comparison, the additional time cost is required to calculate the entropy of two classes, and calculations of the hyper-entropy only execute a subtraction as (12) showed. In sum, the time complexity of the proposed algorithm with efficient implementation is , approximately linear in the grayscale level of the original image. For an image with the size of , the time consumption is usually about 0.1 s in our practice.

4. Experimental Results

4.1. Experimental Setup

To demonstrate the efficiency of the proposed method, we conduct two groups of experiments, and seven methods are involved into comparison, including Kittler and Illingworth [19], Kapur et al. [20], Ramesh et al. [21], Otsu [7], Hou et al. [8], Li et al. [9], and Xue and Titterington [10]. The parameter setting is very important for performance evaluation. For a fair comparison, all are autoparameters or free parameters, but not from a guess. The latter five methods, including the Otsu, Hou, Li, and Xue method, as well as the proposed method, implemented by us, are based on the counts of the histogram, and the only parameter is the number of the bins when calculating the image histogram; we fixed it as 256, which is a conventional rule, so we can consider them as free parameters. The other three methods, including the Kittler, Kapur, and Ramesh method, are performed by the executable file from the public website maintained by Mehmet Sezgin [5] (http://mehmetsezgin.net/otimec.zip), which we cannot modify; these methods can be considered as autoparameters.

The used data includes two groups. The partial data is composed of four images including different objects, and the original images and ground-truth images are shown in Figure 5, named airplane, person, boat, and star, respectively.

The others come from Terravic Motion IR Database and can be registered to download from the website (http://vcipl-okstate.org/pbvs/bench/). We take four images as examples, named irw101, irw102, irin011, and irin012, respectively. The original images and ground-truth images are listed in Figure 6. Half of them are selected from irw10 with one subject entering the field of view (FOV) from the left, and the remaining are from irin01 with monitoring an indoor hallway.

We quantify the performance of these methods by means of misclassification error (ME) [31]. Considering image segmentation as a pixel classification process, the percentage of pixel misclassification is a measure of discrepancy. ME reflects the percentage of background pixels wrongly assigned to foreground and, conversely, foreground pixels incorrectly assigned to background. For the two-class segmentation problem, it can be expressed as [31]where background and foreground are denoted by and for the ground-truth image and by and for the test image. is the number of background pixels rightly assigned to background, and vice versa. ME varies from 0 for a perfectly classified image to 1 for a totally wrongly classified image.

To ensure the highest possible accuracy, the ground truth was created entirely by hand using the GNU image manipulation program (GIMP) (http://www.gimp.org/), and no semiautomatic technique was used. This was also important to avoid potential bias to any algorithmic facet of the procedure used to create it. But it should be noted that creating a 100% pixel accurate ground truth is, in general, impossible, due to the ambiguity in the true positions of the border pixels.

4.2. Comparison on Infrared Image Thresholding

In this group of experiments, we compare the segmentation results on the infrared images. The seven methods are involved into the comparison. The visual comparisons are shown in Figures 7 and 8. Each test image consists of eight components from up to down, the segmentation results obtained by Otsu, Hou, Li, Xue, Kittler, Kapur, Ramesh, and the proposed method, respectively.

The airplane image is relatively simple to segment; however Otsu, Hou, Xue, and Kittler cannot present valid results, and other methods provide good results. The quantitative comparisons are listed in Table 2. Results of the boat image demonstrate similar conclusion with the airplane image. For the person image, Li’s result is unsatisfied. For the star image, all these methods yield no good results, but only our segmented image is the closest to ground-truth image.

Results of the second group images by various methods show the similar performance. For images irw101 and irw102, our method appears with the best results, followed by the Li method. But there are several misclassified blocks scattered in the background, which should impact the performance of the subsequent steps. For the last two images, our method suffers from oversegmentation, and the person is not that complete, and the other methods yield undersegmentation. In summary, by a visual evaluation, the experimental results of these images indicate the proposed method is effective to yield the approximately ideal results.

The visual evaluation is, however, application-independent. For a more detailed portrait, quantitative comparisons of segmentation results yielded by various methods are listed in Table 2. The proposed method outperforms other methods, obtaining the less misclassified pixels and demonstrating the lower ME values. In other words, the proposed method yields better segmentation by a quantitative evaluation.

For the comparison purpose, the processing times of the proposed approach and other methods are also listed in Table 2. From this perspective, the proposed method is similar with the traditional Otsu method, as well as the Hou and the Li method, since the main time costs of all these methods lie in the calculation of the histogram and the scan of the gray level one by one. The time performance of these methods depends on the complexity of the histogram, rather than the image size. As a result, the time cost by each method is quite similar, even the different images by the various methods. It is noted that three methods, including the Kittler, the Kapur, and the Ramesh method, are performed by the executable file, as mentioned above. Thus, the running times of these three methods cannot be obtained. Then the processing times of the methods are omitted in Table 2. According to experimental experience, the involved methods belong to 1-D thresholding and very high efficiency, and the average time costs of these methods are all less than 2 s in our experiments. The comparison between the time cost of the existing methods and ours is not strictly necessary. Even so, the comparison on running time in Table 2 still indicates that the proposed method is efficient because our method runs with less time cost, and the segmented results are acceptable.

In addition, Figure 9 shows a boxplot diagram that summarizes the misclassification error rate statistics of individual thresholding method applied on all selected images. As can be seen, the lowest median is achieved with our method, followed by the Li method and next the Ramesh method. For reference, the detail views of these three methods are also provided. The proposed method has also the smallest interval within IQR of the first/third quartiles ( and ). The Hou and Otsu methods provided, overall, less accurate results. These results demonstrate the high level of competitiveness of our method.

4.3. Results of Infrared Image with Noise

To further investigate the performance of the proposed methods under noisy environment, the irin011 image is involved into comparison. According to Table 2, eight methods gain the closest performance for the clean irin011 image. Hence, the image is used in this subsection. The original image is contaminated by Gaussian noise with zero mean, as well as salt and pepper noise. Each noise tests 20 images; that is, Gaussian noise is with variances in , and salt and pepper noise is with intensities in . Since the noise contamination is a random process, we repeat the process of each variance or intensity 10 times and average the results.

The ME values corresponding to various noises are shown in Figure 10. The proposed method performs lower ME values than the other methods for all the Gaussian noise images. The Kapur method ranks the second, followed by the Ramesh method and the Li method with a little difference. The other four methods, including the Otsu, the Hou, the Xue, and the Kittler method, show a poorer performance to Gaussian noise. On the other hand, the proposed method exhibits an acceptable performance under the condition contaminated by the salt and pepper noise, which is similar with the Otsu, the Hou, and the Li method. The Kittler method comes next, while the Kapur and the Ramesh methods are invalid under this condition. Coincidently, these two methods show a better performance in the above round. Unfortunately, the Xue method is far from satisfactory under the conditions of both Gaussian noise and Salt and pepper noise. In general, the quantitative comparison with various noisy images suggests that our method is more robust to noise, especially under the condition contaminated by the Gaussian noise.

4.4. Application on Infrared Image Target Detection

In this group of experiments, we apply the proposed method into infrared image target detection. By the way, six methods, including Otsu, Hou, Xue, Kittler, Kapur, and Ramesh, have inevitable weakness for infrared images according to the previous section; thus these methods are not included in the comparison. We only compare Li’s results with ours. The infrared image sequence comes from irw10, number 000220 to number 000470. One subject walks across FOV from left to right and is sheltered in two places. The processing steps are () segmenting the images by two methods, () generating the minimum bounding rectangle of object, and () then overlying the original images and converting the result image sequences into videos.

The result videos in this subsection are included in the supplementary files (see Appendix ). Our result presents better performance without any preprocessing or postprocessing, although there exist two images with poor segmentation, where the shelter appears. Of course, we must objectively point out that our method may lead to slight oversegmentation in some cases, which does not affect target detection substantially. Our method mainly aims at the infrared image thresholding, and other postprocessing steps can be introduced into our method to improve the performance if involving target detection. The Li method provides just passable results, but it should suffer undersegmentation in most cases and even seriously degrade the performance. By comparison, our cloud model-based method performs a compelling performance in both image thresholding and target detection.

5. Discussions

In this section, we discuss four problems: one is the relationship between our method and related methods, as well as the theoretical principle or foundation, next is applicability for infrared images, followed by the limitation of the proposed method, and the last is the extension of cloud model and its application.

5.1. Relations with Other Methods

Type-2 fuzzy set is defined by lower membership function and upper membership function. Similarly, cloud model is approximatively related by lower boundary (LB) and upper boundary (UB). But the region bounded by LB and UB is uncertainty, which is determined by . In other words, cloud model has no clear boundary, but it is with a stable tendency (i.e., the expected curve , as shown in Figure 11.

Li’s method is motivated by the isoperimetric graph partitioning [32], in which the intraclass similarities of object and background are measured by the sum of degrees of their corresponding vertices, while the interclass similarity is measured by a cut of the graph. From this perspective, one can also take the hyper-entropy as the intraclass similarity instead of vertices’ degree sum. The standard deviation is a common statistical measure representing degree of deviations between mean and individuals, and is as the second-order or higher-order standard deviation. Therefore, is a higher-order statistical measure representing degree of deviations between mean and individuals. Accordingly, can also be used to measure intraclass similarity of each class in image thresholding. Thus, the cloud model-based method is equivalent to the isoperimetric constant when omitting interclass similarity of object and background, and the proposed criterion is equivalent to Li’s equation in some cases with and , and they are approximate in many cases with smaller and .

For an extreme comparison, we randomly build two simulated data sets, denoted by , . The data in different sets correspond to pixel intensities in different types of virtual image of two groups (i.e., the background and the objects). The size of image , and the number of background pixels is denoted by , while that of object is . The statistical parameters, including the mean, , , the variance, , , and the first-order central moment, , , are listed in Table 3. Since intensities are integers in the range , we round the simulated data into .

The segmented results are shown in Figure 12. With a larger entropy of classes (), the results by our method are clearly different from Li’s results. Our method achieves good results, while Li’s method generates invalid results (completely misclassifying the background or the objects). Li’s method is unexpectedly inferior to the other methods, even though the assumption of the equal size class for these traditional methods (e.g., the Otsu method) is not satisfied.

5.2. Applicability for Infrared Images

We confine the proposed method to infrared image thresholding, and there are two main reasons: () Our new criterion actually attempts to divide an image into two parts with lower and similar hyper-entropy, even higher-order entropy. Thus, the proposed method, especially the statistical criterion, is only suitable for a special histogram from the point of view of mathematical statistics, as discussed in the above subsection. Almost all of infrared images have the statistical properties. Therefore, the proposed method is more suitable for applying to infrared images, since the infrared images have the special features more observably and more directly, especially for those infrared images with small targets. () The cloud model-based method is inspired by the Li method, which is specific for infrared images. Hence, the comparison is more meaningful with the same application. Compared with the Li method, our method improves attention to detail in target detection for infrared images. Thus, the proposed method is also positioned in specific for infrared images.

Of course, our method can be also applied to the gray level images with similar statistical features, since the method is proposed to process the histogram only, but not the type of images. In fact, the proposed method performed well in the above subsection, in which there are simulated data sets, not infrared images. But we still think one cannot find out other types of images with the special features, which is more suitable than the infrared images. In other words, our method would generally achieve better performance by applying to infrared images rather than gray level images.

Theoretically, another statistical criterion can be provided for gray level images which are more appropriate than the infrared images. Unfortunately, this criterion is still open and to be further investigated. Nevertheless, our method does not want to apply to all images. Meanwhile, we cannot do so, since each technique has both strengths and weaknesses.

5.3. The Limitation

In general, our method demonstrates good performance, both efficiency and effectiveness. Nonetheless, each algorithm has its advantages and disadvantages. None is generally applicable to all images; our method is no exception. The proposed method belongs to a special type of statistical approach, and it takes the histogram as the fundamental basis; the division of background and object pixels depends on the statistical feature reflected by the image. The proposed cloud model has a certain capacity of processing uncertainty, when a small number of the object pixels are disguised as the background pixels from the point of view of the grayscale values. But it is never omnipotent, and when the statistical features of the object pixels in images were influenced seriously, the proposed method would suffer the oversegmentation. In extreme cases, the results would be unsatisfied, even wrong. For example, the proposed method currently cannot deal with texture; thus it does not work well if involved into the gray level images with texture, which can be easily processed by the other specific methods. In this view, our method is more specified for the infrared images, since it is difficult to reflect the target texture information by infrared images. Additionally, incorporating semantics into image segmentation and scene understanding is recently a new trend. The lack of introducing semantics information is another limitation of the proposed method, although the statistical method is simple and feasible.

5.4. The Extension

In this paper, the cloud model, nd-order essentially, uses three numerical characteristics to represent the classes of background and objects pixels. Furthermore, cloud model can generate the next-order entropy, which is the higher-order cloud model, if hyper-entropy is not enough to support or represent the statistical properties of a given image. Theoretically, the entropy of cloud model has the ability of an infinite extension. Wang et al. [17] have proposed the th-order generic normal cloud model, which is expressed by numerical characteristics. The recursive definition of the th-order normal cloud is the popularization of the nd-order normal cloud and the normal distribution. The th-order normal cloud has the unimodal and long-tail property, and then it is able to represent a power-law distribution [33]. From normal distribution to power-law distribution, cloud model has flexibility to represent the classes of background and objects in images.

For the application on image thresholding using higher-order cloud model, only one extra step is inserted as preprocessing of the proposed method. A possible way is to search the optimal order from 1 to (the right bound of order) so as to achieve that th-order normal cloud can represent the two classes with the least error, compared with the original histogram. The step is easy to implement, but the searching process of should take the additional time. Theoretically, with a larger , the representing error is less, and the segmented results are more perfect, while the time cost is more. In practice, how to keep a balance between them is really a challenge. From the perspective of time complexity, we only use nd-order cloud model in this paper, which is a special case of the th-order normal cloud model.

6. Summary and Conclusion

In this paper, a new algorithm for infrared image thresholding has been described. We analyze the rationale of the proposed approach. Cloud model is introduced into the representation of image background and objects, as named by cloud model with three parameters . The method possesses only one free parameter, the hyper-entropy , over which a criterion function is evaluated and the optimal threshold can be determined by optimizing the function value. A comparison with seven other thresholding methods has been presented. The results indicate that the proposed method is effective and efficient and highly competitive with other popular methods. Like any other methods, the proposed method maybe also has limitations and suffers from oversegmentation in some cases, which will be further considered and essentially improved in our future research.

Competing Interests

The authors declare that they have no competing interests.

Acknowledgments

This work was partially supported by the National Natural Science Foundation of China under Grant (nos. 61402399, 41501378), by National Key Basic Research and Development Program (no. 2012CB719903), by Foundation for Distinguished Young Teachers in Higher Education of Guangdong, China (no. YQ2014117), and by the Foundation of Humanities and Social Sciences Research in Ministry of Education, China (no. 14YJCZH161).

Supplementary Materials

In Appendix A, we provide some formal definitions and proofs on cloud model, including the formal definitions of cloud model, Equation (11) derived from the cloud model, and the measurement of He derived from the cloud model. In Appendix B, we present supplementary data on video results, and there are two result files with .mp4 file format, that is, “Our.mp4” by our method and “Li.mp4” by Li's method.

  1. Supplementary Material