Quality Prediction of DWT-Based Compression for Remote Sensing Image Using Multiscale and Multilevel Differences Assessment Metric
Accurate assessment and prediction of visual quality are of fundamental importance to lossy compression of remote sensing images, since visual quality is not only a basic indicator of coding performance but also an important guide for optimizing the coding procedure. In this paper, a novel quality prediction model based on a multiscale and multilevel distortion (MSMLD) assessment metric is proposed for DWT-based coding of remote sensing images. Firstly, we propose an image quality assessment metric named MSMLD, which assesses quality by calculating distortions at three levels and at multiple sampling scales between original images and compressed images. The MSMLD metric not only is more consistent with subjective perception values but also reflects the distortion features and visual quality of compressed images well. Secondly, some significant characteristics in the spatial and wavelet domains that correlate well with the MSMLD quality criteria are chosen with multiple linear regression and used to establish a compression quality prediction model for MSMLD. Finally, the quality prediction model is extended to a wider range of compression ratios from 4 : 1 to 20 : 1 and tested experimentally. The experimental results show that the prediction accuracy of the proposed model is up to 98.33%, and its mean prediction error is less than that of state-of-the-art methods.
During the past few decades, several kinds of transforms have been developed for image compression, such as the Karhunen-Loève transform (KLT), discrete cosine transform (DCT), and discrete wavelet transform (DWT). JPEG and other DCT-based coding techniques have been put to use in many space missions, such as Clementine, TRACE, and SPOT-5. DWT offers a highly efficient and flexible way to decompose an image into subbands, and it is a very powerful tool for image coding. DWT is adopted in JPEG2000 [5–7], which provides numerous advantages over JPEG. As one of the DWT-based coding methods, JPEG2000 has been widely applied to remote sensing image compression. For example, in satellites such as BilSAT-1, RASAT, and X-SAT, all image compression payloads are designed with JPEG2000. JPEG2000 is also used for the other remote sensing image compression applications referred to in [8, 9]. In addition, the Consultative Committee for Space Data Systems (CCSDS) published a recommended standard, the CCSDS-IDC Blue Book, which establishes a data compression algorithm consisting of a DWT and a bit-plane encoder. Although there are many methods for remote sensing image coding, DWT-based methods have clearly been dominant in practical applications in recent years.
Predicting image quality at a preencoding stage or without coding is very useful for image compression, especially for DWT-based coding algorithms, because it can be applied to bit allocation, filter bank selection, coding scheme selection, encoding parameter decision, and so forth. PSNR-based quality criteria are most commonly used in quality prediction for wavelet coding [12, 13]. They can predict the compression quality without coding, only by constructing a function between image characteristics and the quality values. Although existing PSNR-based quality prediction methods allow us to evaluate the numerical difference between images for DWT coding, they have two defects when applied to remote sensing images. Firstly, images from different categories (natural, remote sensing, synthetic, compound, medical, etc.) tend to show different spatial domain characteristics, but the existing models are mainly designed and tested with natural scene images instead of remote sensing images. Secondly, the PSNR-based compression quality prediction methods are not consistent with the human vision system (HVS) and do not accurately reflect what the human eye perceives. Thus it is interesting to integrate other quality criteria which have a good consistency with subjective perception values.
In this paper, a quality assessment method based on multiscale and multilevel difference (MSMLD) is first proposed for remote sensing image compression; it has a good consistency with subjective perception values, and its objective assessment results reflect the visual quality of compressed remote sensing images well. After that, by analyzing the correlation between the MSMLD quality criteria and image characteristics in the spatial domain and wavelet domain, a quality prediction model based on MSMLD is designed for DWT coding algorithms. The rest of this paper is organized as follows. In the next section, we review previous works on image compression quality prediction. Section 3 presents our objective image quality assessment metric, called MSMLD, for remote sensing images. In Section 4, a quality prediction model based on MSMLD is proposed and implemented for DWT-based compression. In Section 5, our proposed method is compared with notable existing works. Finally, the conclusions and future work are presented in Section 6.
2. Related Works
An essential aspect of quality prediction is to establish a function between inherent image characteristics and quality assessment criteria; then, only by calculating some values associated with image characteristics, the final image quality can be predicted accurately at a preencoding stage or without coding, instead of after decoding and assessment. Foreseeing the image quality accurately during coding is very important and advantageous for image compression because it makes it possible to adjust the algorithm promptly to get better coding performance with less computational cost. Currently for image compression, PSNR-based methods are most widely used to measure and predict image compression quality at a preencoding stage or without coding, and they are also a focus of research. For example, Koumaras et al. predicted the compression distortion of DCT using the variances of the image. Kourzi et al. achieved the prediction by constructing a wavelet filter space and frequency index. Saha and Vemuri chose the image activity measure (IAM) as the prediction factor and established the IAM-PSNR equation, based on analyzing the relevance between the inherent characteristics of the images and coding performance; it was specific to compression algorithms based on the wavelet transform. On this basis, Li and Wang analyzed the dependency between IAM and PSNR in different functional forms and constructed a prediction model of PSNR based on IAM, so that the prediction accuracy could be improved. In all of these methods, the correlations between inherent image characteristics and image quality assessment are carefully analyzed and widely applied, but most of the research and applications are centered on ordinary scene images instead of remote sensing images.
However, images from different categories tend to show different spatial domain characteristics, and the correlations between inherent image characteristics and image distortion assessment differ in many ways between ordinary scene images and remote sensing images. Wei et al. studied the problem of distortion prediction for remote sensing image compression by analyzing image characteristics in the spatial domain and wavelet domain; some image characteristics significantly related to PSNR were selected and used to establish a distortion prediction model with multiple linear regression to achieve the prediction of image distortion during coding. However, in that work the selection of image characteristics depends too heavily on empirical values, and furthermore PSNR is used as the indicator of distortion prediction, whose ability to evaluate image quality is relatively limited. So that method has inherent defects in distortion assessment and prediction for remote sensing image compression.
Image quality evaluation methods can be divided into two classes: subjective assessments and objective assessments. Because they depend on observers' abilities, display conditions, and scoring standards, subjective assessments by humans tend to be inconvenient, time-consuming, and unstable. Objective assessments judge image quality with algorithms formulated to imitate human subjective perception, that is, by simulating the human vision system. The objective methods are of three kinds: full-reference methods, part-reference methods, and no-reference methods. Full-reference methods are the foundation of the other types and outperform them. The notable full-reference quality metrics are usually classified into three categories as follows. The first category is mathematically convenient metrics; PSNR and MSE are simply formulated and hence computationally efficient, and they have been widely used in remote sensing image compression for a long time [17–19]. However, this kind of metric has an inherent defect: it represents image quality inaccurately, and its evaluation results usually deviate from subjective perception values. The second category is near-threshold psychophysics based metrics. Taking WSNR and NQM for example, these metrics attempt to simulate the perception of image distortion by analyzing human vision sensitivity. However, these metrics are confined by the limitations of vision research. More importantly, the existing research works mostly focus on the feature analysis and evaluation of natural scenery images, which are quite different from remote sensing images with their unique characteristics. The third category is metrics based on overall perception. These metrics avoid simulating the complex human vision system and instead measure the structural distortions of images.
Under the assumption that human visual perception is highly adapted for extracting structural information from a scene, structural similarity (SSIM)  between the distorted and original images was proposed to assess image quality, and it was further extended to the multiscale SSIM (MS-SSIM)  by sliding windows. Recently, VIF was proposed  to adopt a novel visual information fidelity criterion that quantified the Shannon information presented in the distorted image relative to the information presented in the original image. According to , VIF outperforms other quality metrics. As those structure based metrics formulate the assessment functions on the higher regional information rather than pixels, they are more consistent with the HVS but insufficient in evaluating small target distortions.
Although there are plenty of image quality evaluation methods, they are mainly common-oriented for ordinary image test and application. For example, the recently proposed SSIM and VIF are typically common-oriented and state-of-the-art methods and are adopted and used in many applications. Because the remote sensing images are quite different from ordinary images in acquisition mode and image content, the evaluation results often deviate from the subjective interpretation in a way when these common assessment methods are used for remote sensing image compression quality evaluation. Relatively, there are a small number of algorithms devoted to the evaluation of remote sensing image compression and in which distortion characteristics of remote sensing image compression are taken into account partly. Moreover, these methods are mainly focused on low-level image features, such as pixel differences, image variance, and image histogram, using less high-level features, such as image structure and image content . In overall performance term, these specific methods are still behind the SSIM, VIF, and so forth. Therefore, in this paper we propose an image quality metric to reveal image compression distortions and perform more comprehensive evaluation ability for different size distortions of remote sensing image compression.
Above all, in order to get a novel compression quality prediction model that has a good consistency with subjective perception values and for the prediction results to accurately reflect the visual quality of remote sensing compressed images, it is primarily necessary to study a novel image quality assessment metric according to the inherent characteristics of the remote sensing image; then, basing on the novel metric, a prediction model should be established for DWT-based compression to predict image compression quality at a preencoding stage or without coding.
3. Image Quality Assessment Metric Based on Multilevel Difference
3.1. Image Distortion Analysis Based on Multilevel Difference
Images from different types (natural, remote sensing, synthetic, compound, medical, etc.) tend to show different spatial domain characteristics. For example, when a variety of images of different categories are compressed using a fixed wavelet filter and a given compression ratio, it is observed that the PSNR values vary widely as much as dozens of dB from image to image . The HVS research shows that when a remote sensing image is being observed, the perceptual information is not only equal to the information entropy of the pixel, but also affected by its position and surrounding pixels. The information of a pixel not only is correlated with its self-luminance, but also can be significantly affected by its neighbor pixels as shown in Figure 1. Due to the different neighbor pixels, the identical pixel is perceived as with different luminance values.
Optical remote sensing images, especially high resolution ones, are often characterized by typical features, such as numerous small targets, complex edge texture, and uneven distribution of image content. After lossy compression, image distortions such as loss of targets, texture blurring, edge jitter, and block effects may occur. As shown in Figure 2, when analyzing and assessing remote sensing images, a hierarchical difference can be used to reflect the different aspects of image distortions. For example, the pixel changes can reflect the loss of small targets in images; the regional difference can reflect the distorted edges and fuzzy texture in images; the difference of image content can reflect the classification of image characteristics. Finally, the image quality can be comprehensively evaluated by all these differences. Because the current image quality evaluation methods cannot effectively take into account the distortion at different levels and scales, their evaluation results have some limitations, and it is hard to achieve good consistency with subjective perception. Therefore, the key research question for remote sensing image compression assessment is how to comprehensively describe the image distortion at different levels and scales so that objective evaluation results are closer to subjective interpretation.
Based on the above, image quality assessment should be comprehensive descriptions of self-luminance, contexture impact, and pixel feature. In this paper, an image quality assessment method based on multilevel distortions (MLD) is designed to get better consistency with subjective perception values; in the proposed MLD method, differences in pixels, neighborhood, and structure between the original image and the decompressed image are calculated to reveal their inherent relationship with subjective interpretation. The overall flow of the proposed MLD includes three steps. First, all pixels of the original image and the distorted image are classified into three types according to image features. Second, the difference between original image and distorted image is calculated in point level and neighborhood level for each pixel. Finally, evaluation result of MLD is obtained by calculating the weighted distortion for each image pixel.
In the next section, the three levels of distortions will be formulated firstly, and the assessment metrics promoted to multiscale will be presented subsequently.
3.2. Calculation of Multilevel Differences
3.2.1. Calculating the Differences of Images
Let and represent the original image and its distorted image, respectively; and represent the luminance value of pixel in the original and distorted image. The pixel-level distortion and the contexture-level distortion are defined, respectively, as where is a nonzero constant to keep the divisor unequal to zero, and in the paper is set as 0.01 according to other parameters. is the contexture impact of pixel in the original image ; is the contexture impact of pixel in the distorted image . They are defined as where are the contexture pixels of . are the contexture pixels of . is the weight set generated by the 7 × 7 circular-symmetric Gaussian weighting function with standard deviation of 1.5 samples.
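The weighting window described above can be illustrated with a minimal sketch. It assumes that the 7 × 7 Gaussian weights are normalized to sum to one and that the contexture impact of a pixel is the weighted mean of its neighborhood; neither detail is confirmed by the text, and the function names and the border handling (clamping coordinates) are hypothetical choices made only for illustration.

```python
import math


def gaussian_window(size=7, sigma=1.5):
    """Circular-symmetric Gaussian window, normalized to sum to 1 (assumed)."""
    half = size // 2
    raw = [[math.exp(-(dx * dx + dy * dy) / (2.0 * sigma * sigma))
            for dx in range(-half, half + 1)]
           for dy in range(-half, half + 1)]
    total = sum(sum(row) for row in raw)
    return [[value / total for value in row] for row in raw]


def contexture_impact(image, row, col, window):
    """Gaussian-weighted neighborhood mean around (row, col).

    Coordinates outside the image are clamped to the nearest border pixel;
    this border rule is an assumption, not taken from the paper.
    """
    half = len(window) // 2
    height, width = len(image), len(image[0])
    acc = 0.0
    for dy in range(-half, half + 1):
        for dx in range(-half, half + 1):
            r = min(max(row + dy, 0), height - 1)
            c = min(max(col + dx, 0), width - 1)
            acc += window[dy + half][dx + half] * image[r][c]
    return acc
```

With normalized weights, a constant image yields a contexture impact equal to its luminance, so any difference between the contexture impacts of the original and distorted images comes from the neighborhood content alone.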
3.2.2. Classifying the Pixels of Images
When observing a remote sensing image, humans pay more attention to two factors: one is the mutation positions in the image and the other is the distortion at those mutation positions. Therefore, different weights should be assigned to different distortion positions when assessing image quality, and, in order to assess image distortion accurately, all pixels should be divided into variation pixels and smooth pixels. Variation pixels are pixels with large gradient magnitude, such as target edge pixels, texture pixels, and boundary pixels. Smooth pixels are pixels with small gradient magnitude. In this paper all pixels are classified into smooth-maintain pixels and variation-maintain pixels according to Sobel gradient magnitudes compared against a threshold. The threshold value is optimized by objective experiments, as shown in Figure 3. In the figure, the number of variation-maintain pixels gradually decreases as the threshold increases, and in order to make the variation pixels close to subjective variation interpretation, the threshold is set to 0.02 as a tradeoff.
The procedure of classifying the pixels of images can be summarized as follows.
(1) Calculate the gradient magnitudes of the original image and its distorted image with the Sobel operator, respectively.
(2) Set the classifying threshold value (here 0.02) and, in each image, classify the pixels into a smooth pixel set and a variation pixel set according to the threshold. Pixels in the smooth sets have a small gradient magnitude that is less than the threshold. Pixels in the variation sets have a large gradient magnitude that is not less than the threshold.
(3) Refine and further divide the smooth pixels and variation pixels based on the following definitions. If a pixel belongs to the smooth set in both the original and the distorted image, it is classified as a smooth-maintain pixel. Else if it belongs to the variation set in both images, it is classified as a variation-maintain pixel. Otherwise it is classified as a feature-changed pixel.
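The classification procedure can be sketched as follows. The sketch assumes luminance values normalized to [0, 1] (which would make a threshold of 0.02 plausible), a standard 3 × 3 Sobel operator, and clamped borders; the function names and the string labels are hypothetical, and the exact gradient normalization used in the paper is not known.

```python
import math

SOBEL_X = ((-1, 0, 1), (-2, 0, 2), (-1, 0, 1))
SOBEL_Y = ((-1, -2, -1), (0, 0, 0), (1, 2, 1))


def sobel_magnitude(image):
    """Sobel gradient magnitude per pixel; borders are clamped (assumption)."""
    height, width = len(image), len(image[0])
    magnitude = [[0.0] * width for _ in range(height)]
    for r in range(height):
        for c in range(width):
            gx = gy = 0.0
            for dy in (-1, 0, 1):
                for dx in (-1, 0, 1):
                    rr = min(max(r + dy, 0), height - 1)
                    cc = min(max(c + dx, 0), width - 1)
                    gx += SOBEL_X[dy + 1][dx + 1] * image[rr][cc]
                    gy += SOBEL_Y[dy + 1][dx + 1] * image[rr][cc]
            magnitude[r][c] = math.hypot(gx, gy)
    return magnitude


def classify_pixels(original, distorted, threshold=0.02):
    """Label each pixel by comparing both gradient magnitudes to the threshold."""
    grad_o = sobel_magnitude(original)
    grad_d = sobel_magnitude(distorted)
    labels = []
    for row_o, row_d in zip(grad_o, grad_d):
        row = []
        for g_o, g_d in zip(row_o, row_d):
            if g_o < threshold and g_d < threshold:
                row.append("smooth-maintain")      # smooth in both images
            elif g_o >= threshold and g_d >= threshold:
                row.append("variation-maintain")   # edge/texture in both images
            else:
                row.append("feature-changed")      # class differs after coding
        labels.append(row)
    return labels
```

A pixel on an edge that survives compression is labeled variation-maintain, whereas an edge pixel that is flattened by the codec becomes feature-changed, which is the case the comprehensive evaluation treats separately.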
3.2.3. Calculating the Comprehensive Evaluation
In order to get a comprehensive evaluation of remote sensing image compression, an integral calculation of multilevel distortions is necessary after pixel classification. The procedure of calculating the comprehensive evaluation can be summarized as follows.
(1) Classify each pixel in the original image and distorted image as a smooth-maintain pixel, variation-maintain pixel, or feature-changed pixel.
(2) Calculate the evaluation weight of content-level distortion, which is set as the ratio of the smooth pixel number to the variation pixel number.
(3) Calculate the pixel-level distortion and the contexture-level distortion for each pixel, respectively.
(4) Calculate the final image quality evaluation according to formula (3).
3.3. Multiscale Promotion of MLD
The perception of image details depends on the sampling density of image signals, the observation distance, and the observer’s visual perceptual capability. Following Wang et al.’s method, the proposed MLD is extended to a multiscale metric named MSMLD for coarse-granularity applications, as shown in Figure 4. In MSMLD the impacts of varying image resolutions and viewing conditions are taken into account. The main idea is to apply the single-scale MLD to downsampled versions of the original and distorted images.
The procedure of MSMLD can be summarized as follows.
(1) In light of the image size, the number of downsampling levels is calculated from the width and height of the image.
(2) A set of mean filter factors is defined, whose members are determined by the number of downsampling levels.
(3) For every element of this set, the original image and distorted image are filtered and downsampled, using a mean filter whose window size is given by that element, and a downsampled original image and a downsampled distorted image are obtained separately; the width and height of both images are reduced accordingly.
(4) Based on the steps in Section 3.2.3, the image quality evaluation is calculated for each downsampled image pair according to (3).
(5) Steps (3) and (4) are repeated until there is an evaluation value corresponding to each element of the set.
(6) The final image quality assessment value MSMLD is obtained with formula (4).
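The filtering and downsampling loop of this procedure can be sketched as below. The sketch assumes non-overlapping block averaging as the combined mean filter and downsampling step, and it takes the set of factors and the single-scale metric as caller-supplied inputs, since neither the factor set nor the pooling rule of formula (4) can be recovered from the text; `mean_downsample`, `multiscale_scores`, and the example factors are hypothetical.

```python
def mean_downsample(image, factor):
    """Average non-overlapping factor x factor blocks (mean filter + decimation)."""
    height, width = len(image), len(image[0])
    reduced = []
    for r in range(0, height - factor + 1, factor):
        row = []
        for c in range(0, width - factor + 1, factor):
            block = [image[r + dy][c + dx]
                     for dy in range(factor) for dx in range(factor)]
            row.append(sum(block) / len(block))
        reduced.append(row)
    return reduced


def multiscale_scores(original, distorted, factors, single_scale_metric):
    """Apply a single-scale metric to each downsampled image pair.

    Returns one score per factor; how the scores are pooled into the final
    MSMLD value is left to the caller because formula (4) is not reproduced here.
    """
    scores = []
    for factor in factors:
        small_o = mean_downsample(original, factor)
        small_d = mean_downsample(distorted, factor)
        scores.append(single_scale_metric(small_o, small_d))
    return scores
```

For instance, calling `multiscale_scores(original, distorted, (1, 2, 4), metric)` with any two-image metric evaluates the pair at full, half, and quarter resolution; the factors (1, 2, 4) are illustrative only.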
3.4. Remote Sensing Image Compression Quality Assessment Verification
To ensure the universality of the test for remote sensing image compression, we randomly select 30 original gray remote sensing images (512-by-512 pixels) from the Internet as a test set, which spans a wide range of important characteristics such as gray level, contrast, image resolution, texture complexity, and target distribution and covers a variety of commonly used scenes in remote sensing images, such as sea, mountain, airport, buildings, and motors. Some of the typical remote sensing images in the test set are shown in Figure 5.
The distorted image is the decompressed image of the original image in the test set. According to the analysis in Section 1, to cover various compression distortions, some of the most commonly used coding methods for remote sensing images, such as JPEG2000, JPEG, HD-PHOTO, BHC [27, 28], and FRACTAL, are chosen to generate distorted images with varying compression ratios. Finally a test image database is built with a total of 750 images (30 images are compressed with 5 algorithms at 5 compression ratios separately). Some of the typical distorted images caused by different coding methods with various compression ratios are shown in Figures 6 and 7.
It is obvious that subjective scores are unstable and may be easily affected by many factors such as observation environment and individual capacity. Hence some rules of observing environment, scoring criteria, and score-processing methods are designed to avoid the random influences and ensure that the subjective scores are more close to human visual perception. After that, subjective quality scores of distorted images are given, and the probability of unreasonable scoring is reduced by pairwise comparison and the distortion sorting. The final mean opinion score (MOS) of the distorted image is determined by the weighted evaluation scores from all interpreters.
The MSMLD algorithms are implemented and compared with state-of-the-art assessment methods such as PSNR , VSNR (visual signal-to-noise ratio) , WSNR , NQM , VIF , SSIM , and MS-SSIM . The sorting of objective quality metrics is transformed to the predicted MOS score through nonlinear regression, denoted by , which is then compared with the actual MOS in the test image database. Nonlinear regression fitting function is selected from video quality expert group (VQEG II)  as where is the original objective score, is the mapping score, and , , , and are parameters from VQEG II. Some correlation factors such as Pearson correlation coefficient (CC), Spearman rank order correlation coefficient (SROCC), root mean square error (RMSE), and mean absolute error (MAE) are used to compare the and MOS. A higher score indicates a better performance of the objective assessment for CC and SROCC; conversely, a lower score is better for RMSE and MAE.
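The four agreement measures named above can be computed as in the following minimal sketch, which uses only the Python standard library. The function names are hypothetical, ties in the Spearman ranks are resolved by average ranks (a common convention, assumed here), and the nonlinear VQEG mapping is not included because its exact form is not reproduced in the text.

```python
import math


def pearson(x, y):
    """Pearson linear correlation coefficient (CC)."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    return cov / math.sqrt(var_x * var_y)


def average_ranks(values):
    """1-based ranks; tied values share the mean of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        shared = (i + j) / 2.0 + 1.0
        for k in range(i, j + 1):
            ranks[order[k]] = shared
        i = j + 1
    return ranks


def spearman(x, y):
    """Spearman rank order correlation coefficient (SROCC)."""
    return pearson(average_ranks(x), average_ranks(y))


def rmse(x, y):
    """Root mean square error between mapped scores and MOS."""
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(x, y)) / len(x))


def mae(x, y):
    """Mean absolute error between mapped scores and MOS."""
    return sum(abs(a - b) for a, b in zip(x, y)) / len(x)
```

CC and RMSE or MAE are meaningful only after the objective scores have been mapped onto the MOS scale, whereas SROCC depends on rank order alone and is therefore unaffected by any monotonic mapping.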
The correlation factors (such as CC, SROCC, RMSE, and MAE) of and MOS are calculated to quantify the assessing performance, and values are shown in Table 1. The results show that the metrics of VIF, MS-SSIM, and MSMLD can obtain a better consistency with MOS, benefiting from higher values in CC and SROCC and lower values in RMSE and MAE. However the metrics of NQM, PSNR, VSNR, and WSNR show a bad consistency with MOS, due to lower values in CC and SROCC and higher values in RMSE and MAE. All indicators show that MSMLD has the best consistency with MOS. The reason why MSMLD has a better consistency with MOS lies in the following three points. Firstly, the NQM, PSNR, VSNR, and WSNR metrics focus on pixel differences without considering contexture impacts. Secondly, VIF and MS-SSIM do not take into consideration the pixel-level distortions. Thirdly, the MSMLD takes into account the multilevel distortions (such as pixel-level, contexture-level, and content-level) and gets a better property in accuracy and integrity of image quality assessment.
The proposed MSMLD provides a way which has better consistency with subjective perception values than current state-of-the-art methods in remote sensing image compression assessment. On the one hand, it can be used as an important tool for image quality assessment. On the other hand, it can also be applied to coding algorithm to predict compression quality. The latter will be discussed further in the following sections.
4. Model of Compression Quality Prediction Based on MSMLD
Although the proposed MSMLD assesses image quality better than other methods, the assessment is computationally heavy, since evaluating compression quality requires performing coding, decoding, and then comparison between the original and decompressed images. Thus it is interesting to study a model which can epitomize the principle of MSMLD and can predict the compression quality at a preencoding stage or without coding. In the following, the design and construction process of a quality prediction model is introduced in four steps. First, choose an outstanding evaluation method and establish an image data set for testing; this was done in the last section. Second, select some proper image characteristics (or functions) as predictive factors for DWT-based compression. Third, establish the relationship model between the evaluation results and predictive factors. Finally, apply the model to a DWT-based coding algorithm to predict compression quality accurately.
4.1. Determination of Predictive Factors for DWT-Based Coding
In order for a predictive model to accurately reflect the compression quality of remote sensing images with DWT-based coding, it is necessary to analyze the factors that affect compression distortion. DWT-based coding consists of three basic steps, as shown in Figure 8. First, DWT offers a highly efficient and flexible way to decompose and decorrelate the original image and produces a new representation in the wavelet domain that supports more efficient coding. Second, the quantization step restricts wavelet coefficients to a limited series of values; that is, it removes information considered to be redundant or useless. The loss is generated in this irreversible process. Third, the encoding step assigns to each quantized coefficient a code as short as possible, and bit allocation is weighed and balanced across the different subbands of the image according to a given compression ratio (CR). Image decompression follows the inverse procedure.
The nature of an image histogram provides many clues to the character of the image. We have used a set of gray level image statistics and have analyzed various images to see if any particular image feature is mainly responsible for the variation in MSMLD values among various images. Therefore, a set of image characteristics that are commonly used in DWT-based methods are chosen as candidates to establish a quality prediction model of MSMLD, including image gray level , image contrast , and image activity measure . They are defined as where and are the width and height of the image, respectively, and represent the luminance value of pixel in the image.
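The three spatial-domain candidates can be sketched as follows. Since the defining equations are not reproduced here, the sketch assumes that the gray level is the mean luminance, that contrast is the population standard deviation of luminance, and that the image activity measure takes the commonly cited Saha and Vemuri form (sum of absolute vertical and horizontal neighbor differences divided by the number of pixels). The function name `spatial_features` is hypothetical.

```python
import math


def spatial_features(image):
    """Return (mean gray level, contrast, IAM) under the stated assumptions."""
    height, width = len(image), len(image[0])
    count = height * width

    # Mean gray level: average luminance over all pixels.
    mean = sum(sum(row) for row in image) / count

    # Contrast: population standard deviation of luminance (assumed definition).
    variance = sum((v - mean) ** 2 for row in image for v in row) / count
    contrast = math.sqrt(variance)

    # IAM: absolute differences between vertical and horizontal neighbors,
    # normalized by the pixel count (commonly cited form, assumed here).
    vertical = sum(abs(image[r][c] - image[r + 1][c])
                   for r in range(height - 1) for c in range(width))
    horizontal = sum(abs(image[r][c] - image[r][c + 1])
                     for r in range(height) for c in range(width - 1))
    iam = (vertical + horizontal) / count

    return mean, contrast, iam
```

All three quantities need only one pass over the original image, which is what makes them usable as predictors before any coding is performed.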
In addition, according to the characteristics of DWT-based compression algorithms, some special parameters about wavelet coefficients are selected as frequency domain characteristics of the image, including coefficients average , coefficients energy concentration , and bit-plane entropy average . They are defined as where represents the wavelet coefficient values of image with DWT, represents the number of bit-planes, and represents the entropy of the bit-plane.
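As an illustration of the last of these characteristics, the sketch below computes an average bit-plane entropy over integer coefficient magnitudes. It assumes that the entropy of a bit-plane is the binary Shannon entropy of its proportion of one-bits and that sign bits are ignored; the paper's exact definition is not reproduced here, so this is only one plausible reading, and the function names are hypothetical.

```python
import math


def binary_entropy(p):
    """Shannon entropy in bits of a binary source with P(1) = p."""
    if p <= 0.0 or p >= 1.0:
        return 0.0
    return -(p * math.log2(p) + (1.0 - p) * math.log2(1.0 - p))


def bitplane_entropy_average(coefficients, num_planes):
    """Mean binary entropy over the lowest num_planes magnitude bit-planes."""
    magnitudes = [abs(int(c)) for c in coefficients]  # sign ignored (assumption)
    total = 0.0
    for plane in range(num_planes):
        ones = sum((m >> plane) & 1 for m in magnitudes)
        total += binary_entropy(ones / len(magnitudes))
    return total / num_planes
```

Under this reading, a subband whose bit-planes are almost all zeros has an entropy near zero and is cheap to code, while bit-planes with an even mix of zeros and ones approach one bit per coefficient per plane.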
To establish a function between image characteristics and compression quality, 86 gray images are selected as the training set, in which about 70 percent of images are remote sensing images, and every one is a 512-by-512 pixel, eight bits per pixel. JPEG2000 is used to compress all images at different CR, such as 2.0 bpp, 1.0 bpp, 0.8 bpp, 0.67 bpp, 0.5 bpp, and 0.4 bpp. A distorted image database is composed of all decompressed images with different CR. Then, quality assessment is applied to the distorted image database with MSMLD, and the objective evaluation values of images are achieved. In the end, a final training set comes into being, consisting of the original images, decompressed images, and the objective evaluation values.
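Because the training images have eight bits per pixel, each bit rate above corresponds to a compression ratio of 8 divided by the bit rate. The short sketch below performs this conversion; the function name is hypothetical.

```python
def compression_ratio(bits_per_pixel, source_bit_depth=8):
    """Compression ratio implied by a coded bit rate for a given source depth."""
    return source_bit_depth / bits_per_pixel


# Bit rates used for the training set, from 2.0 bpp down to 0.4 bpp.
for bpp in (2.0, 1.0, 0.8, 0.67, 0.5, 0.4):
    print(f"{bpp:.2f} bpp -> {compression_ratio(bpp):.1f} : 1")
```

The two extremes, 2.0 bpp and 0.4 bpp, correspond to 4 : 1 and 20 : 1, which matches the compression ratio range stated for the extended prediction model; 0.67 bpp is roughly 12 : 1.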
In order to explore if any particular image feature is mainly responsible for the variation in MSMLD values among various images, a set of gray level image statistics are analyzed with scatter diagram  to determine whether the two variables are related, and image characteristic and objective evaluation value are plotted against each other as a scatter diagram. Figure 9 shows the relationship between each characteristic of images and image objective evaluation values at 0.5 bpp with MSMLD. Experiments indicate that every characteristic has different scatterplots with image objective evaluation values, and and have less relevant degree with MSMLD values, but , , , and link very well with MSMLD values.
In order to choose subsequent prediction factors more accurately from the above candidates, curvilinear regression analysis is further used to reveal interdependency between each characteristic itself and MSMLD values. SPSS data statistical software is used to process six kinds of curvilinear regression analysis, such as linear, logarithm, exponential, S type curve, logistic, and quadratic polynomial. Table 2 is fitting results of the analysis. From the results, it can be made sure that , , , and in quadratic form link well with MSMLD values, and and in logarithm form have better dependency with MSMLD values relatively. For this reason, the quadratic form of , , , and and logarithm form of and are chosen as the optional factors for multiple linear regression analysis in the next steps. According to the above analysis, six factors such as , , , , , and will be chosen as the optional factors of a predictive model for multiple linear regression analysis in the late steps.
4.2. Construction of Prediction Model
After determining candidate predictors, the final predictors should be selected to establish a quality prediction model. Multiple linear regression (MLR) is a powerful tool commonly used to find a mathematical relationship between a group of random variables. The goal of MLR is to model the relationship between the explanatory and response variables, and the model takes the form of a linear equation that best approximates all the individual data points. Three variable selection techniques are employed in MLR: forward selection, backward elimination, and stepwise regression. Stepwise regression is the most frequently used of these procedures; it proceeds with forward selection, but each time a variable is added, a backward elimination step occurs to test whether any variables entered previously can be removed.
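The least-squares fit underlying MLR can be illustrated with a minimal sketch that solves the normal equations by Gaussian elimination. It fits all supplied predictors at once and does not perform forward, backward, or stepwise selection; the function names are hypothetical, and a statistical package would normally be used in practice.

```python
def solve_linear_system(matrix, rhs):
    """Solve a square linear system by Gaussian elimination with partial pivoting."""
    n = len(rhs)
    aug = [row[:] + [value] for row, value in zip(matrix, rhs)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(aug[r][col]))
        aug[col], aug[pivot] = aug[pivot], aug[col]
        for r in range(col + 1, n):
            factor = aug[r][col] / aug[col][col]
            for c in range(col, n + 1):
                aug[r][c] -= factor * aug[col][c]
    solution = [0.0] * n
    for r in range(n - 1, -1, -1):
        partial = sum(aug[r][c] * solution[c] for c in range(r + 1, n))
        solution[r] = (aug[r][n] - partial) / aug[r][r]
    return solution


def fit_mlr(features, targets):
    """Ordinary least squares: returns [intercept, b1, b2, ...]."""
    rows = [[1.0] + list(f) for f in features]  # prepend the intercept column
    k = len(rows[0])
    xtx = [[sum(r[i] * r[j] for r in rows) for j in range(k)] for i in range(k)]
    xty = [sum(r[i] * t for r, t in zip(rows, targets)) for i in range(k)]
    return solve_linear_system(xtx, xty)


def predict(coefficients, feature_row):
    """Evaluate the fitted linear model for one observation."""
    return coefficients[0] + sum(b * x for b, x in zip(coefficients[1:], feature_row))
```

Stepwise regression repeats such a fit while adding and removing candidate predictors according to a significance test, so this fit is only its innermost step.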
For images compressed at 0.5 bpp, the final predictors are selected and the model coefficients are determined by MLR with stepwise regression. Table 3 shows the analysis results: the “model” column gives the index of each stepwise cycle and the factors selected in that cycle, and the column of “” gives the coefficient of each predictor in the model. The table shows that the stepwise regression runs for 3 cycles and, respectively, selects and square and as predictive factors, whose coefficients are given.
Thus, for remote sensing images compressed at 0.5 bpp with the JPEG2000 algorithm, the MSMLD quality value can be predicted from three statistics: the mean value of image gray level , coefficients energy concentration , and bit-plane entropy average . The prediction model is given by
To evaluate the accuracy of the prediction model, the residuals between the image quality evaluation values and the predicted values are calculated and represented by the standardized residuals histogram shown in Figure 10. The histogram shows that the residuals between predicted and actual values are approximately normally distributed, and the estimated standard deviation is defined as . The confidence level describes the probability that a confidence interval contains a given residual; from Figure 10 we can see that a 95% confidence level is achieved with a confidence interval of , and a 70% confidence level is achieved with a confidence interval of . This means that the predicted values correlate well with the actual values and that the predictive accuracy of the model is high.
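The residual check can be illustrated with a short sketch. It standardizes residuals by their estimated standard deviation and measures how many fall inside a symmetric interval. The multipliers 1.96 and 1.04 are the standard normal quantiles for 95% and about 70% coverage; they are not values taken from Figure 10. The data and function names are hypothetical.

```python
import numpy as np

def standardized_residuals(actual, predicted, n_predictors):
    """Return residuals divided by their estimated standard deviation, and that deviation."""
    resid = np.asarray(actual) - np.asarray(predicted)
    dof = resid.size - n_predictors - 1  # subtract predictors and the intercept
    sigma = np.sqrt(np.sum(resid ** 2) / dof)
    return resid / sigma, sigma

def coverage(z, k):
    """Fraction of standardized residuals inside [-k, k]."""
    return float(np.mean(np.abs(z) <= k))

# Hypothetical actual and predicted quality values with normal prediction noise.
rng = np.random.default_rng(2)
actual = rng.uniform(0.6, 1.0, size=2000)
predicted = actual + rng.normal(scale=0.01, size=2000)

z, sigma = standardized_residuals(actual, predicted, n_predictors=3)
cov_95 = coverage(z, 1.96)
cov_70 = coverage(z, 1.04)
print(sigma, cov_95, cov_70)
```

If the residuals are close to normal, the two coverage values should land near 0.95 and 0.70; large departures would suggest that the normality assumption behind the confidence intervals does not hold.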
Similarly, prediction models of MSMLD can be constructed for CRs of 0.8 bpp and 0.67 bpp, and the value of MSMLD can also be predicted with , , and by curve regression analysis and multiple linear regression. This shows that the joint function of image brightness , wavelet energy concentration , and wavelet entropy value is suitable for predicting image quality at any compression ratio; only the coefficients of the predictors need to be adjusted according to the given CR.
Based on the above, the multiple linear regression method called “enter” is used to analyze the linear relationship between , and image compression quality, and a prediction model is built as formula (9). The model is applied to JPEG2000 and to BHC, another prominent DWT-based coding method for remote sensing images, and can accurately predict compression quality at various bit rates, such as 2 bpp, 1 bpp, 0.8 bpp, 0.67 bpp, 0.5 bpp, and 0.4 bpp. Consider where
5. Verification and Analysis
To verify the accuracy of the proposed prediction model, 30 images are randomly selected from the Internet as a test set; most are remote sensing images with different characteristics and resolutions, each 512-by-512 pixels at eight bits per pixel. None of the test images belong to the training set. Each image is compressed by the JPEG2000 method at six CRs: 2 bpp, 1 bpp, 0.8 bpp, 0.67 bpp, 0.5 bpp, and 0.4 bpp. The final test set therefore comprises 180 decompressed images.
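Because the paper quotes compression levels as bit rates, a short worked conversion may help relate them to the compression ratios of 4 : 1 to 20 : 1 mentioned in the abstract. The sketch assumes 8-bit source images, as stated for the test set; the function name is illustrative.

```python
def compression_ratio(bit_rate_bpp, source_depth_bits=8):
    """Compression ratio implied by a target bit rate for a given source bit depth."""
    return source_depth_bits / bit_rate_bpp

bit_rates = [2.0, 1.0, 0.8, 0.67, 0.5, 0.4]
ratios = {rate: round(compression_ratio(rate), 1) for rate in bit_rates}

# 30 test images, each compressed at every bit rate.
n_test_images = 30 * len(bit_rates)

for rate, ratio in ratios.items():
    print(f"{rate} bpp -> about {ratio} : 1")
print(n_test_images)
```

A lower bit rate therefore corresponds to a higher compression ratio: 2 bpp is 4 : 1, while 0.4 bpp is 20 : 1.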
For each image in the test set, the quality assessment value of the decompressed image is calculated by MSMLD as the actual result, and the value given by the proposed prediction model is taken as the predictive result. Two typical results are shown in Figure 11.
All differences between the actual and predictive results are computed and shown as cumulative distribution functions (CDF) in Figure 12. The figure shows, on the one hand, that most of the absolute differences are less than 0.02 and that the differences are approximately normally distributed, concentrated around 0. On the other hand, the error distribution is more concentrated at lower CR, and the predictive accuracy decreases somewhat as the compression ratio increases; nevertheless, no error exceeds 0.03 at 1 bpp, 0.8 bpp, 0.67 bpp, and 0.5 bpp, and only three errors exceed 0.03 at 0.4 bpp.
In addition, the prediction errors at the various compression ratios are counted and shown in Table 4. According to the table, 98.33% of the prediction errors are less than 0.03, and 89.44% are less than 0.02. Meanwhile, the prediction model achieves relatively higher prediction accuracy at lower compression ratios; for example, 96.66% of the prediction errors are less than 0.02 when the compression ratio is lower than 2 bits per pixel. Because the range of MSMLD evaluation values is , an error magnitude of 0.02 to 0.03 accounts for only 2% to 3% of the entire range, so the proposed prediction model can accurately predict the assessed image quality.
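The percentages above follow directly from counts over the 180 test images. The sketch below reproduces the arithmetic with placeholder error magnitudes: the counts (161 errors below 0.02, 177 below 0.03) are the ones implied by the reported percentages, while the individual error values are invented for illustration.

```python
import numpy as np

def fraction_below(errors, threshold):
    """Fraction of absolute prediction errors strictly below a threshold."""
    return float(np.mean(np.abs(errors) < threshold))

# Placeholder magnitudes; only the counts in each band are implied by the text.
errors = np.concatenate([
    np.full(161, 0.010),  # below 0.02
    np.full(16, 0.025),   # between 0.02 and 0.03
    np.full(3, 0.035),    # above 0.03
])

pct_below_003 = round(100 * fraction_below(errors, 0.03), 2)
pct_below_002 = round(100 * fraction_below(errors, 0.02), 2)
print(pct_below_003, pct_below_002)
```

With these counts the two figures come out as 98.33 and 89.44, matching the values quoted from Table 4.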
As previously described, the methods currently used for image compression quality prediction are mainly traditional ones, such as PSNR. Since the range of their quality evaluation values differs from that of MSMLD, it is difficult to compare prediction results directly. Therefore, the ratio of the prediction error to the actual value, expressed as a percentage, is used as the indicator for comparison. For example, at a CR of 0.5 bpp, the predicted PSNR is calculated according to the prediction model in ; then the actual PSNR and the absolute error between the predicted and actual values are calculated. Finally, the percentage ratio of the prediction error to the actual value is obtained. The same percentage is calculated for the proposed MSMLD model, and all results are shown in Table 5. According to the table, the average relative prediction error is 1.54% for the predictive model in and 1.09% for the proposed MSMLD model, so the proposed model has the higher prediction accuracy. Similar experiments are carried out with the BHC algorithm; as with JPEG2000, the model predicts the MSMLD compression quality accurately.
Meanwhile, the proposed MSMLD algorithm has better consistency with subjective perception values than PSNR in remote sensing image compression assessment, and the objective assessment results reflect the distortion features and visual quality of the compressed image well. It can therefore be concluded that the proposed MSMLD prediction model is not only accurate for assessment prediction, but also closer to the subjective evaluation of compressed image distortion.
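The relative prediction error used for this comparison can be written as a small helper. The sketch shows why the normalization makes metrics on different scales comparable; the sample PSNR and MSMLD values are hypothetical and are not taken from Table 5.

```python
def relative_error_percent(predicted, actual):
    """Absolute prediction error as a percentage of the actual value."""
    return abs(predicted - actual) / abs(actual) * 100.0

# Hypothetical values on two different scales (PSNR in dB, MSMLD as a unitless score).
psnr_error = relative_error_percent(predicted=35.0, actual=35.5)
msmld_error = relative_error_percent(predicted=0.91, actual=0.92)

print(f"PSNR relative error:  {psnr_error:.2f}%")
print(f"MSMLD relative error: {msmld_error:.2f}%")
```

Averaging this quantity over all test images gives a single figure per model, which is the kind of value reported as 1.54% and 1.09% in the comparison.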
6. Conclusions and Future Work
In this paper, a novel image quality assessment method named MSMLD is proposed for remote sensing image compression; it combines the merits of pixel-level distortion, contexture-level distortion, content-level distortion, and multiscale structural similarity. Compared with state-of-the-art image quality assessment approaches such as SSIM, VIF, and PSNR, the proposed MSMLD algorithm has better consistency with subjective perception values in remote sensing image compression assessment, and the objective assessment results reflect the distortion features and visual quality of the compressed image well.
Based on the above, by analyzing the correlation between multilevel differences and image characteristics, six important image characteristics in the space domain and frequency domain are selected: image gray level, image contrast, image activity measure, wavelet coefficients average, wavelet coefficients energy concentration, and bit-plane entropy average. Furthermore, three characteristics that are closely related to the MSMLD quality assessment are chosen with multiple linear regression and used to establish a quality prediction model of MSMLD for DWT-based coding algorithms. The quality prediction model is applied to two DWT-based compression algorithms, JPEG2000 and BHC, at various compression ratios and tested with experiments; the experimental results show that the prediction accuracy of the proposed model is up to 98.33%, and its mean prediction error is less than that of the current state-of-the-art methods. It can efficiently predict the distortion of remote sensing image compression at multiple fixed bit rates.
Although the proposed prediction model of MSMLD can reflect the visual quality of compressed remote sensing images accurately, some issues deserve further investigation. Firstly, besides the currently used low-level visual features such as small targets, complex edge texture, and uneven distribution of image content, additional elements and characteristics of remote sensing images may be taken into account and mapped into multilevel spatial semantics via visual feature extraction. Moreover, high-level semantics such as object-based image classification and spatial relationship inference [34, 35] can also be introduced to forecast image quality during remote sensing image compression. Secondly, some new concepts may benefit the prediction of remote sensing image compression assessment, such as image sketch and image epitome, which could be derived from original images to assist both the vision technologies and the compression assessment. Finally, since MSMLD is a full-reference method that requires full information of the reference image, it could be a serious impediment for some applications. Developing a reduced-reference prediction method based on MSMLD is therefore essential in the future to reduce the prediction complexity of image assessment with limited information about the compressed images.
Conflict of Interests
The authors declare that there is no conflict of interests regarding the publication of this paper.
This work is partially supported by the 973 Program (no. 2010CB327900), the NSFC (no. 61272347), and the National Science Fund for Distinguished Young Scholars (no. 61125206).
“Information technology-digital compression and coding of continuous-tone still images,” Tech. Rep. ISO/IEC 10918-1 and ITU-T Rec. T.81, 1992.
“Information technology-JPEG2000 image coding system,” ISO/IEC 15444-1 and ITU-T Rec. T.800, 2000.
D. Taubman and M. Marcellin, JPEG2000: Image Compression Fundamentals, Standards and Practice, Kluwer Academic, Norwell, Mass, USA, 2001.
L. Pesquer, X. Pons, A. Cortes, and I. Serral, “Spatial pattern alterations from JPEG2000 lossy compression of remote sensing images: massive variogram analysis in high performance computing,” Journal of Applied Remote Sensing, vol. 7, no. 1, Article ID 073595, 2013.
Image Data Compression, Recommendation for space data system standards, CCSDS 122.0-B-1, 2005.
A. Kourzi, D. Nuzillard, G. Millon, and F. Nicolier, “Quality estimation in wavelet image coding,” in Proceedings of the 13th European Signal Processing Conference (EUSIPCO '05), pp. 1227–1230, Antalya, Turkey, September 2005.
S. Saha and R. Vemuri, “An analysis on the effect of image features on lossy coding performance,” IEEE Signal Processing Letters, vol. 7, pp. 104–107, 2000.
S. Nichols, K. Hyunju, A. A. Humos, and H. J. Cho, “A performance evaluation on DCT and wavelet-based compression methods for remote sensing images based on image content,” in Proceedings of the 17th International Conference on Geoinformatics (Geoinformatics '09), pp. 1–5, Fairfax, Va, USA, August 2009.
M. Datcu, G. Schwarz, K. Schmidt, and C. Reck, “Quality evaluation of compressed optical and SAR images: JPEG vs. wavelets,” in Proceedings of the International Geoscience and Remote Sensing Symposium (IGARSS '95), pp. 1687–1689, July 1995.
T. Mitsa and K. L. Varkur, “Evaluation of contrast sensitivity functions for the formulation of quality measures incorporated in halftoning algorithms,” in Proceedings of IEEE International Conference on Acoustics, Speech and Signal Processing, pp. V-301–V-304, Minneapolis, Minn, USA, April 1993.
Z. Wang, E. P. Simoncelli, and A. C. Bovik, “Multi-scale structural similarity for image quality assessment,” in Proceedings of the 37th Asilomar Conference on Signals, Systems and Computers, pp. 1398–1402, Pacific Grove, Calif, USA, November 2003.
Information technology—JPEG XR image coding system, ISO/IEC 29199-5 and ITU-T Rec. T.835, 2010.
R. H. Jiao, High efficient prediction coding and its optimization in image compression [Ph.D. dissertation], Beihang University, Beijing, China, 2008.
D. Djaenudin, H. Marwan, H. Subagjo, and A. Hidayat, “Technical Guidance of Land Evaluation for Agricultural Commodities,” Tech. Rep., Soil Research Institute, Agricultural Research and Development Agency, 2003.
L. S. Aiken, S. G. West, and S. C. Pitts, “Multiple linear regression,” in Handbook of Psychology, 2003.
M. Wang, T. Song, H. R. Sheikh et al., “Remote sensing image retrieval by scene semantic matching,” IEEE Transactions on Geoscience and Remote Sensing, vol. 51, pp. 2874–2886, 2013.