Abstract

A novel procedure using phase congruency is proposed for discriminating some lung disease using chest radiograph. Phase congruency provides information about transitions between adjacent pixels. Abrupt changes of phase congruency values between pixels may suggest a possible boundary or another feature that may be used for discrimination. This property of phase congruency may have potential for deciding between disease present and disease absent where the regions of infection on the images have no obvious shape, size, or configuration. Five texture measures calculated from phase congruency and Gabor were shown to be normally distributed. This gave good indicators of discrimination errors in the form of the probability of Type I Error (δ) and the probability of Type II Error (β). However, since 1 −  δ is the true positive fraction (TPF) and β is the false positive fraction (FPF), an ROC analysis was used to decide on the choice of texture measures. Given that features are normally distributed, for the discrimination between disease present and disease absent, energy, contrast, and homogeneity from phase congruency gave better results compared to those using Gabor. Similarly, for the more difficult problem of discriminating lobar pneumonia and lung cancer, entropy and homogeneity from phase congruency gave better results relative to Gabor.

1. Introduction

Lung cancer is the cause of more than a quarter of all cancer deaths in the United States and a major contributor of cancer death worldwide [1]. Despite efforts to control lung cancer, the chance of survival from this disease is still low (5–10% at five years). Annually 1.52 million new cases of lung cancer are detected and more than 1.31 million deaths are reported [2]. Lung cancer in early stages does not have clear signs and symptoms; however, continued coughing, shortness of breath, chest pains, and wheezing and coughing of blood are common indicators [3].

Pneumonia is a serious infection of the lungs and may have over 30 different causes. Symptoms of pneumonia include fever, coughing, shortness of breath, chest pain, and loss of appetite [4]. Over fifty-five thousand people died of pneumonia in the United States of America in the year 2006 [5]. Pneumonia is the ninth leading cause of death in 2010, with the highest mortality rate of all infectious diseases especially for people over 65 years of age [6].

Economic considerations create the situation where the digital chest X-ray is widely used [7]. For example, the Malaysian government hospitals perform a large number of diagnoses using radiograph films. However problems arise with the use of X-rays where studies have shown that the accuracy of the X-ray interpretation is subject to varying degrees of observer error [8]. Current methods or algorithms for disease detection mainly focus on the discrimination between normal images and images with signs of disease involving chest radiograph [915].

The study of image-based assessment of pulmonary infiltrates has seen the application of a wide variety of techniques; for example, Oliveira et al. [10] used wavelet, van Ginneken et al. [12] and Arzhaeva et al. [13] used texture, Katsuragawa and Doi [14] used contrast enhancement, Arzhaeva et al. [15] used two-class classifier, and Tsevas and Iakovidis used Gabor filter features [16].

Phase congruency (PC) features do not seem to be used for the assessment of the pulmonary infiltrates. Kovesi [17] showed that corners and edges are well detected using PC. The problem of discriminating between disease present and disease absent where the regions of infection on the images have no obvious shape, size, or configuration, sharp transitions between adjacent pixels strongly suggest using PC as a viable candidate for the development of features.

The use of PC was motivated by Oppenheim and Lim where phase information in a signal is more important than its amplitude [18]. Further developments involved the study of local energy model where the features are perceived at points in an image and the Fourier components are maximally in phase [1923]. Venkatesh and Owens proved that local energy is proportional to phase congruency [24]. In 1999 Kovesi developed the phase congruency model based on log-Gabor transfer function for specific image applications [25].

One advantage of using PC is that it provides information about adjacent pixels. This property is particularly useful when the region of infection (ROI) does not conform to any standard shape, size, or configuration. Further the PC is invariant to changes in image brightness or contrast [25]. High congruency between phases (the value close to one) indicates large transition between adjacent pixel and low congruency between phases (the value close to zero) indicates mild transition between adjacent pixels [25]. Another advantage of PC is that it is a dimensionless measure and allows constant threshold values to be applied across wide classes of images. The properties of the phase congruency model have been used in new and interesting techniques in the recent decade to extract image features [17, 2628].

Textures measures will be used as features for the discrimination. There are many techniques used in texture feature extraction. A popular technique is the use of wavelet textures; for example, in [9], Daubechies wavelet was applied. However, Daubechies wavelet does not preserve phase information [25]. Texture measures using cooccurrence matrix and markov random field parameters are sensitive to illumination and Gaussian noise [29], a problem which can be handled using PC. Multiresolution histogram does give texture information and in general they can discriminate between images with different spatial patterns alone without the help of other filters or features [30, 31]. Another technique that is widely used in capturing local structural patterns from images is Gabor filter [16]. References [3238] reported that Gabor filters yield excellent texture descriptors. However in our study the region of infection does not exhibit any well- or easily defined spatial patterns. These remarks suggested that the potential of PC texture features should be explored.

The closest rival to the application of PC texture features is that of the application of Gabor texture features which will be compared in this study. Kovesi formulates PC using log Gabor as the filter function which maintains zero DC for arbitrarily large bandwidth compared to Gabor function (zero DC confined to one octave bandwidth only) [25]. Secondly log Gabor has extended tail at high frequency and therefore high frequency components are not suppressed resulting in preservation of image details [39, 40].

The discrimination ability of PC texture features relative to Gabor texture features in discriminating lobar pneumonia (PNEU) from normal healthy individuals (NL), lung cancer (LC) from NL, and between PNEU and LC using chest radiograph will be carried out. The texture measures are tested for normality. Existence of the univariate normal probability distributions allows optimal discrimination functions to estimate error probabilities, in particular, the probability of Type I Error () and the probability of Type II Error (). The final selection of texture measures was decided from the result of an ROC analysis.

2. Selection of Case Study

This study involved collaboration with the Institute of Respiratory Medicine (IPR), Malaysia, which is the national referral centre for respiratory diseases. The IPR provided archived patients’ data which include chest X-ray films (blue-based films) captured using the Phillips Diagnost 55/Super 50CP (Phillips Corp., Holland) together with complete patients’ medical information. The patient’s chest was captured in full inspiration using the posterior-anterior (PA) view. The distance between the X-ray and the patient is fixed at 180 cm to diminish the effect of beam divergence and magnification of structures closer to the X-ray tube. The cassette sizes of 35 cm × 35 cm and 35 cm × 43 cm were used for female and male patients, respectively. A patient is exposed to 64 kV and 4.0 mAs if he or she is underweight, 70 kV and 5.0 mAs if he or she is normal weight, and 96 kV and 8 mAs if he or she is overweight (>100 kg). All overexposed and underexposed X-ray films as well as those with shadows are omitted from the study.

The archived data (stored in files) in IPR were diagnosed by a pulmonologist. In IPR, all the pulmonologists are trained to interpret chest radiographs. In this study, stratified random sampling (SRS) was carried out for the patients’ files which were randomly selected given that the patients chosen were already diagnosed as LC or PNEU. The diagnoses of all cases were done by the IPR pulmonologists, and in this study the diagnoses were again verified by the consultant pulmonologist. It should be noted that the pulmonologist and consultant pulmonologist mentioned above are two different individuals. The LC cases consist of 75% cases of nonsmall cell carcinoma of which 50% of these are the squamous cell carcinoma and 25% are the adenocarcinoma. The other 25% cases are small cell carcinoma. The confirmation of LC was based on bronchial biopsy result. The confirmation of PNEU cases was based on chest X-ray findings and sputum culture test. This sputum test is done to identify the causative organism, and in this study, the commonest organism isolated is Streptococcus pneumoniae bacteria, while the chest X-ray is to identify the type and site of infection. The commonest type noted in this study is lobar pneumonia. The selected patients used were the confirmed LC and PNEU cases with no other systemic diseases such as diabetes, hypertension, and heart disease. The omission of cases, with the help of the consultant pulmonologist, with other systemic diseases was done in order to avoid bias in the development of a statistical discriminant function. The normal lung (NL) chest X-ray films selected by the radiologist from IPR represent patients who came for a general medical check-up. The complete dataset consist of 40 NL, 50 LC, and 50 PNEU cases.

The chest X-ray films were then digitized into DICOM format using the Kodak LS 75 X-ray Film Scanner (pixel spot size of 100 µm, 12 bit per pixel, and image size of 2016 × 2048 pixels). Examples of a digitized X-ray film for normal lung and infected lung with LC and PNEU are shown in Figure 1. All digital image X-rays underwent normalization process as described in the next section.

2.1. Normalization of the Chest Radiograph

Image normalization was done on the whole dataset to make image intensities comparable across the whole dataset [16]. LC or PNEU cases will have their chest X-ray film image exhibitting some abnormal opacity. The landmarks on the spine were selected following [16] where these landmarks remain practically unaffected by the presence of infiltrates. Other than the spine, other lung anatomies especially the soft tissues are not suitable to become normalization landmark since it will be affected by the presence of infiltrates. A landmark, for example, the seventh cervical vertebra (C7), was selected for each patient in a given category (example NL) and the corresponding pixel intensity at the centre of C7 was studied. This process was repeated for other landmarks, 1st–4th thoracic vertebrae (, , , and , resp.), all of which were taken at the center of the spine. The summary statistics illustrated in Tables 1(a), 1(b), and 1(c) showed that the variation of the intensity is small.

To eliminate problem the normalization of the chest radiograph was done on the dataset as proposed by [16].

In order to enhance the discrimination procedure X-ray images were preprocessed by histogram equalization and image normalization processes. Figure 2 illustrates the normalization procedure proposed [16]. The procedure is as follows.(1)The original images were cropped in order to have only the lung fields.(2)Plot the image histogram of each image.(3)In order to have uniform distribution of intensities, for each resultant image the histogram equalization procedure was implemented.(4)Four landmarks, namely, the middle of 1st–4th thoracic vertebrae (, , , and resp.), are selected as references.(5)Steps 1 to 4 were repeated for all 40 NL, 50 LC, and 50 PNEU cases.(6)Let be the central tendency of the intensity references at selected landmarks. Let where (40 NL, 50 LC and 50 PNEU).Let be the central signature of our data base.(7)Let be a given radiograph. The image was then normalized by dividing the image intensities with the central signature, , where is the resultant normalized image. Figure 3 shows the original image and the normalized image and its respective histogram.

3. Phase Congruency Model

The two-dimensional phase congruency model for any pixel is given as follows [25]:such that if the value of is negative, the function returns a value zero; otherwise, it returns its arguments.

The component is the weighting function at orientation , whereand is the total number of scale being considered, is a small positive number added to prevent division by zero, is the amplitude of the filter pair having maximum response at pixel , is the gain factor to control the sharpness of cut-off filter, is the cut-off filter value, and is the amplitude of the transform at the given wavelet scale using log-Gabor filter,where is the angular frequency, is the filter’s centre frequency, and the term gives approximately two-octave bandwidth. , where , and is the scaling between centre frequencies of successive filters. The wavelength of the smallest scale is denoted as . The angular overlap of the filter transfer function is controlled by the ratio of the angular interval between filter orientations and the standard deviation of the angular Gaussian spreading function (denoted by ).

A measure of angular phase deviation that is approximately linear is given bywhere and are the local phase and the mean phase angle, respectively. compensates the effect of noise at orientation , where and are the mean and variance of the Rayleigh distribution of the noise energy response and = constant (typically 2 or 3). is the amplitude of the transform at the given wavelet scale and orientation .

4. Gabor Model

Gabor’s theory implies that the information can be represented by the amplitudes of functions that are localized in both space and frequency [39]. By using 2D Gabor filter, orientation and scale information can be captured.

Gabor filtered image is obtained by convolving image with a two-dimensional Gabor function given by [41] as follows:where returns the value zero if is negative; otherwise, it returns its arguments. Two-dimensional Gabor function, , is given as where the arguments and are the position of a light impulse in the visual field, is the center of a receptive field, is the standard deviation of the Gaussian factor that determines the size of the receptive field, is the spatial aspect ratio (in this study we use ), is the wavelength from the cosine factor such that is the spatial frequency bandwidth, is the orientation of the filter, and is the phase offset that determines the symmetry of the .

5. Methods

Three random samples consisting of 40 NL, 50 LC, and 50 PNEU cases were used in this study. The consultant pulmonologist determined the ROI as ground truths. The PC parameters were then determined in a simulation study. It should be noted that the Gabor model is chosen in such a way that it has the same number of channels and similar channel characteristic as in the PC model. The ROI size for both model is fixed at 256 × 256 pixels. Once the parameter values are selected, the corresponding texture measures were calculated. Summary statistics of texture measures investigate the range of values of the texture measures. Test of normality on texture measures in turn indicates the suitability of texture measures to be used in an optimal statistical discrimination procedure. Finally an ROC analysis determined the best choice of texture measures and this enables a comparison between PC and Gabor features in discriminating between PNEU and NL and between LC and NL. A comparison was also made in discriminating between PNEU and LC.

5.1. Selecting Phase Congruency Parameter Values

A simulation study was carried out to find optimal values for the parameters , , , , , , , , and of (2) using the NL images (ROI size fixed at 256 × 256 pixels). The first estimate of the parameters is the default values suggested by Kovesi [25] using , , , , , , , , and , which does not give clear distinction between the rib-bones and lung tissue. The parameter values are varied as follows: Figure 4 shows the effect of varying while keeping the other default values fixed. For each value of , the power signal to noise ratio (PSNR) value between the original image and transformed image was calculated and clearly the case gave the highest PSNR value (). Similarly, varying with and all remaining parameters at their default values suggests selecting , (Figure 5). This process is continued, as illustrated in Figures 6, 7, 8, 9, 10, and 11 until the combination of , , , , , , , , and yielded the highest PSNR value (9.1575). These optimal PC parameters found for the NL case were then applied to the LC and PNEU case (see Table 2).

The consultant pulmonologist studied these images in comparison with the original image and was able to locate the ROI. To further investigate the suitability of the parameter values, a selected line profile which is a plot of PC() values versus pixel position was obtained and shown in Table 2. The corresponding 2D profile is also given. The 1-D line profile gave clear differences but is less obvious for the 2D profile which gives motivation for using PC() values to discriminate LC from PNEU. Texture measures of PC() values may in turn be possible candidates for features in a given discrimination procedure.

5.2. Gabor Filter Parameters

To enable reasonable comparison between PC and Gabor, in this study we use the same number of channels and similar channel characteristics. The number of channels is 30 (wavelet scale , orientation ). The other parameters used for Gabor filter are and , and phase offset, . The ratio determines the spatial frequency bandwidth, and since PC model used two-octave bandwidth, the same bandwidth size is utilized for Gabor model. The ratio gives two-octave bandwidth [42]. is the spatial aspect ratio (in this study we use ). The spatial aspect ratio has been found to vary in a limited range of [43]. The values and are used in our study as suggested by [42]. The size of ROI is fixed at 256 × 256 pixels.

5.3. Phase Congruency and Texture Measures

The PC() values were obtained for the selected ROI (fixed at 256 × 256 pixels), and subsequently a texture measure may then be calculated. There are many possible texture measures that could be used; however, in this study energy (), mean energy (), entropy (), contrast (), homogeneity (), standard deviation of value (), standard deviation of energy (), and correlation () texture measures [11] are considered. The mean and standard deviations of texture measures using PC model are listed in Table 3 and the corresponding table for Gabor model is given in Table 4. A comparison of these two tables shows that texture measures that tend to be better estimated using PC.

5.4. A Test of Normality

Amongst the many tests for univariate normality the Kolmogorov-Smirnov test is a widely used test that has high power even for relatively small sample size (less than 30). The Kolmogrov-Smirnov test statistic is -stat , where and are the empirical and assumed normal cumulative distribution function (CDF) of the random variable , respectively. The hypothesis of normality is rejected when the value of -stat is greater than the selected critical value [44].

In total 140 images were used in this study where seventy images were used as control group in which 20 are from NL, 25 from LC, and 25 from PNEU. The other seventy images comprised of 20 NL, 25 LC, and 25 PNEU are used as the test group. Normality was tested at a 95% significance level. Homogeneity, energy, mean energy, entropy, and contrast were shown to be normally distributed for both PC and Gabor models. Table 5 illustrates the test results for homogeneity using PC model and Gabor model. The results of normality testing were graphically verified by QQ plots as shown in Figure 12.

5.5. Statistical Discrimination

Without loss of generality the discrimination problem for LC and NL will be discussed following the notations of Johnson and Wichern [45]. Since the selected texture measures are normally distributed, an optimal discrimination procedure can be developed using , where and represent the probability density functions for LC and NL, respectively. When the sample data from both populations have equal variances, we use the well-known linear discriminant function (LDF), ; otherwise the quadratic discriminant function (QDF), , where will be used.

If , then is identified as LC; otherwise, is identified as NL, where , , , , and given that and denote the size of the control dataset from LC and NL, respectively. Finally,

The choice of LDF and QDF follows the result of testing the hypothesis , where is the variance of   .

Twenty-five LC cases and twenty NL cases were used to estimate the parameters for a given texture measure TM. The remaining dataset was used as the test sample.

Let the null hypothesis be shows LC and, alternatively, shows NL. Type I Error occurs when the null hypothesis is rejected incorrectly and Type II Error occurs when the null hypothesis is accepted incorrectly. Henceforth the probability of Type I Error and Type II Error is as follows: and .

Similarly δ2, β2, and δ3, β3 are estimated corresponding to the pairs (PNEU, NL) and (LC, PNEU), respectively. A similar procedure was applied using Gabor features. Results are shown in Table 6 for all five texture measures that are normally distributed.

6. Receiver Operating Characteristic (ROC) Analysis

The performance of the discrimination procedure is indicated in terms of the size of and for equal cost of misclassifications and equal a priori probabilities (); see Table 6. In practice this assumption does not hold and instead all possible combinations of these parameters should be considered for a given texture measure. This problem was solved by calculating and for all possible values of followed by the calculation of the true positive fraction (TPF) and the false positive fraction (FPF).

Clearly TPF equals and FPF equals . A plot of FPF and TPF for increasing values of greater than zero is defined as an ROC curve for the texture measure considered. The choice of two texture measures is made by comparing their corresponding ROC curves (see Figure 13). The ROC curve which is left most and highest will imply that the corresponding texture measure should be preferred. Table 7 shows the area under the ROC curve (AUC) for each texture measure.

7. Results and Discussion

The chest X-ray images were selected under stringent conditions; for example, appropriate acquisition parameters were considered when cases involved underweight, normal, and overweight patients. Cases with other systematic diseases were also omitted in this study. Under these conditions the consultant pulmonologist finally determined the ROI which can be regarded as the ground truth.

The improved discrimination results using PC features are possible because of its properties of using Log Gabor as its filter function of having no DC components and its ability to preserve image details relative to the Gabor features [25, 39, 40]. The only disadvantage or limitation of using PC method is that too many parameters that need to be chosen to suit the application. In this study a simulation was carried out to get the best PC parameter that yields the highest PSNR.

A simulation study was carried out which strongly suggests that the PC parameters, where , , , , , , , , and , yielded the highest PSNR (9.1575) and therefore provided evidence of discriminatory properties. A further motivation in the use of PC is that it provides local information and the texture measures of the PC in turn may be regarded as providing global information of the ROI.

To enable reasonable comparison between PC and Gabor, in this study we use the same number of channels and similar channel characteristics which are 30 channels (, ) and bandwidth is 2 octaves following [42].

Five of the texture measures from PC and from Gabor were shown to have univariate normal probability distributions. An optimal discrimination procedure which judiciously selects the LDF or QDF was developed to perform pairwise discrimination. For the case (total ignorance) smaller () errors for discriminating LC and NL as well as discriminating PNEU and NL were shown if energy, contrast, and homogeneity from PC were used relative to Gabor. All texture measures performed badly for the LC and PNEU discrimination case. This prompted the use of the ROC analysis where plots of the TPF versus FPF for all values aided in the selection of texture measures in more than one possible way. Firstly, the ROC curves furthest to the left and which is highest indicate the best choice of texture measures. Secondly, for a fixed low value of FPF (0.2), the texture measure with the highest TPF should be selected. Thirdly, using the area under the ROC curves shown in Table 7 yielded energy (99.86%), contrast (99.06%), and homogeneity (99.86%) from PC and gave the best discrimination results for LC-NL and PNEU-NL discriminations. For LC-PNEU discrimination, entropy (80.02%) and homogeneity (80.16%) gave the best discrimination. Our results on energy and contrast from PC verify the invariance property of PC with respect to image brightness and contrast [25].

A comparison of the use of PC and that of our earlier work [9] showed that Type I Errors and Type II Errors were much less using PC when energy, contrast, and homogeneity were used as features for discrimination.

The proposed method if followed closely as shown above should be applicable to all or most images produced or prepared by a qualified radiologist. As long as the normalization of the image is carried out, the method is robust and will yield high success rate if the discrimination is between disease absent and disease present in a large cohort study. However, for the discrimination between diseases, even with the assistance of a qualified radiologist, the proposed method is only applicable for the situation when both LC and PNEU are in their developed stage and each patient exhibits only one category of disease. These conditions must be adhered to when carrying out a large cohort study where images include other lung diseases.

8. Conclusion

A novel procedure for the pairwise discrimination of LC, PNEU, and NL using texture measures calculated from phase congruency values is proposed in this study. When the set of the phase congruency parameters , , , , , , , , and was used, good discrimination was achieved. The ROC analysis was the method used to indicate the best texture measures for the three pairwise discrimination cases. Given that the texture measures are normally distributed, energy and homogeneity from PC gave better discrimination results for LC-NL and PNEU-NL discriminations, and for LC-PNEU discrimination, entropy and homogeneity gave the best discrimination relative to Gabor. A promising result of this study is that PC texture measures have a better chance of solving the LC-PNEU discrimination problem.

Conflict of Interests

The authors declare that there is no conflict of interests regarding the publication of this paper.

Acknowledgments

This research was supported and funded under the University of Malaya Research Grant (UMRG 252/12AFR) with collaboration from Universiti Teknologi Malaysia and Institute of Respiratory Medicine, Kuala Lumpur.