Abstract

The purpose of this study was to explore the effects of CT slice thickness, reconstruction algorithm, and radiation dose on quantification of CT features to characterize lung nodules using a chest phantom. Spherical lung nodule phantoms of known densities (−630 and + 100 HU) were inserted into an anthropomorphic thorax phantom. CT scan was performed ten times with relocations. CT data were reconstructed using 12 different imaging settings; three different slice thicknesses of 1.25, 2.5, and 5.0 mm, two reconstruction kernels of sharp and standard, and two radiation dose of 30 mAs and 12 mAs. Lesions were segmented using a semiautomated method. Twenty representative CT quantitative features representing CT density and texture were compared using multiple regression analysis. In 100 HU nodule phantoms, 18 and 19 among 20 computer features showed significant difference between different mAs and reconstruction algorithms, respectively (). 20, 19, and 19 computer features showed difference between slice thickness of 5.0 vs 1.25, 5.0 vs 2.5, and 2.5 vs 1.25 mm, respectively (). In −630 HU nodule phantoms, 18 and 19 showed significant difference between different mAs and reconstruction algorithms, respectively (). 18, 11, and 17 computer features showed difference between slice thickness of 5.0 vs 1.25, 5.0 vs 2.5, and 2.5 vs 1.25 mm, respectively (). When comparing the absolute value of regression coefficient, the effect of slice thickness in 100 HU nodule and reconstruction algorithm in −630 HU nodule was greater than the effect of remaining scan parameters. The slice thickness, mAs, and reconstruction algorithm had a significant impact on the quantitative image features. In clinical studies involving deep learning or radiomics, it should be noted that differences in values can occur when using computer features obtained from different CT scan parameters in combination. Therefore, when interpreting the statistical analysis results, it is necessary to reflect the difference in the computer features depending on the scan parameters.

1. Introduction

Biomedical images may contain information that reflects underlying pathophysiology of many diseases. Nowadays, based on high-throughput computing, extracting many quantitative features from tomographic images is possible. Therefore, many studies have focused on how to convert information on images to quantitative computer features. The conversion of digital medical images into high-dimensional computer data is known as radiomics. Radiomics is also a decision support tool, and it can involve combining radiomic data with other patient characteristics such as survival and disease phenotype [1, 2].

Computer features based on computed tomography (CT) histogram and texture are most frequently used for the differential diagnosis of various cancers including lung cancer [36]. Histogram features represent the density of lung nodules and are indicators of distribution of attenuation. On the other hand, digital images are formed from pixels that are too small to be recognized by the human eye. However, the human visual system can detect the patterns such as roughness and smoothness. Such spatial variation of pixel intensities can be represented as the texture. Computerized analysis of a pattern of brightness and darkness is called texture analysis, and texture features show surface information by examining the relationship between voxels on images [7, 8].

Recently, the number of researches on radiomics or deep learning is increasing [914]. Scan parameters can affect the image quality or noise, and some of the computer features extracted from these images are at risk of varying values. Nevertheless, imaging data from clinical studies are usually obtained using different scan parameters. For accurate analysis, images acquired with different parameters should be normalized to the same conditions. For normalization, it is necessary to quantitatively analyze the magnitude of the effects of the change of the scan parameter on the feature values. Several studies have been reported on the quantification of characteristic values according to scan parameters. Zhao et al. [15] reported differences in radiomic features due to changes in slice thickness and reconstruction algorithms. The repeated CT data were collected from 32 lung cancer patients, and experiments were performed on the reproducibility of radiomic features in 6 different combinations of slice thickness and reconstruction algorithms. Also, differences in radiomic features due to changes in reconstruction algorithms at the same slice thickness were analyzed. As a result, the radiomic features were reproducible when the CT was obtained repeatedly under the same conditions. However, Zhao et al. reported that the radiomic features differed in the change of the reconstruction algorithm. Mackin et al. [16] reported the effect of tube current on radiomic features using the phantom in CT. The differences between 48 features extracted from 25 mAs to 300 mAs conditions were analyzed in the CCR phantom, which was made with 10 textures. As a result, the changes in the tube current influenced more on the features extracted from homogeneous materials (acrylic, sycamore wood) than materials with more tissue-like textures (cork, rubber particles). Mackin et al. reported that tube currents do not have a significant effect on the radiomic features extracted from the tissue texture such as tumors. Kim et al. [17] also reported the tube current effect on radiomic features in CT. A total of 15 features were extracted from 42 CT data based on two reconstruction algorithms, filtered back projection, and iterative reconstruction algorithm, respectively. The difference between the features extracted from the two reconstruction algorithms was analyzed. As a result, Kim et al. reported that 9 of 15 features showed significant differences. Several studies have been reported on the differences in radiomic features for changes in scan parameters. However, most of the studies were analyzed using one scan parameter. We need to analyze the relationship between more various scan parameters and radiomic features.

Therefore, the purpose of this study was to analyze the effect of various scan parameters on the quantitative CT features of lung nodule phantoms. We evaluated the effect of different CT slice thicknesses, mAs, and reconstruction algorithms on 3-dimensional computer features including CT histogram, gray-level co-occurrence matrix (GLCM), and gray-level run length matrix (GLRLM).

2. Materials and Methods

2.1. Lung Nodules

In this study, we used an anthropomorphic thorax phantom (KYOTO KAGAKU co., Kyoto, Japan) and nodule phantoms of two different attenuation values (100 Hounsfield Unit (HU) and −630 HU) (Figure 1). The pulmonary nodule can be divided into a solid component and a ground glass component depending on the component. It is generally known that the solid component has an attenuation value of 100 HU, and the ground glass component has an attenuation value of  630 HU. Therefore, we used two types of nodal phantoms with attenuation values of 100 HU and −630 HU on CT. The 100 HU nodules were made with polyurethane and hydroxyapatite, and the −630 HU nodules were made with urethane foam. The size of nodule phantoms was 10 mm and 12 mm in each attenuation value [18]. So, total four kinds of nodule phantoms were used. We inserted nodule phantoms into the two lungs (2 lesions per lung).

2.2. Image Acquisition

CT scan was performed by using a 64 channel multi-detector row CT scanner (GE Discovery CT 750 HD; GE Healthcare, USA). The CT scan parameters were 120 kVp, 64 × 0.625 collimator configuration, and pitch of 0.984 : 1. The raw data were then reconstructed using 12 different combinations of scan parameters; slice thicknesses (1.25, 2.5, 5.0), mAs (30, 120), and reconstruction algorithms (lung, standard). In each combination of scan parameter, CT scan was repeated 10 times with relocation of nodule phantoms (Figures 2 and 3).

In general, it is known that definition and noise decreases as slice thickness increases, noise decreases as mAs increases, and the standard algorithm has less definition and noise than the lung algorithm. Figure 4 shows a graph of the relationship between each scan parameter and noise, sharpness.

2.3. Nodules Segmentation

In this study, we used an in-house software for the computerized analysis of CT images. This software was developed by using Microsoft Visual Studio (Ver. 2010, Microsoft, Redmond, WA, USA), ITK (Insight Segmentation and Registration Toolkit, Kitware Inc., NY, USA), and VTK (Visualization Toolkit, Kitware Inc., NY, USA). For the first step of the volume measurement of the nodules, the entire tumor mass was separated from surrounding anatomic structures by using a semiautomated segmentation algorithm developed in the Laboratory for Computational Image Analysis in the Department of Biomedical Engineering of Gachon University College of Medicine. This algorithm combined the image analysis techniques of seed region-growing algorithm [19]. Computer-generated tumor boundaries were then visually inspected by a radiologist (HJL, with 19 years of experience performing chest image interpretations) for correctness and consistency. If any segmentation results were considered suboptimal, tumor contours were edited by the same radiologist (HJL).

2.4. Features Selection and Extraction

In this study, we used the features mainly used in lung nodule analysis. Among the various features, 20 radiomic features were selected based on several related papers that performed lung nodule analysis [2034].

In this study, we quantized the pixel values of 4,096 gray colors into 256 gray colors, and texture features are extracted based on the discretized pixel values [35]. A total of 20 computer features including 7 histogram features and 13 texture features were extracted from each nodule phantom (Table 1). Histogram features were mean of CT attenuation, standard deviation (stddev), variance, skewness, kurtosis, energy, and entropy. GCLM texture features were contrast, dissimilarity, homogeneity, angular second moment (ASM), energy, probability max, entropy, and correlation. GLRLM texture features were long runs emphasis (LRE), gray-level nonuniformity (GLN), run length nonuniformity (RLN), low-gray-level run emphasis (LGRE), and high-gray-level run emphasis (HGRE) [7, 8, 3639].

GLCM is a matrix that represents the frequency of occurrence in the relationship of gray level between neighboring voxels with a specific direction. GLRLM is a matrix characterized by the frequency of occurrence in the consecutive voxels with the same attenuation value along a specific direction [8].

The GLCM is ​​represented by the maximum size of the gray level, both in rows and columns. The relationship between gray values ​​of all the pixels and the gray values ​​of neighboring pixels according to a given direction and distance is expressed in the number of occurrences in the matrix. In Figure 5(a), the number of occurrences is 2 when the gray value of the target pixel in the left image is 1 and the gray value of the neighboring pixel under the given condition is 1. Therefore, (1, 1) in the right GLCM becomes the total number of occurrences, 2.

In the GLRLM, the rows are represented by gray values and the columns are expressed by the same number of adjacent pixels. The number of occurrences for the case where the gray value of each of the pixels is the same as the gray value of the neighboring pixels according to a given direction, and distance is represented by a matrix. In Figure 5(b), the number of occurrences is 1 when the gray value of the target pixel in the left image is 2 and the length of the same gray value of the neighboring pixel under the given condition is 2. Therefore, (2, 2) in the right GLRLM becomes the total number of occurrences, 1.

On a three-dimensional space, GLCM and GLRLM can generally conduct a matrix calculation in 13 directions (Figure 6). The calculation of GLCM and GLRLM values was conducted in each of 13 directions, and we used the mean value of each calculation for statistical analysis [40, 41]. The distance between voxels was set as 1. The computer features were calculated from the equations in Table 1 in the GLCM and the GLRLM.

2.5. Statistical Analysis

Multiple regression analysis was performed to evaluate the effect of different scan parameters on the computer features of nodule phantoms (SPSS version 18.0, SPSS Inc., USA) [42, 43]. The dependent variables were 20 computer features, and independent variables were slice thickness (three variables), mAs (two variables), and reconstruction algorithm (two variables). For the statistical analysis, categorical variables were converted into dummy variables. We assumed that independent variables have a linear relationship with dependent variables. We performed an absolute effect size analysis to evaluate the difference in the change of the computer features according to the scan parameter. We calculated Cohen's d effect size by dividing the mean difference by their pooled standard deviation.

3. Results

The results of multiple regression analysis in 100 HU nodule phantoms are presented in Tables 2 and 3. In 95 sets of parameter comparison among 100 sets of comparison, computer features showed significant difference. Between different slice thicknesses, 19 computer features showed significant difference (). The only feature, dissimilarity, showed no difference between slice thicknesses of 5.0 mm and 2.5 mm () and also showed no difference between slice thicknesses of 2.5 mm and 1.25 mm (). Between 30 mAs and 120 mAs, 18 computer features showed significant difference (). Two features including kurtosis () and LGRE () showed no difference. Between different reconstruction algorithms, 19 computer features showed significant difference (), and correlation showed no difference (). In the absolute effect size analysis, the numbers of large effects, medium effects, and small effects were 19, 1, and 0 between 5.0 mm and 1.25 mm slice thickness, and 18, 1, and 1 between 5.0 mm and 2.5 mm slice thickness, and 15, 4, and 1 between 2.5 mm and 1.25 mm slice thickness, respectively. The numbers of large effects, medium effects, and small effects were 0, 13, and 7 between 30 mAs and 120 mAs, and 2, 16, and 2 between lung and standard reconstruction algorithms, respectively.

The results of multiple regression analysis in −630 HU nodule phantoms are presented in Tables 4 and 5. In 83 sets of parameter comparison among 100 sets of comparison, computer features showed significant difference. Eighteen computer features between slice thicknesses of 5.0 mm and 1.25 mm, 11 features between of 5.0 mm and 2.5 mm, and 17 features between 2.5 mm and 1.25 mm showed significant difference (). Between 30 mAs and 120 mAs, 18 computer features showed significant difference (). Two features including mean attenuation () and LGRE () showed no difference. Between different reconstruction algorithms, 19 computer features showed significant difference (), and LGRE showed no difference (). In the absolute effect size analysis, the number of large effects, medium effects, and small effects was 7, 11, and 2 between 5.0 mm and 1.25 mm slice thickness, and 6, 6, and 8 between 5.0 mm and 2.5 mm slice thickness, and 0, 18, and 2 between 2.5 mm and 1.25 mm slice thickness, respectively. The number of large effects, medium effects, and small effects were 6, 12, and 2 between 30 mAs and 120 mAs, and 19, 0, and 1 between lung and standard reconstruction algorithms, respectively.

In the regression analysis, the absolute value of regression coefficient (|RC|) can represent the scale of difference. |RC|s in all features in comparison between 5.0 mm and 1.25 mm were larger than |RC|s between 5.0 mm and 2.5 mm (Tables 2 and 4).

In 100 HU nodule phantoms, the maximum (Max), median (Med), and minimum (Min) values of |RC| were 0.541, 0.388, and 0.011 between 5.0 mm and 2.5 mm slice thickness, and 0.672, 0.574, and 0.135 between 5.0 mm and 1.25 mm slice thickness, respectively. Max, Med, and Min values of |RC| were 0.186, 0.079, and 0.011 between 30 mAs and 120 mAs and 0.271, 0.127, and 0.015 between lung and standard reconstruction algorithms, respectively (Table 2).

In −630 HU nodule phantoms, Max, Med, and Min values of |RC| were 0.247, 0.039, and 0.004 between 5.0 mm and 2.5 mm slice thickness, and 0.277, 0.122, and 0.018 between 5.0 mm and 1.5 mm slice thickness, respectively. Max, Med, and Min values of |RC| were 0.196, 0130, and 0.011 between 30 mAs and 120 mAs and 0.436, 0.224, and 0.011 between lung and standard reconstruction algorithms, respectively (Table 4).

The values of computer features in different slice thicknesses, mAs, and reconstruction algorithms are presented in Supplementary Materials 1 and 2. The computer features in 12 different sets of scan parameters are presented in Supplementary Materials 3 and 4.

4. Discussion

Our study showed that (a) in both of 100 HU and −630 HU nodule phantoms, differences in the scan parameters had a significant effect on almost all computer features with few exceptions, (b) in the 100 HU nodule phantoms, considering the Max and Med values of |RC|s between different slice thicknesses were larger than the Max and Med values of |RC|s between different mAs or algorithms, we speculate slice thickness had a greater effect than mAs or algorithm, and (c) in the −630 HU nodule phantoms, considering the Max and Med values of |RC|s between different algorithms were larger than the Max and Med values of |RC|s between different slice thicknesses or mAs, we speculate algorithm had a greater effect than slice thickness or mAs.

In this study, differences in the scan parameters had a significant effect on almost all computer features. These results indicate that noise or artifacts affected the attenuation and texture in the nodule, which indicates that the scan parameters are related to noise and artifact. Also, our results are consistent with several previous studies [38, 44, 45].

In the regression analysis, the |RC| can represent the scale of difference in feature values in different scan parameters. We found in this study that |RC|s in all features in comparison between 5.0 mm and 1.25 mm were larger than |RC|s between 5.0 mm and 2.5 mm. As the slice thickness becomes thinner, the noise increases. As the slice thickness becomes thicker, the noise decreases and greater effect of the partial volume effect [46]. Thus, a large difference in slice thickness will increase the impact of noise and partial volume effect. So it is considered that RC is shown larger in the comparison between 5.0 mm and 1.25 mm.

In the 100 HU nodule phantoms, we found slice thickness had a greater effect than mAs or algorithm. The CT image of 100 HU nodule phantoms has smaller noise than CT of −630 HU nodule phantom due to higher average attenuation and a smaller variation in the distribution of attenuation. Therefore, the 100 HU nodule phantom is more affected by the partial volume effect than the −630 HU nodule phantom. As the result, the slice thickness that is closely related to partial volume effect was the most influential parameter in the 100 HU nodule phantom.

We also found in this study that, in the 100 HU nodule phantoms, GLCM-dissimilarity showed no difference between slice thickness of 5.0 and 2.5 mm and between 2.5 mm and 1.25 mm. The weight of the dissimilarity increases linearly unlike other texture features which increases exponentially [47]. In the features in which the weights increase exponentially, a large difference may occur even if the difference of the image values is small. Since the weight of the dissimilarity increases linearly, the difference is relatively small when compared with other feature values. Therefore, the dissimilarity showed not statistically significant when difference of slice thickness was small such as 5.0 mm and 2.5 mm, and 2.5 mm and 1.25 mm, respectively.

In the −630 HU nodule phantoms, the reconstruction algorithm had a greater effect than slice thickness or mAs. The amount of noise in the −630 HU nodule phantom is greater than the amount in the 100 HU nodule phantom. The lung reconstruction algorithm usually makes higher noise level that the standard algorithm. The change of reconstruction algorithm from standard to lung algorithm made bigger increase of noise in the nodule with inherently higher level of noise, that is, −630 HU nodule.

In this study, in the −630 HU nodule phantoms, the effect of slice thickness was smaller than the effect of the reconstruction algorithm. We can notice that smaller difference of slice thickness might make smaller difference of noise, thus the statistical difference was not significant. On the other hand, due to higher average attenuation and a smaller variation in the distribution of attenuation, the CT image of 100 HU nodule phantoms was more affected by the difference of slice thickness that cause difference of partial volume effect.

CT images of the lung reconstruction algorithm contain higher noise level that the image of standard algorithm. Therefore, the change of the reconstruction algorithm can affect the features associated with CT histogram or CT texture. In this study, we found that change of the algorithms had significant effect on 19 computer features in the 100 and −630 HU nodule phantoms. Only a feature, that is, correlation, in 100 HU phantoms and LGRE in −630 HU nodule phantom showed no difference between lung and standard algorithms.

In this study, we found that most computer features showed significant difference between 30 mAs and 120 mAs. We speculate that this significant difference was originated by the change of noise level. We also found that LGRE showed no difference between 30 mAs and 120 mAs in both of 100 HU and −630 HU nodule phantoms. LGRE, that is, low gray level run emphasis, can be defined by distribution of run length in the low gray values. The value of LGRE is high when there is the large number of pixels with low gray level [37]. Considering the result that LGRE showed no difference between 30 mAs and 120 mAs, the number of pixels with low gray level has not significantly changed by the change of mAs.

This study demonstrated that the change of CT scan parameters can affect the quantitative CT features. In clinical studies involving deep learning or radiomics, it should be noted that differences in values can occur when using computer features obtained from different CT scan parameters in combination. Therefore, when interpreting the statistical analysis results, it is necessary to reflect the difference in the computer features depending on the scan parameters. In further studies, we need to develop methods for the standardization of computer features obtained from different scan parameters.

Data Availability

The data used to support the findings of this study are available from the corresponding author upon request.

Conflicts of Interest

The authors of this manuscript declare no relationships with any companies, whose products or services may be related to the subject matter of the article.

Authors’ Contributions

Hyun-Ju Lee and Kwnag Gi Kim contributed equally to this study.

Acknowledgments

This research was supported by Basic Science Research Program through the National Research Foundation of Korea (NRF) funded by the Ministry of Education (2017R1D1A1B04032467).

Supplementary Materials

Supplementary Material 1: the computer features according to the scanning parameters in the 100 HU nodule phantoms. Supplementary Material 2: the computer features according to the scanning parameters in the −630 HU nodule phantoms. Supplementary Material 3: the computer features in 12 different scan parameters in 100 HU nodule phantoms. Supplementary Material 4: the computer features in 12 different scan parameters in −630 HU nodule phantoms. (Supplementary Materials)