Open Access
15 February 2018 Influence of CT acquisition and reconstruction parameters on radiomic feature reproducibility
Abhishek Midya, Jayasree Chakraborty, Mithat Gönen, Richard K. G. Do M.D., Amber L. Simpson
Author Affiliations +
Abstract
High-dimensional imaging features extracted from diagnostic imaging, called radiomics, are increasingly reported for diagnosis, prognosis, and response to therapy. Establishing the sensitivity of radiomic features to variation in scan protocols is necessary because acquisition and reconstruction parameters can vary widely across and within institutions. Our objective was to assess the reproducibility of radiomic features derived from computed tomography (CT) images by varying tube current (mA), noise index, and reconstruction [adaptive statistical iterative reconstruction (ASiR)], parameters increasingly varied by institutions seeking to reduce radiation dose in their patients. We extracted radiomic features from CT images of a uniform water phantom, anthropomorphic phantom, and a human scan. Scans were acquired from the phantoms with six tube currents (50, 100, 200, 300, 400, and 500 mA) and five noise index levels (12, 14, 16, 18, and 20), respectively. Scans of the phantoms and patient were reconstructed from 0% ASiR (i.e., filtered back projection) to 100% ASiR in increments of 10%. Two hundred and forty-eight well-known radiomic features were extracted from all scans. The concordance correlation coefficient was used to assess agreement of features. Our analysis suggests that image acquisition parameters (tube current, noise index) as well as the reconstruction technique strongly influence radiomic feature reproducibility and demonstrate a subset of reproducible features potentially usable in clinical practice.

1.

Introduction

Improvements in the resolution of diagnostic imaging over a decade ago have created an abundance of retrospectively available images that contain information, which can be capitalized upon to create predictive treatment algorithms. Radiomics, the high-throughput extraction of imaging features from high-resolution images, has opened up new possibilities for the diagnosis, staging, and treatment stratification of patients with malignant pathologies.13 Extracting additional information from routine imaging is an attractive opportunity for clinicians looking for low-cost, objective, and noninvasive biomarkers for personalized cancer treatment. With respect to the liver, for example, publications have emerged linking quantitative imaging features to clinicopathological and outcome variables in single institution retrospective series.412 While clinically promising, the successful clinical implementation of radiomics as a trusted biomarker requires reproducibility experiments studying the effect of varying image acquisition and reconstruction parameters on imaging features.

Establishing the repeatability and reproducibility of radiomic features is necessary since computed tomography (CT) scan acquisition and reconstruction parameters can not only vary widely across institutions, but also within an institution, as protocol standardization is clinically challenging. Figure 1 shows an example of a portal venous phase CT scan of the liver acquired of the same patient at two institutions, 10 days apart. Despite the fact that attenuation (brightness) differences are observable in these images, little attention has been paid to the effect that imaging protocol variation has on radiomic features. In CT phantom studies, differences in slice thickness and reconstruction algorithm (standard versus lung) significantly influenced radiomic features.13 Recent work on reproducibility of features extracted from lung CT has demonstrated a subset of reproducible, informative, and nonredundant radiomic features in the presence of scan protocol differences2,1419 with similar findings observed for bone CT applications.20 However, lung cancers, contrary to liver cancers, are surrounded by air and imaged without intravenous contrast. Thus, liver-specific studies based on contrast-enhanced CT are needed to determine reproducibility for clinical use. Motivated by a study demonstrating the effect of slice thickness variation on radiomic feature performance assessed across noncontrast, arterial, and portal venous phases in liver CT,21 further study is warranted.

Fig. 1

Comparison of imaging protocol differences: portal venous phase CT acquired of the same patient 10 days apart.

JMI_5_1_011020_f001.png

Adaptive statistical iterative reconstruction (ASiR™, GE Healthcare, Waukesha, Wisconsin) is a noise reduction-based reconstruction algorithm introduced for CT in 2008. ASiR iteratively refines each pixel value measured with filtered back projection (FBP) to an idealized estimate, predicted by noise modeling.22 It is implemented in discrete levels by adjusting the percentage of ASiR blended with the FBP reconstruction of the image, which enables the user to control overall magnitude of noise in the image. Recent studies showed the degradation of spatial resolution due to ASiR,23 which affects radiographic performance.24,25 In addition, the amount of radiation dose has an effect on CT appearance; reduction of dose increases noise, which affects observers’ performance in disease detection.26,27 Automated tube current modulation along the z-axis optimizes the radiation dose to the patient while attempting to maintain a predetermined noise level throughout the scan volume. The noise level within the scan volume is determined by an operator preset noise index. The effect of ASiR on volumetric tumor measurements was recently assessed using a phantom (with known dimensions) designed to mimic the tissue properties of liver in imaging.28 Lower contrast of the lesion relative to the background parenchyma with increasing ASiR contributed to variability in volumetric measurements; however, the effect of ASiR on radiomic features is unknown. Figure 2 demonstrates the increase in blurring effect due to gradual increase in ASiR from 0% (pure FBP) to 100% ASiR in exemplar phantom and human scans.

Fig. 2

Changes in the appearance of image texture with increasing ASiR.

JMI_5_1_011020_f002.png

Although CT-based imaging features have been shown to be promising prognostic biomarkers for clinical use, the robustness of these features on systems equipped with iterative reconstruction algorithms and automated tube current modulation is unknown. Understanding the variability of radiomic features is necessary to use radiomics in clinical practice. Solomon et al.29 investigated the effect of radiation dose and reconstruction algorithm on imaging features derived from gray-level co-occurrence matrices (GLCM) and intensity histograms. Therefore, the present study aimed to determine the sensitivity of radiomic features to modulating tube currents, variable noise index, and reconstruction algorithms with many more imaging features commonly described in the quantitative imaging literature. We sought to perform a controlled study with phantoms and a human scan to motivate future prospective assessment. We chose to investigate the effect of tube current, noise index, and ASiR on imaging features, parameters increasingly varied by institutions looking to reduce radiation dose to their patients.

2.

Materials and Methods

2.1.

Image Data Sources

A uniform water phantom (UWP, GE Healthcare, Waukesha, Wisconsin) for CT image quality control and an adult-sized anthropomorphic dosimetry verification phantom (ATOM phantoms, CIRS, Norfolk, Virginia) were employed for the study. The UWP was attached to the end of the CT table and the anthropomorphic phantom was positioned similar to clinical scanning (Fig. 3). The UWP is made of acrylic and filled with water, whereas the anthropomorphic phantom consists of five simulated tissue types including the liver. The tissue types are engineered to produce photon attenuation values within 1% of those for real tissues for the bone and the soft tissue substitutes, and 3% for the lung tissue substitute over the range of 30 to 20,000 keV.30 Phantoms were employed so that we could systemically vary tube current and noise index, requiring multiple scans, which would be difficult to justify in patients due to the added radiation dose. A single abdominal CT scan of a patient was also included in the study so that we could study the effect of reconstruction parameters in a clinical setting.

Fig. 3

Phantoms used in study: (a) uniform water phantom and (b) anthropomorphic phantom reconstructed with FBP. Regions of interest included in analysis are outlined in green.

JMI_5_1_011020_f003.png

2.2.

CT Scan Acquisition and Reconstruction

All CT scans were acquired using a GE 64 slice multidetector scanner (GE Healthcare, Waukesha, Wisconsin) using the parameters listed in Table 1. The UWP scans were acquired at six different fixed tube currents: 50, 100, 200, 300, 400, and 500 mA, respectively. The anthropomorphic phantom was scanned at five noise index (NI) levels (12, 14, 16, 18, and 20). Modulating tube current values in the range [61 to 69], [99 to 104], [70 to 79], [52 to 63], and [42 to 49] were recorded from the DICOM header of the anthropomorphic phantom scans for NI 12, 14, 16, 18, and 20, respectively. A second data acquisition verified that the values increased at NI 14 indicating unexpected behavior not controlled in the experiment. A contrast-enhanced patient CT obtained as part of routine clinical management was also included. The CT image was acquired following the administration of 150-mL iodinated contrast (Omnipaque 300, GE Healthcare, New Jersey) at 4.0  mL/s with noise index of 14. CT reconstruction was performed with FBP and ASiR on all images. All images were reconstructed from FBP (0% ASiR) to 100% ASiR in increments of 10%. FBP is the most widely used reconstruction algorithm; hence, it was considered the gold standard reference image set in this study. The choice to vary mA, noise index, and ASiR was made based on our clinical experience that these acquisition and reconstruction parameters often vary across different institutions. At our cancer center, we tend to use lower noise indices than at outpatient CT facilities, for example, and apply lower ASiR levels. The wide variability in specific mA values investigated in this study may be greater than those used clinically (most sites use 100 to 300 mA) but included here for completeness. However, the choice of NI reflects clinical conditions at our institution.

Table 1

CT acquisition and reconstruction parameters.

ParameterUWPAnthropomorphic phantomHuman
GE modelDiscovery 750Discovery 750 HDDiscovery 750 HD
Detector rows646464
Scan modeHelicalHelicalHelical
kVp120120120
Display field of view (cm)254050
Filter typeHEADBODYBODY
Convolutional KernelSTANDARDSTANDARDSTANDARD
Axial pixel size (mm)0.480.780.87
Slice thickness (mm)555
Focal spot size (cm)0.71.20.7

2.3.

Image Segmentation

All images were subject to preprocessing prior to radiomic feature extraction. The central portion of the UWP and the anthropomorphic phantom were manually segmented using Scout Liver (Analogic Corporation, Peabody, Massachusetts) (Fig. 3). The image mask was propagated to all other scans such that all scans had the same segmentation applied. This was possible because the position of the phantoms remained fixed in the imaging unit for every scan. For the human scan, the liver region was semiautomatically segmented from the CT scan using Scout Liver (Fig. 4).

Fig. 4

Segmented liver region from human CT reconstructed with FBP used in analysis.

JMI_5_1_011020_f004.png

2.4.

Extraction of Radiomic Features

For each of the segmented regions of interest, we extracted standard radiomic features that broadly describe variation in CT enhancement patterns (i.e., heterogeneity), which are well described in the image processing literature. Briefly, these features can be categorized as first-order features based on the intensity histogram and second-order texture-based features based on spatial variation in pixel intensity. Intensity histogram (IH) features included mean, standard deviation, skewness, kurtosis, and entropy. Texture features included GLCM, run-length matrices (RLM), local binary patterns (LBP), fractal dimension analysis (FD), and angle co-occurrence matrices (ACM).3141 Radiomic features are listed in Table 2.

Table 2

Image features set investigated in this study.

A. GLCM (19 features)2. A12 (Contrast)
1. G1 (Energy)59. L59 (Frequency of 58th bin of ULBP)3. A13 (Correlation)
2. G2 (Contrast)60. L60 (Frequency of 0th bin of RI-LBP)4. A14 (Sum of squares)
3. G3 (Correlation)61. L61 (Frequency of 1st bin of RI-LBP)5. A15 (Inverse difference moment)
4. G4 (Sum of squares)6. A16 (Sum average)
5. G5 (Inverse difference moment)69. L69 (Frequency of 9th bin of RI-LBP)7. A17 (Sum variance)
6. G6 (Sum average)70. L70 (Standard deviation (SD) of ULBP)8. A18 (Entropy)
7. G7 (Sum variance)71. L71 (Skewness of ULBP)9. A19 (Difference variance)
8. G8 (Entropy)72. L72 (Kurtosis of ULBP)10. A110 (Sum entropy)
9. G9 (Difference variance)73. L73 (Entropy of RI-ULBP)11. A111 (Difference entropy)
10. G10 (Sum entropy)74. L74 (SD of RI-ULBP)12. A112 (Information-theoretic
11. G11 (Difference entropy)75. L75 (Skewness of RI-ULBP)measures of correlation 1)
12. G12 (Information-theoretic measures of correlation 1)76. L76 (Kurtosis of RI-ULBP)13. A113 (Information-theoretic measures of correlation 2)
13. G13 (Information-theoretic measures of correlation 2)77. L77 (Entropy of RI-ULBP)
14. G14 (Maximum correlation coefficient)78. L78 (SD of LBP)14. A114 (Maximum correlation coefficient)
15. G15 (Inertia)79. L79 (Skewness of LBP)15. A115 (Inertia)
16. G16 (Cluster shade)80. L80 (Kurtosis of LBP)16. A116 (Cluster shade)
17. G17 (Cluster prominence)81. L81 (Entropy of LBP)17. A117 (Cluster prominence)
18. G18 (Renyi entropy)82. L82 (SD of RI-LBP)18. A118 (Renyi entropy)
19. G19 (Tsallis entropy)83. L83 (Skewness of RI-LBP)19. A119 (Tsallis entropy)
B. RLM (11 features)84. L84 (Kurtosis of RI-LBP)G. ACM2 (19 features)
1. R1 (Short run emphasis)85. L85 (Entropy of RI-LBP)1. A21 (Energy)
2. R2 (Long run emphasis)86. L86 (SD of rotated LBP)2. A22 (Contrast)
3. R3 (Gray-level nonuniformity)87. L87 (Skewness of LBP)3. A23 (Correlation)
4. R4 (Run length nonuniformity)88. L88 (Kurtosis of LBP)4. A24 (Sum of squares)
5. R5 (Run percentage)89. L89 (Entropy of LBP)5. A25 (Inverse difference moment)
6. R6 (Low gray-level run emphasis)90. L90 (0th frequency coefficient of RI-ULBP Fourier spectrum6. A26 (Sum average)
7. R7 (High gray-level run emphasis)7. A27 (Sum variance)
8. R8 (Short run low gray-level emphasis)127. L127 (37th frequency coefficient of RI-ULBP Fourier Spectrum8. A28 (Entropy)
9. R9 (Short run high gray-level emphasis)E. FD (48 features)9. A29 (Difference variance)
10. R10 (Run low gray-level emphasis)1. F1 (FD from 1st binary image from SFTA10. A210 (Sum entropy)
11. R11 (Long run high gray-level emphasis)11. A211 (Difference entropy)
C. IH (5 features)16. F16 (FD from 16th binary image from SFTA12. A212 (Information-theoretic measures of correlation 1)
1. I1 (Mean)17. F17 (Mean gray value from 1st binary image from SFTA)
2. I2 (Standard deviation)13. A213 (Information-theoretic measures of correlation 2)
3. I3 (Skewness)32. F32 (Mean gray value from 16th binary image from SFTA)14. A214 (Maximum correlation coefficient)
4. I4 (Kurtosis)33. F33 (Pixel count from 1st binary image from SFTA)15. A215 (Inertia)
5. Entropy16. A216 (Cluster shade)
D. LBP (127 features)48. F48 (Pixel count from 16th binary image from SFTA)17. A217 (Cluster prominence)
1. L1 (Frequency of 0th bin of ULBP)F. ACM1 (19 features)18. A218 (Renyi entropy)
2. L2 (Frequency of 1st bin of ULBP)1. A11 (Energy)19. A219 (Tsallis entropy)

In total, 248 features were extracted using MATLAB (MathWorks, Natick, Massachusetts). Averaging the feature values over all slices yielded one value for each feature for each image.

2.5.

Statistical Analysis

First the effect of tube current (or noise index) on radiomic features was investigated by varying the tube current (or noise index) for a specific reconstruction method. We then studied the effect of reconstruction method on radiomic features by keeping the acquisition parameters constant. When assessing the agreement in radiomic features with reconstruction algorithms (i.e., different ASiR levels), the FBP dataset was used as the reference set.

Lin’s concordance correlation coefficient (CCC),42 a measure of concordance or agreement between two measurements, was used to determine sensitivity of features to variations in image noise, tube current, and reconstruction method. CCC ranges from 0 to 1 (correlated), with perfect agreement at 1. CCC is defined as

Eq. (1)

CCC=|1(xy)2σx2+σy2+(μxμy)2|,
where x is a vector containing the value of a particular feature for all cases in the first observation and y is a vector containing the value of the same radiomic feature in the second observation; σx2 and σy2 are the variances; and μx and μy are the means of each vector. When assessing the agreement in radiomic features with reconstruction algorithms, the FBP dataset was used as the reference set. We defined features with an absolute CCC0.90 as being reproducible.43 We also report the features with CCC0.85 and 0.95. Statistical analysis was performed with MATLAB.

3.

Results

3.1.

Effect of Tube Current and Noise Index on Radiomic Features

The number and percentage of reproducible radiomic features (CCC0.9) extracted from the UWP while varying the tube currents (50 versus 100 mA, 200 versus 300 mA, 400 versus 500 mA, 100 versus 400 mA, 100 versus 500 mA, and 200 versus 500 mA) are listed in Table 3. In general, variation in tube current had substantial effect on CCC with only 20 (8%) and 44 (18%) reproducible radiomic features for 50 versus 100 mA and 200 versus 300 mA, respectively. This is likely because lower tube current is associated with more random noise, which results in increased pixel-by-pixel intensity variability. With increased tube current (400 versus 500 mA), 63 (25%) of the radiomic features were reproducible suggesting that increased tube current (and corresponding reduction in random noise) results in an increase in the uniformly distributed pixels. The number and percentage of reproducible radiomic features extracted from the anthropomorphic phantom while varying the noise indices (NI 12 versus 14, NI 14 versus 16, NI 16 versus 18, NI 12 versus 18, NI 12 versus 20, and NI 14 versus 20) are listed in Table 4. In general, change in noise indices, resulted in variation in the number of reproducible features: 49 (20%), 62 (25%), and 47 (19%) of radiomics features were reproducible for NI 12 versus 14, NI 14 versus 16, and NI 16 versus 18, respectively.

Table 3

Total number (percentage) of reproducible features out of 248 radiomic features for UWP with FBP reconstruction.

50 versus 100 mA200 versus 300 mA400 versus 500 mA100 versus 400 mA100 versus 500 mA200 versus 500 mA
CCC0.9513 (5)26 (10)27 (11)7 (3)7 (3)11 (4)
CCC0.9020 (8)44 (18)63 (25)18 (7)10 (4)17 (7)
CCC0.8525 (10)58 (23)74 (30)24 (10)12 (5)18 (7)

Table 4

Total number (percentage) of reproducible features out of 248 radiomic features for anthropomorphic phantom with FBP reconstruction.

NI 12 versus 14NI 14 versus 16NI 16 versus 18NI 12 versus 18NI 12 versus 20NI 14 versus 20
CCC0.9534 (14)56 (23)30 (12)29 (12)31 (13)35 (14)
CCC0.9049 (20)62 (25)47 (19)43 (17)40 (16)53 (21)
CCC0.8557 (23)69 (28)53 (21)54 (22)53 (21)61 (25)

The changes in appearance of image texture with varying mA and NI are shown in Fig. 5. We also note that values of radiomic features change approximately linearly with increased ASiR. Select exemplar GLCM features for the UWP and anthropomorphic phantom are shown in Fig. 6.

Fig. 5

CT pixel pattern appearance variation with increasing (a)–(d) tube current and (e)–(h) noise index in the UWP and anthropomorphic phantom, respectively. The circle in the anthropomorphic phantom is a rod designed to hold dosimeter.

JMI_5_1_011020_f005.png

Fig. 6

Change in radiomic feature values over different NI and mA.

JMI_5_1_011020_f006.png

3.2.

Effect of Reconstruction Algorithm on Radiomic Features

The number and percentage of reproducible radiomic features extracted from the UWP, anthropomorphic phantom, and human scan for increasing ASiR levels are listed in Table 5. In all three datasets, the number of reproducible features decreased with increasing ASiR with respect to FBP. Figure 7 demonstrates the increase in blurring due to gradual increase in ASiR from 0% (pure FBP) to 100% ASiR in all three datasets. We also note that values of radiomic features change approximately linearly with increased ASiR. Selected GLCM features for all three datasets are shown in Fig. 8.

Table 5

Total number (percentage) of reproducible features out of 248 features for UWP, anthropomorphic phantom, and human for different ASiR levels. Feature values are averaged all over tube currents and noise indices for the UWP and anthropomorphic phantom, respectively.

ASiRUWPAnthropomorphic phantomHuman
CCC≥0.85CCC≥0.90CCC≥0.95CCC≥0.85CC≥0.90CCC≥0.95CCC≥0.85CCC≥0.90CCC≥0.95
10115 (46)96 (39)60 (24)232 (94)223 (90)197 (79)230 (93)219 (88)193 (78)
20110 (44)81 (33)45 (18)227 (92)217 (88)185 (75)222 (90)209 (84)183 (74)
30100 (40)78 (31)45 (18)218 (88)197 (79)164 (66)213 (86)193 (78)156 (63)
4097 (39)75 (30)45 (18)208 (84)180 (73)148 (60)209 (84)183 (74)145 (58)
5082 (33)63 (25)40 (16)200 (81)174 (70)144 (58)186 (75)161 (65)124 (50)
6081 (33)60 (24)34 (14)185 (75)165 (67)133 (54)166 (67)145 (58)107 (43)
7072 (29)50 (20)32 (13)177 (71)155 (63)127 (51)153 (62)134 (54)94 (38)
8055 (22)44 (18)27(11)162 (65)144 (58)122 (49)141 (57)117 (47)73 (29)
9051 (21)32 (13)26 (10)158 (64)140 (56)107 (43)131 (53)110 (44)58 (23)
10047 (19)29 (12)26 (10)152 (61)128 (52)102 (41)119 (48)95 (38)47 (19)

Fig. 7

CT pixel pattern appearance variation with increasing ASiR levels in (a)–(d) UWP, (e)–(h) anthropomorphic phantom, and (i)–(l) human scan, respectively.

JMI_5_1_011020_f007.png

Fig. 8

Change in radiomic feature values over different ASiR levels.

JMI_5_1_011020_f008.png

CCC calculated between FBP and increasing ASiR levels from 10% to 100% for all radiomic features extracted from UWP, anthropomorphic phantom, and human are provided in Figs. 911, respectively, at different tube currents and NI, as available. Specifically, in Fig. 9, the x-axis is partitioned into the available tube currents and within each tube current; the CCC for each ASiR level relative to FBP is shown. Similarly, in Fig. 10, the x-axis is partitioned into the available noise indices and within each noise index; the CCC for each ASiR level relative to FBP is shown. In Fig. 11, CCC for each ASiR level is shown since we did not vary the noise index or tube current in the human scan.

Fig. 9

Effect of tube current and ASiR level on radiomic features extracted from UWP. (a) The x-axis is partitioned into the available tube currents and within each tube current; the CCC for each ASiR level relative to FBP is calculated. (b) CCC is calculated for each mA with mA=200 and FBP reconstruction as the reference standard.

JMI_5_1_011020_f009.png

Fig. 10

Effect of tube current and ASiR level on radiomic features extracted from anthropomorphic phantom. (a) The x-axis is partitioned into the available noise indices and within each noise index; the CCC for each ASiR level relative to FBP is calculated. (b) CCC is calculated for each NI with NI=14 and FBP reconstruction as the reference standard.

JMI_5_1_011020_f010.png

Fig. 11

Effect of the ASiR level on radiomic features extracted from the human scan such that the CCC for each ASiR level relative to FBP is calculated.

JMI_5_1_011020_f011.png

3.3.

Reproducible Radiomic Features

The specific radiomic features with CCC0.90 for different image acquisition and reconstruction parameters are listed in Table 6. In general, reproducibility of radiomic features was influenced by acquisition parameters (noise index and tube current) more so than ASiR. Studying the effect of variation in noise index and tube current requires multiple scans, which would expose patients to additional radiation. Therefore, we studied these variations in phantoms to motivate prospective evaluation in a clinical trial with patient consent.

Table 6

Reproducible features (CCC≥0.9).

UWPAnthropomorphic phantomHuman
All mA (FBP)All ASiR (500 mA)All NI (FBP)All ASiR (NI 12)All ASiR
1. G61. G61. G31. G151. LBP44101. ACM121. G351. FD2
2. G162. G142. G62. G352. LBP45102. ACM142. G452. FD3
3. G173. G163. G83. G453. LBP46103. ACM153. G653. FD4
4. RLM14. G174. G124. G554. LBP47104. ACM164. G754. FD5
5. RLM25. RLM15. G135. G655. LBP48105. ACM175. G855. FD6
6. RLM86. RLM26. G146. G756. LBP49106. ACM186. G1356. FD8
7. RLM97. RLM87. G167. G857. LBP52107. ACM197. G1457. FD11
8. RLM108. RLM98. G178. G958. LBP54108. ACM1108. G1658. FD17
9. IH19. RLM109. G189. G1259. LBP55109. ACM1119. G1759. FD20
10. LBP510. IH110. RLM110. G1360. LBP56110. ACM11510. RLM160. FD23
11. IH311. RLM211. G1461. LBP57111. ACM11611. RLM261. FD26
12. IH412. RLM712. G1662. LBP78112. ACM11712. RLM362. FD29
13. LBP513. RLM813. G1763. LBP84113. ACM11813. RLM463. FD37
14. LBP1514. RLM1014. G1864. LBP86114. ACM11914. RLM564. FD38
15. LBP1615. RLM1115. G1965. LBP87115. ACM2115. IH165. FD41
16. LBP2616. IH116. RLM166. LBP88116. ACM2216. IH266. FD44
17. LBP3317. IH317. RLM267. LBP89117. ACM2417. IH367. FD47
18. LBP4018. FD218. RLM868. LBP92118. ACM2518. IH468. ACM11
19. LBP4419. FD519. IH169. LBP93119. ACM2619. IH569. ACM12
20. LBP4920. FD720. IH270. LBP94120. ACM2720. LBP170. ACM13
21. LBP9721. FD821. IH371. LBP95121. ACM2821. LBP271. ACM14
22. LBP9822. FD922. IH572. LBP96122. ACM2922. LBP1172. ACM15
23. FD223. FD1023. LBP73. LBP97123. ACM21023. LBP1273. ACM16
24. ACM1224. FD1124. LBP774. LBP98124. ACM21124. LBP1674. ACM18
25. ACM1425. FD1225. LBP1075. LBP100125. ACM21525. LBP2075. ACM19
26. ACM1526. FD1426. LBP1176. LBP101126. ACM21626. LBP3976. ACM110
27. ACM1627. FD1627. LBP1277. LBP112127. ACM21727. LBP4477. ACM111
28. ACM1728. FD1728. LBP1378. LBP113128. ACM21828. LBP4778. ACM112
29. ACM1829. FD1829. LBP1479. LBP117129. ACM21929. LBP4879. ACM113
30. ACM11030. FD2130. LBP1580. FD130. LBP5780. ACM114
31. ACM11131. FD2631. LBP1681. FD531. LBP5881. ACM115
32. ACM11532. FD2832. LBP1782. FD832. LBP5982. ACM116
33. ACM2233. FD2933. LBP1883. FD1033. LBP6083. ACM117
34. ACM2434. FD3134. LBP1984. FD1134. LBP6484. ACM118
35. ACM2535. FD3235. LBP2085. FD1435. LBP6785. ACM119
36. ACM2636. FD3536. LBP2186. FD1736. LBP6886. ACM23
37. ACM2737. FD3837. LBP2387. FD2037. LBP6987. ACM24
38. ACM2838. FD4138. LBP2588. FD2338. LBP7688. ACM25
39. ACM21039. FD4439. LBP2689. FD2439. LBP7889. ACM26
40. ACM21140. FD4540. LBP2790. FD2640. LBP7990. ACM28
41. ACM21541. LBP2891. FD2941. LBP8091. ACM212
42. ACM21842. LBP3392. FD3142. LBP8292. ACM213
43. LBP3493. FD3243. LBP8393. ACM214
44. LBP3694. FD3544. LBP8494. ACM216
45. LBP3795. FD3845. LBP8595. ACM217
46. LBP3896. FD4146. LBP86
47. LBP3997. FD4447. LBP92
48. LBP4098. FD4748. LBP93
49. LBP4299. FD4849. LBP94
50. LBP43100. ACM1150. FD1

4.

Discussion

We approached our reproducibility study with the intention of varying NI, ASiR, and mA as it is increasingly common for institutions to set goals for dose reduction, to prevent unnecessary radiation to their patients. Dose reduction may be achieved by increasing the noise index or by applying different types of tube current modulation, which subsequently affect mA. As the noise levels increase with decreasing dose, it is also common for institutions to apply ASiR or other reconstruction methods to varying degrees to improve image quality. The concern with dose reduction, combined with the observed variability in clinical practice across institutions, motivated our approach to investigate these specific imaging parameters, within the constraints of our clinical GE CT scanners.

Our data suggest that image acquisition parameters relating to image noise [tube current, noise index, and reconstruction (ASiR)] strongly influence radiomic feature reproducibility (see Tables 35). Random noise in CT images affects the pixel-by-pixel intensity variability, thereby influencing the reproducibility of features that are based on the spatial distribution of pixel intensities. An increase in the tube current reduces the image noise, resulting in more reproducible radiomic features. For example, only 20 reproducible features (8% of all features) were observed when features extracted at 50 mA were compared with the same features extracted at 100 mA, whereas 63 features (25%) were reproducible when the comparison was performed between 400 and 500 mA. Increased ASiR led to blurring, resulting in fewer reproducible features when compared with FBP (the reference standard). 219 (88%), 161 (65%), and 95 (38%) features were reproducible at 10%, 50%, and 100% ASiR, respectively. ASiR initially uses information obtained from the FBP algorithm to initiate image reconstruction and focuses on noise reduction primarily through statistical modeling of the system. The statistical model transforms the measured value of each pixel to a new estimate of the pixel value. The new estimate is then compared against the ideal pixel value that the noise model predicts. The process iterates until the final estimated and ideal pixel values converge.22 This modification of the pixel-by-pixel values likely explains the poor concordance observed between features extracted from images with increasing ASiR application. These results are supported by a recent study comparing GLCM and IH features obtained from FBP, 50% ASiR, and model-based iterative reconstruction images.29 Our institution uses 20% ASiR in imaging protocols, but this value varies widely across institutions;44 hence, care should be taken in the extraction of radiomic features for prognostication in the presence of variable scan protocols.

The UWP is filled with uniform material (i.e., water), but the reconstructed CT images are not perfectly uniform in terms of intensity values. The reconstructed CT images of the UWP have random noise without any texture due to the absence of structures. It should also be noted that in Figs. 8(a)8(c), GLCM homogeneity-based features (e.g., correlation) have the lowest values for UWP due to greater local random noise present in the UWP images. However, with increasing ASiR, as images became smoother, the homogeneity features values gradually increased for (a), (b), and (c). The GLCM features shown in Figs. 8(d)8(f) represent heterogeneity (e.g., entropy), and demonstrate higher values in more locally heterogeneous images.

We evaluated the reproducibility of 248 radiomic features. Many of the first-order intensity histogram-based features (features I1 to I5 in Figs. 911) were reproducible as ASiR maintains the mean and standard deviation of the image histogram by definition. GLCM (G1 to G19) and RLM features (R1 to R11) were consistently reproducible with increasing ASiR; however, concordance decreased with extreme ASiR application. RLM features count consecutive pixels with the same intensity so large variation in pixel intensity will have significant effect. LBP and FD features (L1 to L127 and F1 to F48, respectively) encode structural texture information; increased blurring with increased ASiR will affect spatial distribution of pixel intensities so LBP and FD features were less reproducible than GLCM features. ACM-based features (A11 to A119 and A21 to A219) were robust to variation in ASiR. ACM features are likely less sensitive to image noise because these features are computed using gradient magnitude and orientation information instead of spatial distribution of pixel intensities. With respect to modulations in tube current, few features showed robustness to large changes in mA (Fig. 9); however, for smaller modulations, FD, GLCM, and RLM-based features were robust (Fig. 10) suggesting that care should be taken when comparing features derived from CT scans with vastly different tube currents. Figures 9 and 10 show CCC variation with different image acquisition and reconstruction parameters. Noise index (tube current) affects reproducibility of features more so than ASiR variation. Figure 6 shows feature values over different acquisition conditions for a single patient. Therefore, it is difficult to relate Figs. 9 and 10 to Fig. 6.

There are several limitations to this study. First, studies evaluating the effects of modulation of tube current and noise index on radiomic features require additional scans; hence, phantoms were employed. However, the UWP is circular, filled with liquid water and not representative of patient anatomy. To address this issue, we used an anthropomorphic phantom, but when compared with actual patient anatomy, the phantom images are uniform in content, without organs and without artifacts observed during patient image acquisition procedures. Both phantoms lack texture in imaging (see Fig. 12), underscoring the need for phantoms that mimic the tissue properties of contrast-enhanced abdominal CT. Although we included one human scan reconstructed with increasing ASiR, prospective evaluation in patients with multiple tube currents or NI levels is required. Second, radiomic features were analyzed on the basis of one segmented region of interest. Different segmentation strategies employed by various studies can affect the quantification of radiomic features and their reproducibility. Another limitation of this study was that only GE scanners were used. Hence, only variability with one manufacturer’s reconstruction algorithms and automatic exposure control modes was studied. Other acquisition parameters such as helical pitch and contrast timing likely affect the reproducibility of features. Future work includes prospective evaluation in liver cancer patients with scans acquired at variable NI and with different reconstruction algorithms and contrast timings. Reproducibility of individual radiomic features as well as prediction model variability will be studied. Despite these limitations, our findings underscore the influence of noise on radiomic features and demonstrate reproducible features potentially usable for clinical decision making.

Fig. 12

Magnified regions from the UWP, anthropomorphic phantom, and human scans demonstrating lack of textural appearance in the phantoms versus the human scan.

JMI_5_1_011020_f012.png

5.

Conclusion

The present study demonstrates that image noise plays an important role in the reproducibility of radiomic features. Specifically, variation in image noise due to dose reduction algorithms, tube current, and noise index significantly affects reproducibility of radiomic features. Prospective evaluation across multiple centers, preferably with human subjects, is needed.

Disclosures

The authors have nothing to disclose.

Acknowledgments

This research was funded in part through the NIH/NCI Cancer Center Support Grant No. P30 CA008748.

References

1. 

H. J. W. L. Aerts et al., “Decoding tumour phenotype by noninvasive imaging using a quantitative radiomics approach,” Nat. Commun., 5 4006 (2014). https://doi.org/10.1038/ncomms5006 NCAOBW 2041-1723 Google Scholar

2. 

V. Kumar et al., “Radiomics: the process and the challenges,” Magn. Reson. Imaging, 30 (9), 1234 –1248 (2012). https://doi.org/10.1016/j.mri.2012.06.010 MRIMDQ 0730-725X Google Scholar

3. 

P. Lambin et al., “Radiomics: extracting more information from medical images using advanced feature analysis,” Eur. J. Cancer, 48 (4), 441 –446 (2012). https://doi.org/10.1016/j.ejca.2011.11.036 EJCAEL 0959-8049 Google Scholar

4. 

B. Ganeshan et al., “Dynamic contrast-enhanced texture analysis of the liver,” Invest. Radiol., 46 (3), 160 –168 (2011). https://doi.org/10.1097/RLI.0b013e3181f8e8a2 INVRAV 0020-9996 Google Scholar

5. 

M. G. Lubner et al., “CT textural analysis of hepatic metastatic colorectal cancer: pre-treatment tumor heterogeneity correlates with pathology and clinical outcomes,” Abdom. Imaging, 40 (7), 2331 –2337 (2015). https://doi.org/10.1007/s00261-015-0438-4 ABIMEL 1432-0509 Google Scholar

6. 

K. A. Miles et al., “Colorectal cancer: texture analysis of portal phase hepatic CT images as a potential marker of survival,” Radiology, 250 (2), 444 –452 (2009). https://doi.org/10.1148/radiol.2502071879 RADLAX 0033-8419 Google Scholar

7. 

S.-X. Rao et al., “CT texture analysis in colorectal liver metastases: a better way than size and volume measurements to assess response to chemotherapy?,” United Eur. Gastroenterol. J., 4 (2), 257 –263 (2016). https://doi.org/10.1177/2050640615601603 Google Scholar

8. 

S.-X. Rao et al., “Whole-liver CT texture analysis in colorectal cancer: does the presence of liver metastases affect the texture of the remaining liver?,” United Eur. Gastroenterol. J., 2 (6), 530 –538 (2014). https://doi.org/10.1177/2050640614552463 Google Scholar

9. 

A. L. Simpson et al., “Texture analysis of preoperative CT images for prediction of postoperative hepatic insufficiency: a preliminary study,” J. Am. Coll. Surg., 220 (3), 339 –346 (2015). https://doi.org/10.1016/j.jamcollsurg.2014.11.027 JACSEX 1072-7515 Google Scholar

10. 

A. L. Simpson et al., “Computed tomography image texture: a noninvasive prognostic marker of hepatic recurrence after hepatectomy for metastatic colorectal cancer,” Ann. Surg. Oncol., 24 2482 –2490 (2017). https://doi.org/10.1245/s10434-017-5896-1 Google Scholar

11. 

Z. Wu et al., “Hepatitis C related chronic liver cirrhosis: feasibility of texture analysis of MR images for classification of fibrosis stage and necroinflammatory activity grade,” PloS One, 10 (3), e0118297 (2015). https://doi.org/10.1371/journal.pone.0118297 POLNCL 1932-6203 Google Scholar

12. 

J. Zheng et al., “Preoperative prediction of microvascular invasion in hepatocellular carcinoma using quantitative image analysis,” J. Am. Coll. Surg., 19 S48 (2017). https://doi.org/10.1016/j.hpb.2017.02.035 JACSEX 1072-7515 Google Scholar

13. 

B. Zhao et al., “Exploring variability in CT characterization of tumors: a preliminary phantom study,” Transl. Oncol., 7 88 –93 (2014). https://doi.org/10.1593/tlo.13865 Google Scholar

14. 

Y. Balagurunathan et al., “Reproducibility and prognosis of quantitative features extracted from CT images,” Transl. Oncol., 7 (1), 72 –87 (2014). https://doi.org/10.1593/tlo.13844 Google Scholar

15. 

X. Fave et al., “Can radiomics features be reproducibly measured from CBCT images for patients with non-small cell lung cancer?,” Med. Phys., 42 6784 –6797 (2015). https://doi.org/10.1118/1.4934826 MPHYA6 0094-2405 Google Scholar

16. 

L. A. Hunter et al., “High quality machine-robust image features: identification in nonsmall cell lung cancer computed tomography images,” Med. Phys., 40 (12), 121916 (2013). https://doi.org/10.1118/1.4829514 MPHYA6 0094-2405 Google Scholar

17. 

P. Lo et al., “Variability in CT lung-nodule quantification: effects of dose reduction and reconstruction methods on density and texture based features,” Med. Phys., 43 4854 –4865 (2016). https://doi.org/10.1118/1.4954845 MPHYA6 0094-2405 Google Scholar

18. 

L. Lu et al., “Assessing agreement between radiomic features computed for multiple CT imaging settings,” PLoS One, 11 e0166550 (2016). https://doi.org/10.1371/journal.pone.0166550 POLNCL 1932-6203 Google Scholar

19. 

B. Zhao et al., “Reproducibility of radiomics for deciphering tumor phenotype with imaging,” Sci. Rep., 6 23428 (2016). https://doi.org/10.1038/srep23428 SRCEC3 2045-2322 Google Scholar

20. 

P. Guggenbuhl et al., “Reproducibility of CT-based bone texture parameters of cancellous calf bone samples: influence of slice thickness,” Eur. J. Radiol., 67 514 –520 (2008). https://doi.org/10.1016/j.ejrad.2007.08.003 EJRADR 0720-048X Google Scholar

21. 

D. Duda, M. Kretowski and J. Bezy-Wendling, “Effect of slice thickness on texture-based classification of liver dynamic CT scans,” Lect. Notes Comput. Sci., 8104 96 –107 (2013). https://doi.org/10.1007/978-3-642-40925-7 LNCSD9 0302-9743 Google Scholar

22. 

L. L. Geyer et al., “State of the art: iterative CT reconstruction techniques,” Radiology, 276 (2), 339 –357 (2015). https://doi.org/10.1148/radiol.2015132766 RADLAX 0033-8419 Google Scholar

23. 

C. H. McCollough et al., “Degradation of CT low-contrast spatial resolution due to the use of iterative reconstruction and reduced dose levels,” Radiology, 276 499 –506 (2015). https://doi.org/10.1148/radiol.15142047 RADLAX 0033-8419 Google Scholar

24. 

G. Saiprasad et al., “Evaluation of low-contrast detectability of iterative reconstruction across multiple institutions, CT scanner manufacturers, and radiation exposure levels,” Radiology, 277 124 –133 (2015). https://doi.org/10.1148/radiol.2015141260 RADLAX 0033-8419 Google Scholar

25. 

C.-I. Shin et al., “Ultra-low peak voltage CT colonography: effect of iterative reconstruction algorithms on performance of radiologists who use anthropomorphic colonic phantoms,” Radiology, 273 (3), 759 –771 (2014). https://doi.org/10.1148/radiol.14140192 RADLAX 0033-8419 Google Scholar

26. 

J. G. Fletcher et al., “Observer performance in the detection and classification of malignant hepatic nodules and masses with CT image-space denoising and iterative reconstruction,” Radiology, 276 465 –478 (2015). https://doi.org/10.1148/radiol.2015141991 RADLAX 0033-8419 Google Scholar

27. 

A. Padole et al., “Assessment of filtered back projection, adaptive statistical, and model-based iterative reconstruction for reduced dose abdominal computed tomography,” J. Comput. Assisted Tomogr., 39 (4), 462 –467 (2015). https://doi.org/10.1097/RCT.0000000000000231 JCATD5 0363-8715 Google Scholar

28. 

Q. Li et al., “The effects of iterative reconstruction in CT on low-contrast liver lesion volumetry: a phantom study,” Proc. SPIE, 10134 101340Z (2017). https://doi.org/10.1117/12.2255743 PSISDG 0277-786X Google Scholar

29. 

J. Solomon et al., “Quantitative features of liver lesions, lung nodules, and renal stones at multi? Detector row CT examinations: dependency on radiation dose and reconstruction algorithm,” Radiology, 279 (1), 185 –194 (2016). https://doi.org/10.1148/radiol.2015150892 RADLAX 0033-8419 Google Scholar

30. 

“Atom dosimetry verification phantoms,” (2015). Google Scholar

31. 

R. M. Haralick, K. Shanmugam and I. Dinstein, “Textural features for image classification,” IEEE Trans. Syst. Man Cybern., SMC-3 (6), 610 –621 (1973). https://doi.org/10.1109/TSMC.1973.4309314 Google Scholar

32. 

L. K. Soh and C. Tsatsoulis, “Texture analysis of SAR sea ice imagery using gray level co-occurrence matrices,” IEEE Trans. Geosci. Remote Sens., 37 (2), 780 –795 (1999). https://doi.org/10.1109/36.752194 IGRSD2 0196-2892 Google Scholar

33. 

D. A. Clausi, “An analysis of co-occurrence texture statistics as a function of grey level quantization,” Can. J. Remote Sens., 28 (1), 45 –62 (2002). https://doi.org/10.5589/m02-004 CJRSDP 0703-8992 Google Scholar

34. 

X. Tang, “Texture information in run-length matrices,” IEEE Trans. Image Process., 7 (11), 1602 –1609 (1998). https://doi.org/10.1109/83.725367 IIPRE4 1057-7149 Google Scholar

35. 

M. M. Galloway, “Texture analysis using gray level run lengths,” Comput. Graphics Image Process., 4 (2), 172 –179 (1975). https://doi.org/10.1016/S0146-664X(75)80008-6 Google Scholar

36. 

O. S. Al-Kadi and D. Watson, “Texture analysis of aggressive and nonaggressive lung tumor CE CT images,” IEEE Trans. Biomed. Eng., 55 (7), 1822 –1830 (2008). https://doi.org/10.1109/TBME.2008.919735 IEBEAX 0018-9294 Google Scholar

37. 

N. Sarkar and B. B. Chauduri, “Efficient differential box-counting approach to compute fractal dimension of image,” IEEE Trans. Syst. Man Cybern., 24 (1), 115 –120 (1994). https://doi.org/10.1109/21.259692 Google Scholar

38. 

T. Ojala, M. Pietikäinen and D. Harwood, “A comparative study of texture measures with classification based on featured distributions,” Pattern Recognit., 29 (1), 51 –59 (1996). https://doi.org/10.1016/0031-3203(95)00067-4 Google Scholar

39. 

M. Pietikäinen et al., Local Binary Patterns for Still Images, 13 –47 Springer, London (2011). Google Scholar

40. 

J. Chakraborty et al., “Statistical measures of orientation of texture for the detection of architectural distortion in prior mammograms of interval-cancer,” J. Electron. Imaging, 21 (3), 033010 (2012). https://doi.org/10.1117/1.JEI.21.3.033010 JEIME5 1017-9909 Google Scholar

41. 

A. Midya and J. Chakraborty, “Classification of benign and malignant masses in mammograms using multi-resolution analysis of oriented patterns,” in IEEE 12th Int. Symp. on Biomedical Imaging (ISBI), 411 –414 (2015). https://doi.org/10.1109/ISBI.2015.7163899 Google Scholar

42. 

L. I.-K. Lin, “A concordance correlation coefficient to evaluate reproducibility,” Biometrics, 45 (1), 255 –268 (1989). https://doi.org/10.2307/2532051 BIOMB6 0006-341X Google Scholar

43. 

G. McBride, “A proposal for strength-of-agreement criteria for Lin’s concordance correlation coefficient,” (2005). Google Scholar

44. 

A. Nakamoto et al., “Clinical evaluation of image quality and radiation dose reduction in upper abdominal computed tomography using model-based iterative reconstruction; comparison with filtered back projection and adaptive statistical iterative reconstruction,” Eur. J. Radiol., 84 (9), 1715 –1723 (2015). https://doi.org/10.1016/j.ejrad.2015.05.027 EJRADR 0720-048X Google Scholar

Biographies for the authors are not available.

© 2018 Society of Photo-Optical Instrumentation Engineers (SPIE) 2329-4302/2018/$25.00 © 2018 SPIE
Abhishek Midya, Jayasree Chakraborty, Mithat Gönen, Richard K. G. Do M.D., and Amber L. Simpson "Influence of CT acquisition and reconstruction parameters on radiomic feature reproducibility," Journal of Medical Imaging 5(1), 011020 (15 February 2018). https://doi.org/10.1117/1.JMI.5.1.011020
Received: 1 June 2017; Accepted: 23 January 2018; Published: 15 February 2018
Lens.org Logo
CITATIONS
Cited by 69 scholarly publications.
Advertisement
Advertisement
RIGHTS & PERMISSIONS
Get copyright permission  Get copyright permission on Copyright Marketplace
KEYWORDS
Computed tomography

Current controlled current source

Feature extraction

Reconstruction algorithms

CT reconstruction

Image segmentation

Liver

Back to Top