Open Access Paper
17 October 2022 Learning CT scatter estimation without labeled data: a feasibility study
Joscha Maier, Luca Jordan, Elias Eulig, Fabian Jäger, Stefan Sawall, Michael Knaup, Marc Kachelrieß
Author Affiliations +
Proceedings Volume 12304, 7th International Conference on Image Formation in X-Ray Computed Tomography; 123042Z (2022) https://doi.org/10.1117/12.2646588
Event: Seventh International Conference on Image Formation in X-Ray Computed Tomography (ICIFXCT 2022), 2022, Baltimore, United States
Abstract
Since X-ray scattering is a major cause of artifacts, its correction is a crucial step in almost any CT application. Most existing approaches, however, are based on complex theoretical models that need to be tailored to that particular application. To perform scatter estimation in absence of such models, we propose the unsupervised deep scatter estimation (uDSE). Here, uDSE combines a scatter estimation network that operates in projection domain with a scatter correction layer and CT reconstruction layer. In that way scatter estimation can be trained using an unsupervised Wassersten GAN (WGAN) setup in which the parameters of the scatter estimation network are optimized such that the resulting scatter corrected reconstructions cannot be distinguished from samples of a true artifact-free reference set. To demonstrate the feasibility of the proposed approach, uDSE is evaluated for simulated CBCT scans. Applied to the corresponding test data, uDSE is able to remove most of the present scatter artifacts and yields similar CT value accuracy (mean error of 27.9 HU vs. 24.7 HU) as a state-of-the-art supervised scatter estimation approach. Thus, uDSE may be used in the future to learn scatter estimation in cases where labels are not available or cannot be generated with sufficient accuracy.

I.

INTRODUCTION

THE contribution of scattered x-rays to the acquired projection data leads to a violation of CT reconstruction criteria, and thus, to the introduction of CT artifacts. In particular this holds true for cone-beam CT (CBCT), where scatter-to-primary ratios may easily be in the order of 1 and above. Therefore, scatter correction is a crucial preprocessing step to achieve diagnostic image quality. Typically, existing approaches implement such a correction by deriving an estimate of the present scatter distribution and by subtracting it subsequently from the acquired raw data. Here, the scatter distribution can either be estimated using dedicated hardware such as beam blockers or primary modulation grids [1][5],or using software-based approaches. The latter rely on physical, empirical, or consistency-based models to predict x-ray scattering [6][13],or more recently, on neural networks that make use of such models during training, e.g. by being trained to reproduce the output of Monte Carlo simulations [14][18]. While these approaches have proven great potential in terms of accuracy and computation time, their performance highly depends on the quality and the availability of labeled data. Since several applications may lack such data, this study proposes the unsupervised deep scatter estimation (uDSE) which is able to overcome this limitation. Instead of using a supervised setup, uDSE relies on a Wasserstein GAN (WGAN) setup that can be trained without labeled data or prior knowledge about the CT scanner’s x-ray and scatter properties [19]. Here we demonstrate the feasibility of the proposed approach using simulated CBCT data and compare the results against DSE, our supervised scatter estimation approach [14], [15].

II.

MATERIAL AND METHODS

A.

Deep Scatter Estimation (DSE)

The basic idea of the DSE approach is illustrated in figure 1. Here, DSE uses a U-net-like architecture to predict scatter as a function of the acquired projection data. To learn the corresponding mapping, DSE is trained to reproduce Monte Carlo simulations, i.e. the U-net’s weights are determined by minimizing the following loss function:

00108_PSISDG12304_123042Z_page_1_1.jpg

Fig. 1.

Schematic of the DSE and the proposed uDSE approach. DSE uses a U-net-like architecture to predict scatter as a function of the acquired projection data. In a supervised setup DSE is trained to reproduce a ground truth Monte Carlos scatter distribution. uDSE, in contrast, uses a Wasserstein GAN (WGAN) setup. Here, a scatter correction followed by a CT reconstruction is performed. Subsequently, the correction is evaluated by a critic network that is trained simultaneously to recognize scatter artifact-free images. By optimizing the weights of the scatter estimation network to fool the critic network, scatter estimation can be learned without labels.

00108_PSISDG12304_123042Z_page_2_1.jpg

where θ denotes the parameter vector, n the sample number within a batch of size B, In the flat field-corrected intensities, and Sn the Monte Carlo scatter estimate. It has to be noted that the first layer of the DSE network performs a “pep”-transform to be consistent with our DSE publication [14].

00108_PSISDG12304_123042Z_page_2_2.jpg

B.

Unsupervised Deep Scatter Estimation (uDSE)

The proposed uDSE approach, shown in figure 1, extends the concept of DSE to cases where labeled data are not available. To do so, it is composed of a generator network and a critic network. Here, the generator combines the DSE network with a scatter correction layer and a Feldkamp reconstruction layer, such that it is able to map acquired intensities I of a CT scan to scatter-corrected CT reconstructions. The critic network, in turn, is designed to distinguish between the generator’s output and true scatter-free CT reconstructions. Thus, letting the critic network act as loss function for the generator allows to learn CT scatter estimation without labeled or paired data, respectively.

Here, the corresponding optimization is performed using a WGAN setup in which the generator netork Gθg (I) and the critic network Cθc (f) are optimized in an alternating manner according to the following loss functions:

00108_PSISDG12304_123042Z_page_2_3.jpg
00108_PSISDG12304_123042Z_page_2_4.jpg

where θc and θg denote the parameter vectors of the generator and the critic network, n is the sample number within a batch of size B, and freal corresponds to a sample from a set of almost scatter-free clinical CT reconstructions.

C.

Datasets

In the present study, DSE as well as uDSE were trained and tested on simulated CBCT data. Therefore, clinical CT reconstructions of 65 patients were used as prior. Based on the corresponding voxel volumes fprior, CBCT scans with 360 views and an angular coverage of 360° were simulated at five different z-positions within the abdomen region using a tube voltage of 120 kV, a source-to-isocenter distance of 700 mm, a source-to-detector distance of 1100 mm, and a 1024×768 flat detector with an isotropic pixel spacing of 0.39 mm. Furthermore, a shifted detector was used to increase the field of measurement to about 380 mm. Using this setup, three datasets were generated using different patients: one scatter-corrupted dataset (30 patients) from which input data for the generator network were sampled during training, one scatter-free dataset (30 patients) that was used to provide ideal reference CT reconstructions for the critic network, and a scatter-corrupted dataset (5 remaining patients) for testing. In any case, the scatter-corrupted data were simulated as I = IP + SMC, where Ip corresponds to a polychromatic forward projection of the prior volume and SMC is a scatter distribution that was generated using our in-house Monte Carlo simulation. The ideal reference data, on the other hand, correspond to a CBCT reconstruction of only the primary intensities, i.e. freal = X‒1(‒ln(Ip)), with X‒1 being the CBCT reconstruction operator. It has to be noted that freal is not sampled directly from the set of clinical CT reconstructions, but is generated via forward and back-projection, to have the same spatial resolution and the same field of measurement as the reconstructions provided by the generator network.

D.

Training and Evaluation

The uDSE approach was trained using the datasets described in section II-C. However, to avoid memory issues as well as to increase the computational performance, a 2-fold angular downsampling to 180 views and an 8-fold spatial downsampling to a detector size of 128×96 was performed in advance. This downsampling can be justified by the fact that scatter distributions are known to be of low frequency. Therefore, we only expect a minor degradation of accuracy compared to a training that uses the full size projection data. Given the small detector size, the internal reconstruction operations were also performed on a low resolution grid with 144×144×48 voxels and an isotropic spacing of 2.7 mm.

The corresponding optimization was implemented according to section II-B using the Tensorflow frame-work. Here, all hyperparameters except for the batch size (this study: B = 16) and the number of iterations (this study: Nitr = 5000) were chosen following to the original WGAN publication, i.e. five updates of the critic network are followed by one update of the generator network using an RMSProp optimizer with a learning rate of 0.00005 and a weight clipping to [−0.01, 0.01] in the critic network.

Finally, uDSE was applied to the test data and compared against DSE which was trained in a supervised setting using the Monte Carlo scatter distributions a labels. In contrast to the training data, the scatter correction was performed on the full size projection data. Therefore, all scatter predictions were upsampled to the original detector size of 1024 × 768 pixels prior to scatter correction.

III.

RESULTS

To estimate scatter, DSE and uDSE were trained as described in section II-D and evaluated for the five patients of the test dataset. Exemplary scatter predictions are shown in figure 2. Here, the DSE approach yields scatter estimates that are almost equal to the Monte Carlo ground truth while uDSE shows slightly higher deviations. This can be attributed to the fact the uDSE loss function is evaluated in image domain. Therefore, the scatter estimate is less reliable in regions where it has a low impact on image quality, i.e. in regions with low scatter-to-primary ratio. In particular this explains the poor accuracy of uDSE scatter estimates in air regions without patient intersection.

Fig. 2.

DSE and uDSE scatter estimates for an exemplary shifted detector projection of the test dataset (top row), as well as the corresponding error with respect to the ground truth Monte Carlo scatter distribution (bottom row).

00108_PSISDG12304_123042Z_page_3_1.jpg

A quantitative evaluation of the scatter estimates in terms of the mean absolute percentage error with respect to the ground truth yields similar trends. Here, the average error of DSE for all 9000 projections of the test dataset is 3.5 % while the average error of uDSE is 9.8 %.

CT reconstructions with and without scatter correction are shown in figure 3. Here, DSE as well as uDSE are able to remove most of the artifacts that are present in the uncorrected reconstruction and provide CT images that are almost equal to the scatter-free ground truth. Quantitatively, the application of the scatter correction improves the mean absolute error of the CT values from 160.7 HU (no correction) to 24.7 HU (DSE) or 27.9 HU (uDSE), respectively.

Fig. 3.

CT reconstruction without and with DSE and uDSE scatter correction (top row), as well as difference images to the scatter free ground truth (bottom row).

00108_PSISDG12304_123042Z_page_4_1.jpg

IV.

DISCUSSION AND CONCLUSION

This study introduces a novel approach to learn scatter estimation without labeled data. To do so, the proposed uDSE makes use of a WGAN setup in which the generator network is optimized such that its output, i.e. scatter corrected reconstructions, cannot be distinguished from samples of an artifact-free reference set. Here, we demonstrate the feasibility of uDSE using CBCT simulations as input and clinical CT reconstructions as reference. However, it has to be noted that uDSE is not restricted to this particular choice but can be trained with any tomographic input and any scatter-free reference as long as both distributions are sufficiently equal after scatter correction. The fact that this condition is perfectly met here, i.e. input and reference distributions only differ by scatter, can be considered as a limitation of this study. Practically, both distribution may additionally differ by the amount of beam hardening, the dynamic range, or the contrast media distribution for instance. Investigating uDSE’s performance in such cases is subject to further research and may require the incorporation of additional constraints and correction layers. In the current setup, however, uDSE is able to remove most of the present scatter artifacts and yields similar CT value accuracy (mean error of 27.9 HU vs. 24.7 HU) as a state-of-the-art supervised scatter estimation approach. Thus, uDSE has the potential to extend the concept of neural network-based scatter estimation and correction to scenarios where labels are not available or cannot be generated with sufficient accuracy.

ACKNOWLEDGMENT

Parts of the reconstruction software RayConStruct IR and of the Monte Carlo software RayConStruct MC were provided by RayConStruct® GmbH, Nürnberg, Germany.

REFERENCES

[1] 

R. Ning, X. Tang, and D. Conover, “X-ray scatter correction algorithm for cone beam CT imaging,” Medical Physics, 31 (5), 1195 –1202 (2004). https://doi.org/10.1118/1.1711475 Google Scholar

[2] 

L. Zhu, N. R. Bennett, and R. Fahrig, “Scatter Correction Method for X-Ray CT Using Primary Modulation: Theory and Preliminary Results,” IEEE Transactions on Medical Imaging, 25 (12), 1573 –1587 (2006). https://doi.org/10.1109/TMI.2006.884636 Google Scholar

[3] 

R. Grimmer, R. Fahrig, W. Hinshaw, H. Gao, and M. Kachelrieß, “Empirical Cupping Correction for CT Scanners with Primary Modulation (ECCP),” Medical Physics, 39 (2), 825 –831 (2012). https://doi.org/10.1118/1.3676180 Google Scholar

[4] 

L. Ritschl, R. Fahrig, M. Knaup, J. Maier, and M. Kachelrieß, “Robust primary modulation-based scatter estimation for cone-beam CT,” Medical Physics, 42 (1), 469 –478 (2015). https://doi.org/10.1118/1.4903261 Google Scholar

[5] 

S. Hsieh, “Estimating scatter in cone beam CT with striped ratio grids: A preliminary investigation,” Medical Physics, 43 (9), 5084 –5092 (2016). https://doi.org/10.1118/1.4960631 Google Scholar

[6] 

W. Swindell and P. M. Evans, “Scattered radiation in portal images: A Monte Carlo simulation and a simple physical model,” Medical Physics, 23 (1), 63 –73 (1996). https://doi.org/10.1118/1.597792 Google Scholar

[7] 

B. Ohnesorge, T. Flohr, and K. Klingenbeck-Regn, “Efficient object scatter correction algorithm for third and fourth generation CT scanners,” European Radiology, 9 (3), 563 –569 (1999). https://doi.org/10.1007/s003300050710 Google Scholar

[8] 

G. Poludniowski, P. M. Evans, V. N. Hansen, and S. Webb, “An efficient Monte Carlo-based algorithm for scatter correction in keV cone-beam CT,” Physics in Medicine and Biology, 54 (12), 3847 –3864 (2009). https://doi.org/10.1088/0031-9155/54/12/016 Google Scholar

[9] 

J. Star-Lack, M. Sun, A. Kaestner, R. Hassanein, G. Virshup, T. Berkus, and M. Oelhafen, “Efficient scatter correction using asymmetric kernels,” 7258 72581Z , (2009). Google Scholar

[10] 

M. Sun and J. M. Star-Lack, “Improved scatter correction using adaptive scatter kernel superposition,” Physics in Medicine and Biology, 55 (22), 6695 –6720 (2010). https://doi.org/10.1088/0031-9155/55/22/007 Google Scholar

[11] 

E. Meyer, C. Maaß, M. Baer, R. Raupach, B. Schmidt, and M. Kachelrieß, “Empirical scatter correction (ESC): A new CT scatter correction method and its application to metal artifact reduction,” in IEEE Medical Imaging Conference, 2036 –2041 (2010). Google Scholar

[12] 

M. Baer and M. Kachelrieß, “Hybrid scatter correction for CT imaging,” Physics in Medicine and Biology, 57 (21), 6849 –6867 (2012). https://doi.org/10.1088/0031-9155/57/21/6849 Google Scholar

[13] 

A. Maslowski, A. Wang, M. Sun, T. Wareing, I. Davis, and J. Star-Lack, “Acuros CTS: A fast, linear Boltzmann transport equation solver for computed tomography scatter - Part I: Core algorithms and validation,” Medical Physics, 45 (5), 1899 –1913 (2018). https://doi.org/10.1002/mp.12850 Google Scholar

[14] 

J. Maier, E. Eulig, T. Vöth, M. Knaup, J. Kuntz, S. Sawall, and M. Kachelrieß, “Real-time scatter estimation for medical CT using the deep scatter estimation: Method and robustness analysis with respect to different anatomies, dose levels, tube voltages, and data truncation,” Medical Physics, 46 (1), 238 –249 (2019). https://doi.org/10.1002/mp.2019.46.issue-1 Google Scholar

[15] 

J. Maier, S. Sawall, M. Knaup, and M. Kachelrieß, “Deep Scatter Estimation (DSE): Accurate Real-Time Scatter Estimation for X-Ray CT Using a Deep Convolutional Neural Network,” Journal of Nondestructive Evaluation, 37 (3), 57:1 –9 (2018). https://doi.org/10.1007/s10921-018-0507-z Google Scholar

[16] 

Y. Nomura, Q. Xu, H. Shirato, S. Shimizu, and L. Xing, “Projection-domain scatter correction for cone beam computed tomography using a residual convolutional neural network,” Medical Physics, mp.13583 (2019). Google Scholar

[17] 

Y. Jiang, C. Yang, P. Yang, X. Hu, C. Luo, Y. Xue, L. Xu, X. Hu, L. Zhang, J. Wang, K. Sheng, and T. Niu, “Scatter correction of cone-beam CT using a deep residual convolution neural network (DRCNN),” Physics in Medicine and Biology, 64 (14), 145003 (2019). https://doi.org/10.1088/1361-6560/ab23a6 Google Scholar

[18] 

J. Erath, T. Vöth, J. Maier, E. Fournié, M. Petersilka, K. Stierstorfer, and M. Kachelrieß, “Deep learning-based forward and cross-scatter correction in dual-source CT,” Medical Physics, 48 (9), 4824 –4842 (2021). https://doi.org/10.1002/mp.v48.9 Google Scholar

[19] 

M. Arjovsky, S. Chintala, and L. Bottou, “Wasserstein Generative Adversarial Networks,” 70 214 –223 , (2017). Google Scholar
© (2022) COPYRIGHT Society of Photo-Optical Instrumentation Engineers (SPIE). Downloading of the abstract is permitted for personal use only.
Joscha Maier, Luca Jordan, Elias Eulig, Fabian Jäger, Stefan Sawall, Michael Knaup, and Marc Kachelrieß "Learning CT scatter estimation without labeled data: a feasibility study", Proc. SPIE 12304, 7th International Conference on Image Formation in X-Ray Computed Tomography, 123042Z (17 October 2022); https://doi.org/10.1117/12.2646588
Advertisement
Advertisement
RIGHTS & PERMISSIONS
Get copyright permission  Get copyright permission on Copyright Marketplace
KEYWORDS
CT reconstruction

Computed tomography

X-ray computed tomography

Artificial neural networks

Compton scattering

RELATED CONTENT


Back to Top