Research Article

True 4D Image Denoising on the GPU

Table 2

The table shows the in and out data resolution, the used equations and the memory consumption for all the processing steps for spatial filtering (SF) and FFT-based filtering (FFTBF). Note that the driver for the GPU is stored in the global memory, and it normally requires 100–200 MB.

Processing step Resolution, SF Memory consumption, SF Resolution, FFTBF Memory consumption, FFTBF

Lowpass filtering and downsampling of CT volumes in    406 MB in    294 MB
out out

Filtering with 14 monomial filters and calculating the local structure tensor ((10), (11)) in    1376 MB in    1791 MB
out out

Lowpass filtering of the local structure tensor components (normalized convolution, (25)) in    1276 MB in    720 MB
out out

Calculating the tensor magnitude and mapping it with the M-function ((17), (18), (19), (13)) in    1376 MB in    770 MB
out out

Mapping the local structure tensor to the control tensor ((20), (22), (23)) in    1376 MB in    770 MB
out out

Lowpass filtering of the control tensor components (normalized convolution, (25)) in    1476 MB in    820 MB
out out

Filtering with 11 reconstruction filters, interpolating the control
tensor on the fly, and calculating
the denoised data (24)
in    2771 MB in    2110 MB
out out (three rounds×6 slices = 18 denoised slices in total)