Research Article

3D Data Denoising via Nonlocal Means Filter by Using Parallel GPU Strategies

Table 6

Full unrolling algorithm on a single GPU unit. Execution times and speed-up values for a 3D dataset of normally distributed random numbers (size ) for several window configurations.

Execution time/speed-up
1 GPU unitCPU
(16, 16, 1)(128, 1, 1)(256, 1, 1)(512, 1, 1)

71.4/39.470/40.270.1/40.173.9/38.12814
514/38.5504/39.2505/39.2538/36.819790
254/32244/33.3244/33.3267/30.48133
1845/31.91761/33.41765/33.31964/29.958785