Research Article

True 4D Image Denoising on the GPU

Table 1

Comparison of three Nvidia GPUs, from three different generations, in terms of processor cores, memory bandwidth, shared memory size, cache memory, and number of registers. MP stands for multiprocessor and GB/s for gigabytes per second. For the GTX 580, the user can choose, for each kernel, between 48 KB of shared memory with 16 KB of L1 cache and 16 KB of shared memory with 48 KB of L1 cache.

| Property/GPU | 9800 GT | GTX 285 | GTX 580 |
|---|---|---|---|
| Number of processor cores | 112 | 240 | 512 |
| Normal size of global memory | 512 MB | 1024 MB | 1536 MB |
| Global memory bandwidth | 57.6 GB/s | 159.0 GB/s | 192.4 GB/s |
| Constant memory | 64 KB | 64 KB | 64 KB |
| Shared memory per MP | 16 KB | 16 KB | 48/16 KB |
| Float registers per MP | 8192 | 16384 | 32768 |
| L1 cache per MP | None | None | 16/48 KB |
| L2 cache | None | None | 768 KB |
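
The per-kernel choice between shared memory and L1 cache mentioned in the caption can be requested through the CUDA runtime call `cudaFuncSetCacheConfig`. The following is a minimal sketch, not code from the article: the kernel `tile_copy`, the array size, and the launch configuration are hypothetical and serve only to show where the call goes. The setting is a preference, and the runtime may ignore it on devices with a fixed shared memory size.

```cuda
#include <cstdio>
#include <cuda_runtime.h>

// Hypothetical kernel (an assumption for illustration): stages each
// element in dynamically sized shared memory before writing it back.
__global__ void tile_copy(const float* in, float* out, int n)
{
    extern __shared__ float tile[];
    int i = blockIdx.x * blockDim.x + threadIdx.x;
    if (i < n) tile[threadIdx.x] = in[i];
    __syncthreads();  // reached by all threads in the block
    if (i < n) out[i] = tile[threadIdx.x];
}

int main()
{
    // Ask for the larger shared memory split (48 KB shared, 16 KB L1 on
    // the GTX 580) for this kernel only. cudaFuncCachePreferL1 would
    // request the opposite split.
    cudaError_t err = cudaFuncSetCacheConfig(tile_copy, cudaFuncCachePreferShared);
    if (err != cudaSuccess) {
        std::printf("cudaFuncSetCacheConfig: %s\n", cudaGetErrorString(err));
        return 1;
    }

    const int n = 1 << 20;        // hypothetical problem size
    const int threads = 256;      // hypothetical block size
    const int blocks = (n + threads - 1) / threads;

    float *in = NULL, *out = NULL;
    cudaMalloc(&in, n * sizeof(float));
    cudaMalloc(&out, n * sizeof(float));
    cudaMemset(in, 0, n * sizeof(float));

    // The third launch parameter is the dynamic shared memory per block.
    tile_copy<<<blocks, threads, threads * sizeof(float)>>>(in, out, n);
    err = cudaDeviceSynchronize();
    std::printf("kernel status: %s\n", cudaGetErrorString(err));

    cudaFree(in);
    cudaFree(out);
    return err == cudaSuccess ? 0 : 1;
}
```

A kernel that reuses data within a thread block, as in the sketch, would typically favor the shared memory split, while a kernel with irregular global memory access might benefit more from the larger L1 cache. Which setting is faster for a given kernel is best determined by measurement.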