Table 3: Performance comparison between optimized GPU solution on Quadro FX 5800 and parallel CPU solution on E5540 with fixed .

          CPU      GPU    Speedup

 128     0.29     0.25     1.14
 256     1.06     0.84     1.26
 512     3.96     2.82     1.40
1024    15.20     9.33     1.63
2048    60.16    33.59     1.79
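
The speedup column appears to be the ratio of CPU time to GPU time. The sketch below recomputes it from the tabulated values as a sanity check. It assumes that definition, which the table does not state explicitly, and the names `ROWS`, `speedup`, and `recomputed` are hypothetical. Because the tabulated times are already rounded, the recomputed ratios can differ from the reported ones in the last digit.

```python
# Rows copied from Table 3:
# (first-column value, CPU time, GPU time, reported speedup).
ROWS = [
    (128, 0.29, 0.25, 1.14),
    (256, 1.06, 0.84, 1.26),
    (512, 3.96, 2.82, 1.40),
    (1024, 15.20, 9.33, 1.63),
    (2048, 60.16, 33.59, 1.79),
]


def speedup(cpu_time, gpu_time):
    """Assumed definition: CPU time divided by GPU time."""
    return cpu_time / gpu_time


def recomputed():
    """Return (first-column value, recomputed speedup, reported speedup) per row."""
    return [
        (n, round(speedup(cpu, gpu), 2), reported)
        for n, cpu, gpu, reported in ROWS
    ]


if __name__ == "__main__":
    for n, ratio, reported in recomputed():
        print(f"{n:>5}  recomputed={ratio:.2f}  reported={reported:.2f}")
```

Under this assumption the recomputed ratios stay within a few hundredths of the reported column, and both grow steadily from the smallest to the largest row.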