Research Article
A Strategy for Automatic Performance Tuning of Stencil Computations on GPUs
Table 9
Heuristic performance on various input sizes.
| Data size per dimension | NVIDIA 32 | NVIDIA 64 | NVIDIA 128 | NVIDIA 256 | NVIDIA 512 | AMD 32 | AMD 64 | AMD 128 | AMD 256 | AMD 512 |
|---|---|---|---|---|---|---|---|---|---|---|
| Speedup: Hybrid oracle | 1.03 | 1.04 | 1.15 | 1.18 | 1.19 | 0.98 | 0.93 | 1.12 | 1.07 | 1.36 |
| Speedup: Hybrid | 0.97 | 1.04 | 1.15 | 1.18 | 1.19 | 0.85 | 0.91 | 1.11 | 1.06 | 1.39 |
| Speedup: Random sample | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 |
| Correct predictions | 2 | 7 | 7 | 7 | 7 | 3 | 3 | 6 | 6 | 6 |