Research Article

A Strategy for Automatic Performance Tuning of Stencil Computations on GPUs

Table 1

Optimization abbreviations and allowable values. N denotes the input/output array size in each dimension. X, Y, and Z correspond to the , , and dimensions of the work-item grid, respectively.

Optimization AbbreviationRange of values
MinimumMaximum

Work-group sizeWX, WY, WZ1NX, NY, NZ
Block merge factorBX, BY, BZ1NX, NY, NZ
Cyclic merge factorCX, CY, CZ1NX, NY, NZ
Vector widthVX116
Local memoryN/A0 (no)1 (yes)
Image memoryN/A0 (no)1 ( yes)