Research Article

A Domain-Specific Architecture for Elementary Function Evaluation

Table 1

Accuracy, throughput, and table size (for SPU/double precision).

Function Cycles/double
new
Cycles/double
SPU
Speedup (%) Max error (ulps) Table size () Poly order

recip311.33760.50020483
div3.514.94251.333recip3
sqrt315.45130.5004096318
rsqrt314.64860.50340963

cbrt8.313.31600.5008192318
rcbrt1016.11610.501rcbrt3
qdrt7.527.63680.5008192318
rqdrt8.319.62290.501rqdrt318

log22.514.65840.5004096318
log21p3.5n/an/a1.106log23
log3.513.83941.184log23
log1p4.522.55001.726log23

exp24.513.02881.791256418
exp2m15.5n/an/a1.29exp24
exp5.014.42881.55exp24
expm15.519.53541.80exp24

atan27.523.43110.9554096218
atan7.518.52460.955atan22 + 3
asin1127.22471.706atan22 + 3 + 3
acos1127.12460.790atan22 + 3 + 3

sin1116.61501.4741283 + 352
cos1015.31531.025sin3 + 3
tan24.527.61132.051sin3 + 3 + 3