Research Article

Effective SIMD Vectorization for Intel Xeon Phi Coprocessors

Table 7

Vector register contents after the final shuffle.

A_v512A11A21A31A41
A12A22A32A42
A13A23A33A43
A14A24A34A44