Research Article

Cache Locality-Centric Parallel String Matching on Many-Core Accelerator Chips

Figure 9

Illustration of the global-memory-only (P-1) implementation.