Figure 3: Execution time distribution of kernel execution and auxiliary API functions.