Figure 8: Performance of Square and Vectoraddition applications with different workload per workitem.