Research Article

Performance Optimization and Modeling of Fine-Grained Irregular Communication in UPC

Table 2

Time usage (in seconds) of 1000 SpMV iterations for Test problem 1: naive UPC implementation (Listing 2) versus the first transformed UPC implementation (Listing 3).

Implementation   1 thread   2 threads   4 threads   8 threads   16 threads

Naive UPC          895.44      548.57      301.17      173.08       106.10
UPCv1              270.40      159.51       86.37       51.10        28.80