Research Article

Performance Optimization and Modeling of Fine-Grained Irregular Communication in UPC

Table 3

Time usage (in seconds) of 1000 iterations SpMV for Test problems 1–3.

1 node2 nodes4 nodes8 nodes16 nodes32 nodes64 nodes
16 threads32 threads64 threads128 threads256 threads512 threads1024 threads

Test problem 1: 6,810,586 tetrahedrons
 UPCv128.80522.15443.981882.01551.20311.54183.73
 UPCv239.3736.7023.6818.8913.619.989.57
 UPCv325.0115.078.224.652.912.685.56

Test problem 2: 13,009,527 tetrahedrons
 UPCv159.142525.053532.333657.953078.352613.851588.67
 UPCv273.7969.6055.3336.3924.1625.0621.29
 UPCv346.8824.9715.4310.916.255.157.54

Test problem 3: 25,587,400 tetrahedrons
 UPCv1115.252990.921758.94986.851302.524653.102692.69
 UPCv2154.72178.14122.3881.7752.9941.1644.80
 UPCv393.3048.7426.1315.3711.127.4110.16