Application Article

A High-Performance Parallel FDTD Method Enhanced by Using SSE Instruction Set

Table 2

Performance comparison of parallel FDTD method on 4-CPU workstation and high-performance cluster.

| Platform | CPU type | Network | Simulation time |
| --- | --- | --- | --- |
| Workstation (4 CPUs, 64 GB RAM) | AMD Opteron 6168, 1.9 GHz ($795 each) | No network | 319 min (with hardware acceleration and NUMA) |
| 2 CPUs (24 GB RAM) | Intel Xeon X5570, 2.93 GHz ($1465 each) | No network | 1720 min (with hardware acceleration, no NUMA) |
| 128 CPUs (1536 GB RAM) | Intel Xeon X5570, 2.93 GHz ($1465 each) | InfiniBand | 29 min 37 s (with hardware acceleration, no NUMA) |