Review Article

Energy-Aware High-Performance Computing: Survey of State-of-the-Art Tools, Techniques, and Environments

Table 3

Device type.

Device typeWorksDescription

(1) Single/multicore/manycore CPU[33]A platform based on ARM Cortex A9, 4, 8, and 16 core architectures
[49]Multicore CPUs as part of a node and cluster on which an MPI application runs
[35]A chip with k cores with specific frequencies is considered
[54]Cluster, 40 performance counters are investigated and correlated for energy-aware optimization, related to runtime, system, CPU, and memory power
[38]A multicore system with cores as discrete thermal elements
[58]Possibly also (2 multiprocessor system)
[24]32-node cluster each with 2 Sandy Bridge 8 core CPUs
[40]Multicore and manycore CPUs
[52]Sandy Bridge and Haswell Xeon CPUs
[77]Servers with single CPUs hosting VMs
[41]Single-core Pentium-M (32-bit) in a off-the-shelf laptop
[59]Single-core AMD Athlon-64
[42]Intel Xeon Phi KNL 7250 processor with 68 cores, flat memory mode
[43]Multicore Intel Xeon CPU

(2) Multiprocessor system[44]A heterogeneous real-time multiprocessor system-on-chip (MPSoC) system—consists of a number of processors each of which runs at its voltage and speed
[47]A cluster with Intel Xeon CPUs
[30]A multiprocessor system with Intel Xeon CPUs
[48]A cluster with ARM CPUs, a cluster with Intel Ivy Bridge CPUs
[74]A grid system parametrized with the number of hosts, distribution of computing capacities, and host selection policy
[50]A system with a number of nodes with multicore CPUs assumed in the simulated HPC platform and cores of an Intel core M CPU with 6 voltage/frequency levels assumed
[53]Cluster with consideration of CPU, memory, disk, and network
[55, 56, 61, 62, 65]Cluster with CPUs
[45]Many cores within a system, and core and uncore frequencies are of interest
[57]With consideration of disk and network scaling
[14]Systems with 2 socket Westmere-EP, 2 socket Sandy Bridge-EP, and 1 socket Ivy Bridge-HE CPUs
[76]Undefined machines in a data center capable of hosting up to 15 VMs
[60]Homogeneous multicore cluster
[63, 64]Cluster with multicore CPUs
[66, 67]Cluster in a data center
[68]Sandy Bridge cluster
[69]Cluster with InfiniBand
[46]Dual-socket server with two Intel Xeon CPUs
[70]Overprovisioned HPC cluster with CPUs
[71]Cluster with 1056 Dell PowerEdge SC1425 nodes

(3) GPU/accelerator[34]A GPU allowing concurrent kernel execution and frequency scaling
[36]Focus on the GPU version and comparison to serial and multithreaeded CPU versions
[75]GPUs used for generation of parity data in a RAID
[37]Postapplication minimization of energy consumed
[39]Server, desktop, mobile GPUs, not yet existing GPUs can be simulated
[72]A cluster with with two Intel Xeon CPUs per node

(4) Hybrid[73]Consideration of both GPUs and CPUs in a cluster or collection of clusters
[51]Targeted at optimization on a cluster with Intel Xeon CPUs and MICs, early evaluation performed using OpenMP on multicore Intel and AMD CPUs