Research Article
Exploring Many-Core Design Templates for FPGAs and ASICs
Table 4
32-Node. Performance comparison between MARC, hand-optimized, and GPGPU.
| Configuration | Per iteration | Relative | | Time (s) | Perf. |
| GPGPU scaled reference | 174 | 0.0024 | MARC-Ropt-F | 2550 | 0.0002 | MARC-C1-F | 172 | 0.0025 | MARC-C2-F | 124 | 0.0034 | MARC-C4-F | 136 | 0.0031 | Hand design FPGA | 51.4 | 0.0082 | MARC-Ropt-A | 27.6 | 0.0152 | MARC-C1-A | 1.47 | 0.2863 | MARC-C2-A | 1.11 | 0.3808 | MARC-C4-A | 1.17 | 0.3608 | Hand design ASIC | 0.422 | 1.0000 |
|
|