Research Article
A Coarse-Grained Reconfigurable Architecture with Compilation for High Performance
Table 2
Small configuration results.
| Arch. |
App. | Small Cfg.: 1 tile, PEs each tile | Ops | Cycles | Avg. IPC | Perf. Gain | Efficiency |
| RCP | idct(row+col) | — | — | 9.2 | — | 57% | FDR-CGRA | idct(row+col) | 2042 | 184 | 11.1 | 21% | 69% |
| FDR-CGRA | interpolate_avg4_c | 1193 | 136 | 8.8 | — | 55% | FDR-CGRA | interpolate_halfpel__c | 1295 | 135 | 9.6 | — | 60% | FDR-CGRA | sad16_c() | 3441 | 339 | 10.2 | — | 63% |
| ADRES | get_blocks (64 PEs) | — | — | 29.9(64 PEs) | — | 47% | FDR-CGRA | get_block(H) | 340 | 38 | 8.9 | — | 56% | FDR-CGRA | get_block(V) | 296 | 37 | 8.0 | — | 50% | FDR-CGRA | get_block(V+H) | 899 | 93 | 9.7 | — | 60% | FDR-CGRA | get_block(H+V) | 900 | 95 | 9.5 | — | 59% | FDR-CGRA | Adjusted Avg. (4 tiles) | — | — | 36.1(4 tiles) | 21% | 56% |
|
|