Research Article

A Low-Power Scalable Stream Compute Accelerator for General Matrix Multiply (GEMM)

Table 1

Naming convention.

Hardware port Corresponding operand

C Cache prefetch stream
S Compute stream
B Auxiliary stream , , ,
P Result stream ,
Designates input
Designates output