Research Article

IP-Enabled C/C++ Based High Level Synthesis: A Step towards Better Designer Productivity and Design Performance

Table 8

Mapping of Matrix Multiplication Operation.

Design
(2 2 matrix multiplication; 16 bit integer elements)
XC4VFX12FF668-10
LUTFFDSPBRAMCLK
(ns)7
Latency/
throughput8
Wall clock time
(ns)
IP coreComments

D1 (v-HLS)3162102.85670/70199.9NoExternal memory (EM)
D2 (v-HLS)39138802.8569/925.7NoEM
D3 (v-HLS)13586133.247N.AN.ANo
D4 (v-HLS)129198833.112N.AN.ANo
D5 (v-HLS)124103133.215117/117376.1No
D6 (v-HLS)155214832.98859/59176.2No
D7 (proposed)269622242.95622/411.8Yes
D8 (proposed)5141688803.12617/13.1Yes
D9 (proposed)183576473.00517/23.0Yes

Clock period constraint of 4 ns with 0.5 ns clock jitter.
The best case and the worst case latencies were the same for v-HLS flows.