Research Article

Performance of Post-Training Two-Bits Uniform and Layer-Wise Uniform Quantization for MNIST Dataset from the Perspective of Support Region Choice

Table 1

SQNR and model accuracy for application of different two-bits UQ designs.

Two-bits UQ, accuracy (FP32) = 98.1%

xmin = −7.063787, xmax = 4.8371024
xmax(H) = 1.9605, xmax(J) = 2.1748
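| | Case 1 | Case 2 | Case 3 | Case 4 |
|---|---|---|---|---|
| Support region g | [−xmax, xmax] | [xmin, −xmin] | [−xmax(H), xmax(H)] | [−xmax(J), xmax(J)] |
| SQNRex UQ (dB) | 2.8821 | −1.2402 | 8.7676 | 8.7639 |
| SQNRth UQ (dB) | 1.9360 | −2.0066 | 6.9787 | 7.0707 |
| Accuracy (%) | 96.97 | 94.58 | 96.34 | 96.74 |
| Within g (%) | 99.988 | 100 | 94.787 | 96.691 |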
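The quantities reported in Table 1 can be illustrated with a minimal sketch. It assumes a generic mid-rise two-bit uniform quantizer that clips values outside a symmetric support region [−x_max, x_max]; the table does not state the article's exact quantizer definition, so this is an illustration and not the authors' implementation. The function names (`quantize`, `sqnr_db`, `within_percent`) are hypothetical, and the weights are synthetic Gaussian samples, so the printed numbers are not expected to reproduce Table 1. Only the four support-region bounds are taken from the table.

```python
import math
import random


def quantize(w, x_max, bits=2):
    """Mid-rise uniform quantizer on [-x_max, x_max] (assumed design).

    Values outside the support region are mapped to the outermost level.
    """
    levels = 2 ** bits
    step = 2.0 * x_max / levels
    # Index of the quantization cell, clipped to the valid range.
    idx = math.floor((w + x_max) / step)
    idx = min(max(idx, 0), levels - 1)
    # Representation level is the midpoint of the cell.
    return -x_max + (idx + 0.5) * step


def sqnr_db(weights, quantized):
    """Signal-to-quantization-noise ratio in dB."""
    signal = sum(w * w for w in weights)
    noise = sum((w - q) ** 2 for w, q in zip(weights, quantized))
    return 10.0 * math.log10(signal / noise)


def within_percent(weights, x_max):
    """Percentage of weights that fall inside [-x_max, x_max]."""
    inside = sum(1 for w in weights if abs(w) <= x_max)
    return 100.0 * inside / len(weights)


# Synthetic stand-in for trained weights (assumption: NOT the article's data).
rng = random.Random(0)
weights = [rng.gauss(0.0, 1.0) for _ in range(10_000)]

# Support-region bounds copied from Table 1 (Case 2 uses -xmin).
cases = {
    "Case 1": 4.8371024,
    "Case 2": 7.063787,
    "Case 3": 1.9605,
    "Case 4": 2.1748,
}

for name, x_max in cases.items():
    quantized = [quantize(w, x_max) for w in weights]
    print(name,
          f"SQNR = {sqnr_db(weights, quantized):.4f} dB,",
          f"within = {within_percent(weights, x_max):.3f} %")
```

With only four levels, a wider support region enlarges the step size and therefore the granular noise for the bulk of the weights, while a narrower region clips more of the tails. This is the trade-off that the SQNR and "Within g" rows of Table 1 reflect.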