Research Article

Deep Neural Networks with Multistate Activation Functions

Table 2

Cross entropy losses on the training set. “Plain” means the conventional SGD without mean-normalisation and SVD restructuring. “MN” means mean-normalised SGD and “MN + SVD” means the SVD restructuring method using mean-normalised SGD.

Activation function Plain (%) MN (%) MN + SVD (%)

Logistic 1.4466 1.4602 1.5401
Two-order MSAF 1.5401 1.4183 1.5005
Three-order MSAF 1.4682 1.4176 1.5002
SYM-MSAF 1.4637 1.4373 1.5177