Research Article

Speaker Gender Recognition Based on Deep Neural Networks and ResNet50

Table 2

ResNet 34 and ResNet 50 detailed architecture.

Layer nameOutput size34 layers50 layers

Conv 1, 64, stride 2
Conv 2.x max pool, stride 2
Conv 3.x
Conv 4.x[3×3,256 3×3,256]×6
Conv 5.x
FLOPsAverage pool, 1000-d fc, softmax