Wireless Communications and Mobile Computing

Research Article

Speaker Gender Recognition Based on Deep Neural Networks and ResNet50

Table 2

ResNet 34 and ResNet 50 detailed architecture.


Layer name	Output size	34 layers	50 layers

Conv 1		, 64, stride 2
Conv 2.x		max pool, stride 2
Conv 2.x
Conv 3.x
Conv 4.x		[3×3,256 3×3,256]×6
Conv 5.x
FLOPs		Average pool, 1000-d fc, softmax