Research Article

Sound Classification Based on Multihead Attention and Support Vector Machine

Table 5

Classification accuracy on GTZAN compared across different numbers of heads and layers with Feature 1 and Feature 2 individually.

FeatureHead (#)L (#)MhaNN accu. (%)MhaNN-SVM accu. (%)MhaNN-LR accu. (%)MhaNN-KNN accu. (%)

Feature 12181.882.981.682.2
282.984.081.383.4
381.281.779.282.0
4182.383.182.083.3
285.488.484.786.7
384.286.185.584.8
8182.784.882.783.1
283.685.183.384.1
381.283.278.780.1

Feature 22170.172.270.372.0
276.578.777.074.8
372.574.670.872.2
4171.073.372.770.9
275.178.076.275.8
373.775.373.672.1