Research Article
Sound Classification Based on Multihead Attention and Support Vector Machine
Table 2
Classification accuracy on UrbanSound8K compared across different numbers of heads and layers with Feature 1 and Feature 2 individually.
| Feature | Head (#) | L (#) | MhaNN accu. (%) | MhaNN-SVM accu. (%) | MhaNN-LR accu. (%) | MhaNN-KNN accu. (%) |
| Feature 1 | 2 | 1 | 91.6 | 92.1 | 92.3 | 91.5 | 2 | 92.2 | 93.3 | 93.0 | 92.9 | 3 | 91.6 | 93.3 | 91.7 | 92.2 | 4 | 1 | 91.8 | 92.7 | 91.6 | 92.1 | 2 | 92.1 | 93.6 | 92.8 | 93.2 | 3 | 92.1 | 94.6 | 92.3 | 93.0 | 8 | 1 | 91.4 | 93.2 | 91.0 | 92.9 | 2 | 90.9 | 92.1 | 91.7 | 91.0 | 3 | 90.5 | 91.0 | 90.8 | 91.2 |
| Feature 2 | 2 | 1 | 83.7 | 84.8 | 86.1 | 85.2 | 2 | 89.1 | 90.3 | 87.8 | 88.1 | 3 | 86.2 | 87.4 | 86.1 | 86.8 | 4 | 1 | 85.5 | 86.7 | 85.9 | 85.1 | 2 | 87.1 | 89.7 | 87.2 | 88.4 | 3 | 83.0 | 84.1 | 82.7 | 83.0 |
|
|