Figure 5: Box plots of the test accuracy of the nine Horse subdatasets.