Figure 9:
Comparison between audio and visual features.