175745.fig.0017
Figure 17: DET plot of voice activity detection performance for Dataset 2: solid lines—audio + video combinations, dashed lines—audio and video systems individually. VAD based on both audio and video modalities (audio + video no. 1) indicates better performance than audio-only VAD.