175745.fig.0016
Figure 16: DET plot of voice activity detection performance for Dataset 1: solid lines—audio + video combinations, dashed lines—audio and video systems individually. VAD based on both audio and video modalities (audio + video no. 1) indicates better performance than audio-only VAD for most of the operating points.