Real-Time Audio-Visual Analysis for Multiperson Videoconferencing
Figure 9
Spatiotemporal fingerprint processing. Each column of bits (zeros and ones) represents a spatial fingerprint, a union of several consequent columns represents a spatiotemporal fingerprint. Ones correspond to voice activity; zeros correspond to silence. Horizontal bit position defines instant in time. Vertical bit position defines azimuth with respect to microphone array.