Research Article

Utterance Clustering Using Stereo Audio Channels

Figure 2

t-SNE visualization for seven speakers’ feature vectors in the condition in which audio contains overlapping. Different colors represent different speakers. (a) t-SNE visualization of d-vectors’ clusters for speakers’ mono signals, (b) t-SNE visualization of d-vectors’ clusters for speakers’ mstack processed signals, (c) t-SNE visualization of d-vectors’ clusters for speakers’ hstack processed signals, and (d) t-SNE visualization of d-vectors’ clusters for speakers’ sumdif processed signals.
(a)
(b)
(c)
(d)