Research Article

Unsupervised Two-Way Clustering of Metagenomic Sequences

Table 3

Comparison of performance of Gaussian mixture model (GMM) with 2-way Poisson mixture model (PMM) for datasets with low 𝛿 values. Each dataset contains 50,000 reads of length 500 bp.

Species 𝛿 GMM PMM

M. leprae, P. putida 74 75.25 85.24
B. subtilis, L. lactis 86 86.23 90.62
H. pylori, S. pneumoniae 148 53.48 98.76
H. salinarum, R. sphaeroides 153 94.63 98.51
M. jannaschii, S. aureus 164 50.0 97.75