Research Article

Unsupervised Two-Way Clustering of Metagenomic Sequences

Table 2

Performance of the Poisson mixture model on each dataset for different values of 𝐿 and a word length of 5. Here, N.W.G stands for no word grouping. The maximum accuracy achieved is in bold. Each dataset contains 50,000 reads of length 500 bp.

Species                                               𝐿 = 5   𝐿 = 10   𝐿 = 30   𝐿 = 50   N.W.G
B. anthracis CI chromosome, B. halodurans C-125       90.61   91.53    50.31    91.2     50.32
H. pylori 26695, S. pneumoniae 70585                  98.6    98.79    98.73    98.71    98.76
B. subtilis subsp. spizizenii str., L. lactis subsp.  89.96   90.34    90.62    90.53    50.47