Metagenome Fragment Classification Using -Mer Frequency Profiles
Table 3
Comparison of the top 10 reads from the naive Bayes analysis of the Sargasso
Sea set for 9 mers and 15 mers and a side-by-side comparison with MEGAN results.
There are 7 common strains between the naive Bayes sets substantiating their
presence in the sample. Not all NBC “best matches” are found in MEGAN
(indicated by “None”), and this can be due to “no hits” or to not having
that strain in the database. An interesting NBC find is that
Trichodesmium erythraeum has been found to compose 0.6% of the sample. It has been extensively found in
the Sargasso Sea, but no prior methods show this presence in the Sargasso Sea
data set.
9 mers
15 mers
High-strain content in sample (genome size of both sides)