Research Article

Discovering the Unknown: Improving Detection of Novel Species and Genera from Short Reads

Table 5

The eight genera with the most matched reads among the reads that passed each genus-resolution detector for the red Soudan acid mine drainage dataset, using the 635-genome training database.

NBC detector                    PhymmBL detector                SOrt-ITEMS
Organism         Matched reads  Organism         Matched reads  Organism         Matched reads

Marinobacter     40             Dinoroseobacter  101            Marinobacter     476
Dinoroseobacter  24             Marinobacter     73             Gramella         388
Rhodobacter      23             Ruegeria         59             Dinoroseobacter  297
Shewanella       20             Rhodobacter      41             Rhodobacter      264
Ruegeria         19             Shewanella       41             Flavobacterium   161
Paracoccus       9              Pseudomonas      26             Pseudomonas      131
Desulfotalea     4              Bacillus         21             Alkalilimnicola  111
Bartonella       4              Clostridium      21             Roseobacter      101