Research Article
A Practical and Scalable Tool to Find Overlaps between Sequences
Table 5
Running SOF with large random data set using 16-core AWS machine. Number of strings in millions.
| Data set | Total size | Number of strings | Time | Space | (Minutes) |
| Random | 10 GB | 100 M | 30 | 15 GB | Random | 20 GB | 200 M | 41 | 31 GB | Random | 30 GB | 300 M | 76 | 46 GB | Random | 50 GB | 660 M | 110 | 96 GB | SRR500004 | 1.1 GB | 15 M | 3 | 2.2 GB | ERR125766 | 5 GB | 97 M | 11 | 12 GB | SRR866986 | 10 GB | 53 M | 12 | 10 GB | SRR098909 | 32 GB | 162 M | 119 | 31.2 GB |
|
|