Research Article

HaVec: An Efficient de Bruijn Graph Construction Algorithm for Genome Assembly

Table 5

Results for HaVec.

File namekHash table index6 bytes per k-mer runtime (sec)5 bytes per k-mer runtime (sec)6 bytes per k-mer total memory (MB)5 bytes per k-mer total memory (MB)Unique k-mer in hash tableUnique k-mer in vectorTotal unique k-mer

50 m.fa273,558,218,0932261.75216321445.939218052.6082,358,010,00414,135,3902,372,145,394
50 m.fa323,283,745,65121832038.519228.876816648.49922,176,128,06013,035,6892,189,163,749
Ecoli_MG1655_s_6_1_bfast.fasta2720,115,58770.2570144.896125.747213,330,62279,76813,410,390
Ecoli_MG1655_s_6_1_bfast.fasta3220,693,34169.37568.75148.3776128.614413,713,60681,95513,795,561
Ecoli_MG1655_s_6_2_bfast.fasta27196,614,919488.5486.51205.86241018.368130,293,923782,686131,076,609
Ecoli_MG1655_s_6_2_bfast.fasta32200,937,899480.75476.51231.8721040.2816133,158,739799,832133,958,571
Human1_95G_CASAVA1.8a2_NCBI37_18Jan11_chr19.sorted.fasta27313,251,713601.5593.251909.04321610.24207,579,1661,255,278208,834,444
Human1_95G_CASAVA1.8a2_NCBI37_18Jan11_chr19.sorted.fasta32334,345,241611.625592.752035.60961716.736221,565,8351,330,984222,896,819
Human1_95G_CASAVA1.8a2_NCBI37_18Jan11_chr21.sorted.fasta27199,165,411371.5370.6251221.42721031.4752131,981,455795,486132,776,941
Human1_95G_CASAVA1.8a2_NCBI37_18Jan11_chr21.sorted.fasta32207,852,223374.75374.51273.44641075.2137,741,484826,661138,568,145
NA19240_GAIIx_100_chr21.fasta27163,949,171508.625513.51009.5616853.1968108,643,167656,266109,299,433
NA19240_GAIIx_100_chr21.fasta32170,662,721549504.6251016.3712886.9888113,094,644680,508113,775,152
dataset_1_7GB.fa27199,165,411395.25368.1251221.42721031.4752131,981,455795,486132,776,941
dataset_1_7GB.fa32207,852,223373.753671273.44641075.2137,741,484826,661138,568,145
dataset_1_9GB.fa27163,949,171516507.1251009.5616853.1968108,643,167656,266109,299,433
dataset_1_9GB.fa32170,662,721511.375507.6251049.8048886.9888113,094,644680,508113,775,152