Research Article

An Optimal Seed Based Compression Algorithm for DNA Sequences

Table 2

Comparison of compression ratios of the proposed method against existing methods [2, 8, 12, 18, 19, 21, 22, 24].

SequenceLengthCDNAGeMNLBiocCTW + LZGenCDNACDNAPXMProposed seed based method

HUMDYSTROP 38,7701.931.90851.92621.91751.92311.91161.90881.90311.8624
HUMGHCSA 66,4960.951.00891.30721.09721.09691.02721.6390.98281.0156
HUMHBB 73,3081.77ā€”1.88001.80821.82041.78971.77711.75131.7364
HUMHDABCD 58,8631.671.70591.87701.82181.81921.79511.73941.66711.6237
HUMHPRTB 56,8321.721.76391.90661.84331.84661.81651.78861.73611.688
MPOMTCG 1,86,6091.871.88221.93781.90001.90581.89201.89321.87681.763
VACCG 1,91,7351.811.76441.76141.76161.76141.75801.75831.67491.6434