Review Article

New Trends of Digital Data Storage in DNA

Table 1

Comparison of encoding models.

Encoding model | Advantages | Disadvantages

Microvenus project | Laid the foundation for storing abiotic information in DNA | Inaccurate and not uniquely decodable

Genesis project | Laid the groundwork for exploring the intricate relationship between biology, belief systems, information technology, dialogical interaction, ethics, and the Internet | Inaccurate, as the original sentence was altered by mutation in the presence of ultraviolet light

PCR based encoding models | High security because of the size of the microdots; even if an adversary identifies the microdot, recovering the message would be extremely difficult without knowledge of the primer sequence | Errors inserted in the template region can make the encoded data unrecoverable; knowledge of the primers is required; widespread experimental obstacles and practical problems; PCR is required; data breakage can occur during encoding and decoding through human error

Alignment based encoding models | Independent of the polymerase chain reaction (PCR); greater speed and lower cost of reading DNA data, and lower cost of synthetic DNA; positions of data breakages can be identified easily from the alignment results, although the broken data cannot be recovered | Multiplication of cassettes leads to redundant volumes; parity costs a certain volume of the data sequence; the data recovery rate is fragile and varies with the extent of data breakage caused by long-range DNA deletions; the entire genome must be sequenced to retrieve the data; the size of the cassette oligonucleotides used to encode the message is limited, since beyond a certain limit the sequence may appear by chance in the host genome

Rewritable and random access based DNA storage system | Random access to DNA data blocks, which enables nonlinear access; information can be rewritten at arbitrary locations; cross-hybridization problems are eliminated by prohibiting redundancy of information; suited to frequently updated data whose editing history must be retained | High cost

Next generation digital information storage | One-bit-per-base representation; high scalability; high data storage density; high reliability; each copy can correct the errors in the other copies, as the errors are almost never coextensive | Prohibitive cost; reading from and writing to DNA is slow

Encoding scheme for small text files | High data storage density; does not need ample context information for encoding; maximum compression efficiency; reduced cost | The biological protocols for inserting the sequence into a bacterial genome have not yet been implemented
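
To make the one-bit-per-base representation listed under next generation digital information storage in Table 1 more concrete, the following is a minimal sketch and not the published implementation. It assumes a mapping in which bit 0 is written as A or C and bit 1 as G or T, and it assumes that the free choice between the two bases is used to avoid repeating the previous base. The function names, the UTF-8 text conversion, and the seeded random generator are hypothetical and serve only the illustration.

```python
import random

# Assumed mapping (illustrative): bit 0 -> A or C, bit 1 -> G or T.
ZERO_BASES = "AC"
ONE_BASES = "GT"


def text_to_bits(text: str) -> str:
    """Convert text to a bit string, 8 bits per UTF-8 byte."""
    return "".join(f"{byte:08b}" for byte in text.encode("utf-8"))


def bits_to_text(bits: str) -> str:
    """Inverse of text_to_bits."""
    data = bytes(int(bits[i:i + 8], 2) for i in range(0, len(bits), 8))
    return data.decode("utf-8")


def encode_bits(bits: str, rng: random.Random) -> str:
    """Write one base per bit, never repeating the previous base."""
    bases = []
    for bit in bits:
        if bit not in "01":
            raise ValueError(f"not a bit: {bit!r}")
        choices = ZERO_BASES if bit == "0" else ONE_BASES
        # Each bit has two candidate bases, so at least one always differs
        # from the previous base; this avoids homopolymer runs.
        options = [b for b in choices if not bases or b != bases[-1]]
        bases.append(rng.choice(options))
    return "".join(bases)


def decode_bases(dna: str) -> str:
    """Read one bit per base using the assumed mapping."""
    bits = []
    for base in dna:
        if base in ZERO_BASES:
            bits.append("0")
        elif base in ONE_BASES:
            bits.append("1")
        else:
            raise ValueError(f"unexpected base: {base!r}")
    return "".join(bits)


if __name__ == "__main__":
    rng = random.Random(0)  # seeded only to make the sketch repeatable
    dna = encode_bits(text_to_bits("DNA"), rng)
    print(dna, "->", bits_to_text(decode_bases(dna)))
```

Because each bit can be written with either of two bases, the encoder has one degree of freedom per position. The sketch spends it on avoiding repeated bases, which also illustrates why the scheme stores only one bit per base rather than the two bits a four-letter alphabet could carry.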