What is DNA-based Data Storage?

As technology advances, data storage methods are becoming more diverse to suit the changing world and its needs better. Conventionally, data is stored in numerous storage centers, which are around 100,000 square feet large on average. However, a novel method of storing data in DNA is being developed, bringing a variety of new benefits and possibilities. As the name suggests, DNA-based data storage is created when data is inserted into DNA.

How is DNA-based data storage created?

Before inserting the data into the DNA, the data should be translated into the correct form. Data is coded in binary code, meaning it should be in a sequence consisting of the numbers 0 and 1. For example, an image consisting of the colors black and white can be translated into binary code easily as black can be translated into one, and white can be translated into zero. 

The second step is to convert the binary code into DNA code. Since DNA is composed of four types of nucleotides, which are A(adenine), T(thymine), G(guanine), and C(cytosine), the data has to be in a sequence consisting of these bases in order to be inserted into the DNA. To achieve this, 1s are translated into A and T, and 0s are translated into G and C.

The third step is to insert the data in the form of DNA code into a vector, a DNA molecule used to carry a DNA sequence into a cell. The vector is then inserted into E. coli and are left to grow.

The final step is to extract the vector from the E. coli colony and mix it with TFAM(transcriptional factor a mitochondria), a protein enclosing mitochondrial DNA, to increase the stability of the data.

Why use DNA-based data storage?

DNA-based data storages have many advantages over conventional data storage, making them conclusively beneficial. Here are some of the advantages of using DNA-based data storage and why they are worth being developed:

  • DNA data storages are about a million times denser than the densest data storage available today. This means that DNA data storage can make use of the storage space more efficiently, thus storing more data in the same amount of space. Because DNA data storage is denser, it has the advantage of preserving data using less energy, making it financially advantageous.
  • Tremendous amounts of data can be replicated easily using the PCR method. This decreases the time needed to copy large amounts of data and the money needed in copying.
  • DNA data storages are much more stable as they are less temperature sensitive, and not much energy is needed to maintain the storage. The storages are also able to preserve energy for a more extended period of time compared to conventional data storage

DNA-based data storage is an innovative technology with advantages and disadvantages. However, through continuous innovation and development, this technology will be able to benefit our world positively.

Check out our video to learn more!