Skip to content

Latest commit

 

History

History
35 lines (24 loc) · 2.91 KB

Section4.md

File metadata and controls

35 lines (24 loc) · 2.91 KB

DNA-based data storage

DNA molecules, as the carriers of genetic information for billions of years in living organisms, are now also considered as a potential storage medium for digital information in the rapidly developing field of big data. Compared to conventional storage media such as optical discs, hard drives, and flash drives, DNA molecules have a series of advantages including high information storage density, low maintenance costs, long-term stability, and data security. Typically, the process of DNA storage involves 5-6 steps, with research on encoding and decoding methods leaning more towards information technology, while the synthesis, sequencing, and manipulation of DNA require more biological technology research. We have made significant progress in almost every step of the DNA storage process. In terms of encoding, we are dedicated to developing efficient and stable encoding algorithms. For decoding, we have developed graph-based data recovery and error correction algorithms to address the inevitable experimental errors in DNA storage. Regarding DNA information writing, we have developed an information writing method based on a standard reusable library, inspired by the principles of movable type printing. For DNA information reading, we are researching real-time base prediction and error correction methods to enable instant information retrieval.

overview of DNA-based data storage

  • [2019.06.20] Carbon-based archiving: current progress and future prospects of DNA-based data storage. GigaScience [review]

  • [2021.02.03] Chamaeleo: an integrated evaluation platform for DNA storage. Synthetic Biology Journal [software]

  • [2022.03.18] A new era of mass data storage in artificial chromosome. Science China Life Sciences [hightlight]

  • [2022.04.25] Towards practical and robust DNA-based data archiving using the yin–yang codec system. Nature Computational Science [article]

  • [2023.02.03] Mobile and self‐sustained data storage in an extremophile genomic DNA. Advanced Science [article]

  • [2023.03.30] SPIDER-WEB generates coding algorithms with superior error tolerance and real-time information retrieval capacity. ArXiv [preprint]

  • [2024.03.28] DNA Bloom Filter enables anti-contamination and file version control for DNA-based data storage. Briefings in Bioinformatics [article]