Scaling logical density of DNA storage with enzymatically-ligated composite motifs

Sci Rep. 2023 Sep 25;13(1):15978. doi: 10.1038/s41598-023-43172-0.

Abstract

DNA is a promising candidate for long-term data storage due to its high density and endurance. The key challenge in DNA storage today is the cost of synthesis. In this work, we propose composite motifs, a framework that uses a mixture of prefabricated motifs as building blocks to reduce synthesis cost by scaling logical density. To write data, we introduce Bridge Oligonucleotide Assembly, an enzymatic ligation technique for synthesizing oligos based on composite motifs. To sequence data, we introduce Direct Oligonucleotide Sequencing, a nanopore-based technique to sequence short oligos, eliminating common preparatory steps like DNA assembly, amplification and end-prep. To decode data, we introduce Motif-Search, a novel consensus caller that provides accurate reconstruction despite synthesis and sequencing errors. Using the proposed methods, we present an end-to-end experiment where we store the text "HelloWorld" at a logical density of 84 bits/cycle (14-42× improvement over state-of-the-art).

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Consensus
  • DNA*
  • Nanopores*
  • Nutritional Status
  • Oligonucleotides

Substances

  • DNA
  • Oligonucleotides