Content-based filter queries on DNA data storage systems

Sci Rep. 2023 Apr 29;13(1):7053. doi: 10.1038/s41598-023-34160-5.

Abstract

Recent developments in DNA data storage systems have revealed the great potential to store large amounts of data at a very high density with extremely long persistence and low cost. However, despite recent contributions to robust data encoding, current DNA storage systems offer limited support for random access on DNA storage devices due to restrictive biochemical constraints. Moreover, state-of-the-art approaches do not support content-based filter queries on DNA storage. This paper introduces the first encoding for DNA that enables content-based searches on structured data like relational database tables. We provide the details of the methods for coding and decoding millions of directly accessible data objects on DNA. We evaluate the derived codes on real data sets and verify their robustness.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • DNA* / genetics
  • Databases, Factual
  • Information Storage and Retrieval*

Substances

  • DNA