Ariadne: synthetic long read deconvolution using assembly graphs

Genome Biol. 2023 Aug 28;24(1):197. doi: 10.1186/s13059-023-03033-5.

Abstract

Synthetic long read sequencing techniques such as UST's TELL-Seq and Loop Genomics' LoopSeq combine 3′ barcoding with standard short-read sequencing to expand the range of linkage resolution from hundreds to tens of thousands of base-pairs. However, the lack of a 1:1 correspondence between a long fragment and a 3′ unique molecular identifier confounds the assignment of linkage between short reads. We introduce Ariadne, a novel assembly graph-based synthetic long read deconvolution algorithm that can be used to extract single-species read-clouds from synthetic long read datasets to improve the taxonomic classification and de novo assembly of complex populations, such as metagenomes.
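
To make the idea of barcode deconvolution concrete, the following is a toy sketch and not Ariadne's actual implementation. It assumes each read has already been mapped to a single node of an assembly graph, and it splits the reads sharing one barcode into sub-clouds whenever their nodes lie within a chosen number of hops of each other. The names `deconvolve`, `nodes_within`, `max_hops`, and the example graph and reads are all hypothetical, chosen only for illustration.

```python
from collections import defaultdict, deque


def nodes_within(graph, start, max_hops):
    """Return every node reachable from `start` in at most `max_hops` edges (BFS)."""
    seen = {start}
    queue = deque([(start, 0)])
    while queue:
        node, dist = queue.popleft()
        if dist == max_hops:
            continue  # do not expand beyond the search radius
        for nxt in graph.get(node, ()):
            if nxt not in seen:
                seen.add(nxt)
                queue.append((nxt, dist + 1))
    return seen


def deconvolve(reads, graph, max_hops=1):
    """Split each barcode's reads into groups of reads that are close in the graph.

    reads: iterable of (read_id, barcode, graph_node) tuples (assumed input format).
    graph: undirected adjacency dict, node -> list of neighbouring nodes.
    """
    by_barcode = defaultdict(list)
    for read_id, barcode, node in reads:
        by_barcode[barcode].append((read_id, node))

    clouds = {}
    for barcode, members in by_barcode.items():
        # Union-find over the read ids that share this barcode.
        parent = {rid: rid for rid, _ in members}

        def find(x):
            while parent[x] != x:
                parent[x] = parent[parent[x]]  # path halving
                x = parent[x]
            return x

        for i, (rid_a, node_a) in enumerate(members):
            reachable = nodes_within(graph, node_a, max_hops)
            for rid_b, node_b in members[i + 1:]:
                if node_b in reachable:
                    parent[find(rid_a)] = find(rid_b)

        groups = defaultdict(list)
        for rid, _ in members:
            groups[find(rid)].append(rid)
        clouds[barcode] = sorted(sorted(g) for g in groups.values())
    return clouds


# Hypothetical example: two disconnected graph components stand in for two genomes.
GRAPH = {"A": ["B"], "B": ["A", "C"], "C": ["B"], "X": ["Y"], "Y": ["X"]}
READS = [
    ("r1", "BC1", "A"),
    ("r2", "BC1", "B"),
    ("r3", "BC1", "X"),
    ("r4", "BC1", "Y"),
    ("r5", "BC2", "C"),
]
clouds = deconvolve(READS, GRAPH, max_hops=1)
```

In this example the four reads tagged `BC1` fall into two sub-clouds, one per graph component, which mirrors the situation the abstract describes: a single barcode that tags fragments from more than one source.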

Keywords: Assembly graphs; Barcode deconvolution; Metagenomics; Synthetic long read.

Publication types

  • Research Support, Non-U.S. Gov't
  • Research Support, N.I.H., Extramural

MeSH terms

  • Algorithms*
  • Genomics
  • Metagenome
  • Pentaerythritol Tetranitrate*

Substances

  • Pentaerythritol Tetranitrate