Long-Read DNA Sequencing: Recent Advances and Remaining Challenges

Annu Rev Genomics Hum Genet. 2023 Aug 25:24:109-132. doi: 10.1146/annurev-genom-101722-103045. Epub 2023 Apr 19.

Abstract

DNA sequencing has revolutionized medicine over recent decades. However, analysis of large structural variation and repetitive DNA, a hallmark of human genomes, has been limited by short-read technology, with read lengths of 100-300 bp. Long-read sequencing (LRS) permits routine sequencing of human DNA fragments tens to hundreds of kilobase pairs in size, using both real-time sequencing by synthesis and nanopore-based direct electronic sequencing. LRS permits analysis of large structural variation and haplotypic phasing in human genomes and has enabled the discovery and characterization of rare pathogenic structural variants and repeat expansions. It has also recently enabled the assembly of a complete, gapless human genome that includes previously intractable regions, such as highly repetitive centromeres and homologous acrocentric short arms. With the addition of protocols for targeted enrichment, direct epigenetic DNA modification detection, and long-range chromatin profiling, LRS promises to launch a new era of understanding of genetic diversity and pathogenic mutations in human populations.

Keywords: epigenetic modifications; long-read sequencing; pathogenic mutations; repetitive DNA; structural variants.

Publication types

  • Review

MeSH terms

  • Base Sequence
  • DNA* / genetics
  • Humans
  • Mutation
  • Repetitive Sequences, Nucleic Acid*
  • Sequence Analysis, DNA / methods

Substances

  • DNA