NextDenovo: an efficient error correction and accurate assembly tool for noisy long reads

Genome Biol. 2024 Apr 26;25(1):107. doi: 10.1186/s13059-024-03252-4.

Abstract

Long-read sequencing data, particularly those derived from the Oxford Nanopore sequencing platform, tend to exhibit high error rates. Here, we present NextDenovo, an efficient error correction and assembly tool for noisy long reads, which achieves a high level of accuracy in genome assembly. We apply NextDenovo to assemble 35 diverse human genomes from around the world using Nanopore long-read data. These genomes allow us to identify the landscape of segmental duplication and gene copy number variation in modern human populations. The use of NextDenovo should pave the way for population-scale long-read assembly using Nanopore long-read data.

Keywords: Error-correction; Genome assembly; Human genomes; Long reads; Segmental duplication.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • DNA Copy Number Variations*
  • Genome, Human*
  • Genomics / methods
  • High-Throughput Nucleotide Sequencing / methods
  • Humans
  • Nanopore Sequencing / methods
  • Sequence Analysis, DNA / methods
  • Software