Comparing assembly strategies for third-generation sequencing technologies across different genomes

Genomics. 2023 Sep;115(5):110700. doi: 10.1016/j.ygeno.2023.110700. Epub 2023 Aug 18.

Abstract

The recent advent of long-read sequencing technologies, such as Pacific Biosciences (PacBio) and Oxford Nanopore technology (ONT), has led to substantial accuracy and computational cost improvements. However, de novo whole-genome assembly still presents significant challenges related to the computational cost and the quality of the results. Accordingly, sequencing accuracy and throughput continue to improve, and many tools are constantly emerging. Therefore, selecting the correct sequencing platform, the proper sequencing depth and the assembly tools are necessary to perform high-quality assembly. This paper evaluates the primary assembly reconstruction from recent hybrid and non-hybrid pipelines on different genomes. We find that using PacBio high-fidelity long-read (HiFi) plays an essential role in haplotype construction with respect to ONT reads. However, we observe a substantial improvement in the correctness of the assembly from high-fidelity ONT datasets and combining it with HiFi or short-reads.

Keywords: Genome assembly; HiFi; Hybrid assembly; ONT; PacBio.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Genome*
  • High-Throughput Nucleotide Sequencing* / methods
  • Sequence Analysis, DNA / methods