Improvements in the sequencing and assembly of plant genomes

GigaByte. 2021 Jun 10:2021:gigabyte24. doi: 10.46471/gigabyte.24. eCollection 2021.

Abstract

Advances in DNA sequencing have made it easier to sequence and assemble plant genomes. Here, we extend an earlier study, and compare recent methods for long read sequencing and assembly. Updated Oxford Nanopore Technology software improved assemblies. Using more accurate sequences produced by repeated sequencing of the same molecule (Pacific Biosciences HiFi) resulted in less fragmented assembly of sequencing reads. Using data for increased genome coverage resulted in longer contigs, but reduced total assembly length and improved genome completeness. The original model species, Macadamia jansenii, was also compared with three other Macadamia species, as well as avocado (Persea americana) and jojoba (Simmondsia chinensis). In these angiosperms, increasing sequence data volumes caused a linear increase in contig size, decreased assembly length and further improved already high completeness. Differences in genome size and sequence complexity influenced the success of assembly. Advances in long read sequencing technology continue to improve plant genome sequencing and assembly. However, results were improved by greater genome coverage, with the amount needed to achieve a particular level of assembly being species dependent.

Publication types

  • Comment

Grants and funding

This research received funding from the Hort Frontiers Advanced Production Systems Fund, Hort Frontiers Strategic Partnership, Hort Innovation, with University of Queensland and the Australian Government as part of National Tree Genomics Program, AS17000 Genomics Resources Toolbox, to R Henry; and from King Faisal University, Jojoba Genomics Project to RJ Henry, A Furtado and A Kharabian Masouleh.