Recent Advances in Assembly of Complex Plant Genomes

Genomics Proteomics Bioinformatics. 2023 Jun;21(3):427-439. doi: 10.1016/j.gpb.2023.04.004. Epub 2023 Apr 25.

Abstract

Over the past 20 years, tremendous advances in sequencing technologies and computational algorithms have spurred plant genomic research into a thriving era, with hundreds of genomes already decoded, ranging from those of nonvascular plants to those of flowering plants. However, complex plant genomes remain difficult to fully resolve with conventional sequencing and assembly methods because of their high heterozygosity, highly repetitive sequences, or high ploidy. Herein, we summarize the challenges of and advances in complex plant genome assembly, including feasible experimental strategies, upgrades to sequencing technology, existing assembly methods, and different phasing algorithms. Moreover, we list actual cases of complex genome projects that readers can refer to and draw upon when solving future problems related to complex genomes. Finally, we expect that accurate, gapless, telomere-to-telomere, and fully phased assembly of complex plant genomes could soon become routine.

Keywords: Assembly algorithm; Complex plant genome; Haplotype-resolved assembly; Sequencing technology; Telomere-to-telomere genome.

Publication types

  • Review

MeSH terms

  • Algorithms
  • Genome, Plant*
  • Genomics*
  • High-Throughput Nucleotide Sequencing
  • Plants / genetics
  • Sequence Analysis, DNA