Plant Pan-Genomics Comes of Age

Annu Rev Plant Biol. 2021 Jun 17:72:411-435. doi: 10.1146/annurev-arplant-080720-105454. Epub 2021 Apr 13.

Abstract

A pan-genome is the nonredundant collection of genes and/or DNA sequences in a species. Numerous studies have shown that plant pan-genomes are typically much larger than the genome of any individual and that a sizable fraction of the genes in any individual are present in only some genomes. The construction and interpretation of plant pan-genomes are challenging due to the large size and repetitive content of plant genomes. Most pan-genomes are largely focused on nontransposable element protein coding genes because they are more easily analyzed and defined than noncoding and repetitive sequences. Nevertheless, noncoding and repetitive DNA play important roles in determining the phenotype and genome evolution. Fortunately, it is now feasible to make multiple high-quality genomes that can be used to construct high-resolution pan-genomes that capture all the variation. However, assembling, displaying, and interacting with such high-resolution pan-genomes will require the development of new tools.

Keywords: natural variation; pan-genome; population genetics; sequence graph; structural variation.

Publication types

  • Research Support, U.S. Gov't, Non-P.H.S.

MeSH terms

  • Genome, Plant*
  • Genomics*