Building pan-genome infrastructures for crop plants and their use in association genetics

DNA Res. 2021 Jan 19;28(1):dsaa030. doi: 10.1093/dnares/dsaa030.

Abstract

Pan-genomic studies aim at representing the entire sequence diversity within a species to provide useful resources for evolutionary studies, functional genomics and breeding of cultivated plants. Cost reductions in high-throughput sequencing and advances in sequence assembly algorithms have made it possible to create multiple reference genomes along with a catalogue of all forms of genetic variations in plant species with large and complex or polyploid genomes. In this review, we summarize the current approaches to building pan-genomes as an in silico representation of plant sequence diversity and outline relevant methods for their effective utilization in linking structural with phenotypic variation. We propose as future research avenues (i) transcriptomic and epigenomic studies across multiple reference genomes and (ii) the development of user-friendly and feature-rich pan-genome browsers.

Keywords: association genetics; crop plants; genome sequencing; genomics; pan-genome.

Publication types

  • Review

MeSH terms

  • Computational Biology
  • Epigenomics
  • Gene Expression Profiling
  • Genetic Variation
  • Genome, Plant*
  • Genomics / methods*
  • High-Throughput Nucleotide Sequencing*
  • Plants / genetics*
  • Sequence Analysis, DNA
  • Sequence Analysis, RNA
  • Transcriptome