Plant evolution and environmental adaptation unveiled by long-read whole-genome sequencing of Spirodela

Proc Natl Acad Sci U S A. 2019 Sep 17;116(38):18893-18899. doi: 10.1073/pnas.1910401116. Epub 2019 Sep 4.

Abstract

Aquatic plants have to adapt to the environments distinct from where land plants grow. A critical aspect of adaptation is the dynamics of sequence repeats, not resolved in older sequencing platforms due to incomplete and fragmented genome assemblies from short reads. Therefore, we used PacBio long-read sequencing of the Spirodela polyrhiza genome, reaching a 44-fold increase of contiguity with an N50 (a median of contig lengths) of 831 kb and filling 95.4% of gaps left from the previous version. Reconstruction of repeat regions indicates that sequentially nested long terminal repeat (LTR) retrotranspositions occur early in monocot evolution, featured with both prokaryote-like gene-rich regions and eukaryotic repeat islands. Protein-coding genes are reduced to 18,708 gene models supported by 492,435 high-quality full-length PacBio complementary DNA (cDNA) sequences. Different from land plants, the primitive architecture of Spirodela's adventitious roots and lack of lateral roots and root hairs are consistent with dispensable functions of nutrient absorption. Disease-resistant genes encoding antimicrobial peptides and dirigent proteins are expanded by tandem duplications. Remarkably, disease-resistant genes are not only amplified, but also highly expressed, consistent with low levels of 24-nucleotide (nt) small interfering RNA (siRNA) that silence the immune system of land plants, thereby protecting Spirodela against a wide spectrum of pathogens and pests. The long-read sequence information not only sheds light on plant evolution and adaptation to the environment, but also facilitates applications in bioenergy and phytoremediation.

Keywords: aquatic adaptation; disease resistance; long reads; root evolution; tandem duplication.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Adaptation, Physiological / genetics*
  • Aquatic Organisms / genetics
  • Aquatic Organisms / physiology
  • Araceae / anatomy & histology
  • Araceae / genetics*
  • Araceae / physiology
  • DNA, Plant / genetics
  • Disease Resistance / genetics
  • Evolution, Molecular
  • Gene Expression Profiling
  • Genome, Plant / genetics*
  • Plant Proteins / genetics
  • Plant Roots / anatomy & histology
  • Plant Roots / genetics
  • Plant Roots / physiology
  • Sequence Analysis, DNA
  • Tandem Repeat Sequences

Substances

  • DNA, Plant
  • Plant Proteins

Associated data

  • GENBANK/SWLF00000000
  • GENBANK/SRX5321175