The Genome of Cucurbita argyrosperma (Silver-Seed Gourd) Reveals Faster Rates of Protein-Coding Gene and Long Noncoding RNA Turnover and Neofunctionalization within Cucurbita

Mol Plant. 2019 Apr 1;12(4):506-520. doi: 10.1016/j.molp.2018.12.023. Epub 2019 Jan 8.

Abstract

Whole-genome duplications are an important source of evolutionary novelties that change the mode and tempo at which genetic elements evolve within a genome. The Cucurbita genus experienced a whole-genome duplication around 30 million years ago, although the evolutionary dynamics of the coding and noncoding genes in this genus have not yet been scrutinized. Here, we analyzed the genomes of four Cucurbita species, including a newly assembled genome of Cucurbita argyrosperma, and compared the gene contents of these species with those of five other members of the Cucurbitaceae family to assess the evolutionary dynamics of protein-coding and long intergenic noncoding RNA (lincRNA) genes after the genome duplication. We report that Cucurbita genomes have a higher protein-coding gene birth-death rate compared with the genomes of the other members of the Cucurbitaceae family. C. argyrosperma gene families associated with pollination and transmembrane transport had significantly faster evolutionary rates. lincRNA families showed high levels of gene turnover throughout the phylogeny, and 67.7% of the lincRNA families in Cucurbita showed evidence of birth from the neofunctionalization of previously existing protein-coding genes. Collectively, our results suggest that the whole-genome duplication in Cucurbita resulted in faster rates of gene family evolution through the neofunctionalization of duplicated genes.

Keywords: Cucurbita argyrosperma; comparative genomics; long noncoding RNA; molecular evolution; neofunctionalization; whole-genome duplication.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Cucurbita / genetics*
  • Evolution, Molecular
  • Genes, Plant / genetics*
  • Kinetics
  • Phylogeny
  • Plant Proteins / genetics*
  • RNA, Long Noncoding / genetics*

Substances

  • Plant Proteins
  • RNA, Long Noncoding