Yerba mate (Ilex paraguariensis, A. St.-Hil.) de novo transcriptome assembly based on tissue specific genomic expression profiles

BMC Genomics. 2018 Dec 7;19(1):891. doi: 10.1186/s12864-018-5240-6.

Abstract

Background: The most common infusion in southern Latin-American countries is prepared with dried leaves of Ilex paraguariensis A. St.-Hil., an aboriginal ancestral beverage known for its high polyphenols concentration currently consumed in > 90% of homes in Argentina, in Paraguay and Uruguay. The economy of entire provinces heavily relies on the production, collection and manufacture of Ilex paraguariensis, the fifth plant species with highest antioxidant activity. Polyphenols are associated to relevant health benefits including strong antioxidant properties. Despite its regional relevance and potential biotechnological applications, little is known about functional genomics and genetics underlying phenotypic variation of relevant traits. By generating tissue specific transcriptomic profiles, we aimed to comprehensively annotate genes in the Ilex paraguariensis phenylpropanoid pathway and to evaluate differential expression profiles.

Results: In this study we generated a reliable transcriptome assembly based on a collection of 15 RNA-Seq libraries from different tissues of Ilex paraguariensis. A total of 554 million RNA-Seq reads were assembled into 193,897 transcripts, where 24,612 annotated full-length transcripts had complete ORF. We assessed the transcriptome assembly quality, completeness and accuracy using BUSCO and TransRate; consistency was also evaluated by experimentally validating 11 predicted genes by PCR and sequencing. Functional annotation against KEGG Pathway database identified 1395 unigenes involved in biosynthesis of secondary metabolites, 531 annotated transcripts corresponded to the phenylpropanoid pathway. The top 30 differentially expressed genes among tissue revealed genes involved in photosynthesis and stress response. These significant differences were then validated by qRT-PCR.

Conclusions: Our study is the first to provide data from whole genome gene expression profiles in different Ilex paraguariensis tissues, experimentally validating in-silico predicted genes key to the phenylpropanoid (antioxidant) pathway. Our results provide essential genomic data of potential use in breeding programs for polyphenol content. Further studies are necessary to assess if the observed expression variation in the phenylpropanoid pathway annotated genes is related to variations in leaves' polyphenol content at the population scale. These results set the current reference for Ilex paraguariensis genomic studies and provide a substantial contribution to research and biotechnological applications of phenylpropanoid secondary metabolites.

Keywords: Expression profile; Functional annotation; Ilex paraguariensis; Phenylpropanoid; Transcriptome; Yerba mate; de-novo assembly.

MeSH terms

  • Gene Expression Regulation, Plant
  • Gene Ontology
  • Genes, Plant
  • Genome, Plant*
  • Ilex paraguariensis / genetics*
  • Molecular Sequence Annotation
  • Organ Specificity / genetics*
  • Plant Leaves / genetics
  • Plant Roots / genetics
  • RNA, Messenger / genetics
  • RNA, Messenger / metabolism
  • Reproducibility of Results
  • Secondary Metabolism / genetics
  • Sequence Analysis, RNA / methods*
  • Transcriptome / genetics*

Substances

  • RNA, Messenger