Whole-Genome Identification and Analysis of Multiple Gene Families Reveal Candidate Genes for Theasaponin Biosynthesis in Camellia oleifera

Int J Mol Sci. 2022 Jun 7;23(12):6393. doi: 10.3390/ijms23126393.

Abstract

Camellia oleifera is an economically important oilseed tree. Seed meals of C. oleifera have a long history of use as biocontrol agents in shrimp farming and as cleaning agents in peoples' daily lives due to the presence of theasaponins, the triterpene saponins from the genus Camellia. To characterize the biosynthetic pathway of theasaponins in C. oleifera, members of gene families involved in triterpenoid biosynthetic pathways were identified and subjected to phylogenetic analysis with corresponding members in Arabidopsis thaliana, Camellia sinensis, Actinidia chinensis, Panax ginseng, and Medicago truncatula. In total, 143 triterpenoid backbone biosynthetic genes, 1169 CYP450s, and 1019 UGTs were identified in C. oleifera. The expression profiles of triterpenoid backbone biosynthetic genes were analyzed in different tissue and seed developmental stages of C. oleifera. The results suggested that MVA is the main pathway for triterpenoid backbone biosynthesis. Moreover, the candidate genes for theasaponin biosynthesis were identified by WGCNA and qRT-PCR analysis; these included 11 CYP450s, 14 UGTs, and eight transcription factors. Our results provide valuable information for further research investigating the biosynthetic and regulatory network of theasaponins.

Keywords: Camellia oleifera; biosynthesis; regulation; theasaponin; triterpenoid saponin.

MeSH terms

  • Camellia* / genetics
  • Camellia* / metabolism
  • Phylogeny
  • Saponins* / metabolism
  • Seeds
  • Triterpenes* / metabolism

Substances

  • Saponins
  • Triterpenes