De Novo Transcriptomes of Forsythia koreana Using a Novel Assembly Method: Insight into Tissue- and Species-Specific Expression of Lignan Biosynthesis-Related Gene

PLoS One. 2016 Oct 21;11(10):e0164805. doi: 10.1371/journal.pone.0164805. eCollection 2016.

Abstract

Forsythia spp. are perennial woody plants which are one of the most extensively used medicinal sources of Chinese medicines and functional diets owing to their lignan contents. Lignans have received widespread attention as leading compounds in the development of antitumor drugs and healthy diets for reducing the risks of lifestyle-related diseases. However, the molecular basis of Forsythia has yet to be established. In this study, we have verified de novo deep transcriptome of Forsythia koreana leaf and callus using the Illumina HiSeq 1500 platform. A total of 89 million reads were assembled into 116,824 contigs using Trinity, and 1,576 of the contigs displayed the sequence similarity to the enzymes responsible for plant specialized metabolism including lignan biosynthesis. Notably, gene ontology (GO) analysis indicated the remarkable enrichment of lignan-biosynthetic enzyme genes in the callus transcriptome. Nevertheless, precise annotation and molecular phylogenetic analyses were hindered by partial sequences of open reading frames (ORFs) of the Trinity-based contigs. To obtain more numerous contigs harboring a full-length ORF, we developed a novel overlapping layout consensus-based procedure, virtual primer-based sequence reassembly (VP-seq). VP-seq elucidated 709 full-length ORFs, whereas only 146 full-length ORFs were assembled by Trinity. The comparison of expression profiles of leaf and callus using VP-seq-based full-length ORFs revealed 50-fold upregulation of secoisolariciresinol dehydrogenase (SIRD) in callus. Expression and phylogenetic cluster analyses predicted candidates for matairesinol-glucosylating enzymes. We also performed VP-seq analysis of lignan-biosynthetic enzyme genes in the transcriptome data of other lignan-rich plants, Linum flavum, Linum usitatissimum and Podophyllum hexandrum. The comparative analysis indicated both common gene clusters involved in biosynthesis upstream of matairesinol such as SIRD and plant lineage-specific gene clusters, in particular, genes responsible for biosynthetic pathways for production of podophyllotoxin; CYP71BE54, a key enzyme gene for podophyllotoxin biosynthesis in P. hexandrum, was not found in L. flavum, although both P. hexandrum. and L. flavum yield podophyllotoxin. Altogether, these data have established the fruitful molecular basis of Forsythia and provided insight into the molecular evolution and diversity of lignan biosynthetic pathways.

MeSH terms

  • Amino Acid Sequence
  • Forsythia / classification
  • Forsythia / genetics*
  • Genes, Plant
  • Lignans / biosynthesis*
  • Open Reading Frames
  • Sequence Analysis, RNA
  • Sequence Homology, Amino Acid
  • Transcriptome*

Substances

  • Lignans

Grants and funding

This work was funded by the Plant Factory project of the Ministry of Economy, Trade and Industry, Japan. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.