Transcriptome analysis of the oil-rich tea plant, Camellia oleifera, reveals candidate genes related to lipid metabolism

PLoS One. 2014 Aug 19;9(8):e104150. doi: 10.1371/journal.pone.0104150. eCollection 2014.

Abstract

Background: Rapidly driven by the need for developing sustainable sources of nutritionally important fatty acids and the rising concerns about environmental impacts after using fossil oil, oil-plants have received increasing awareness nowadays. As an important oil-rich plant in China, Camellia oleifera has played a vital role in providing nutritional applications, biofuel productions and chemical feedstocks. However, the lack of C. oleifera genome sequences and little genetic information have largely hampered the urgent needs for efficient utilization of the abundant germplasms towards modern breeding efforts of this woody oil-plant.

Results: Here, using the 454 GS-FLX sequencing platform, we generated approximately 600,000 RNA-Seq reads from four tissues of C. oleifera. These reads were trimmed and assembled into 104,842 non-redundant putative transcripts with a total length of ∼38.9 Mb, representing more than 218-fold of all the C. oleifera sequences currently deposited in the GenBank (as of March 2014). Based on the BLAST similarity searches, nearly 42.6% transcripts could be annotated with known genes, conserved domains, or Gene Ontology (GO) terms. Comparisons with the cultivated tea tree, C. sinensis, identified 3,022 pairs of orthologs, of which 211 exhibited the evidence under positive selection. Pathway analysis detected the majority of genes potentially related to lipid metabolism. Evolutionary analysis of omega-6 fatty acid desaturase (FAD2) genes among 20 oil-plants unexpectedly suggests that a parallel evolution may occur between C. oleifera and Olea oleifera. Additionally, more than 2,300 simple sequence repeats (SSRs) and 20,200 single-nucleotide polymorphisms (SNPs) were detected in the C. oleifera transcriptome.

Conclusions: The generated transcriptome represents a considerable increase in the number of sequences deposited in the public databases, providing an unprecedented opportunity to discover all related-genes associated with lipid metabolic pathway in C. oleifera. It will greatly enhance the generation of new varieties of C. oleifera with increased yields and high quality.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Base Sequence
  • Camellia / classification
  • Camellia / genetics*
  • Camellia / metabolism
  • Chromosome Mapping
  • Conserved Sequence
  • Evolution, Molecular
  • Fatty Acid Desaturases / genetics
  • Fatty Acid Desaturases / metabolism
  • Gene Expression Profiling
  • Genome, Plant*
  • Lipid Metabolism / genetics*
  • Molecular Sequence Annotation
  • Molecular Sequence Data
  • Phylogeny
  • Plant Oils / metabolism
  • Plant Proteins / genetics*
  • Plant Proteins / metabolism
  • Polymorphism, Single Nucleotide
  • Sequence Homology, Nucleic Acid
  • Transcriptome*

Substances

  • Plant Oils
  • Plant Proteins
  • Fatty Acid Desaturases
  • delta-12 fatty acid desaturase

Associated data

  • GENBANK/GBHI01000000
  • SRA/SRR1472842
  • SRA/SRR1472843
  • SRA/SRR1472847
  • SRA/SRR1472854

Grants and funding

This work was supported by National Science Foundation of China (U0936603), Key Project of Natural Science Foundation of Yunnan Province (2008CC016), Top Talent Program of Yunnan Province (20080A009), Frontier Grant of Kunming Institute of Botany, CAS (672705232515), the Hundred Talents Program of the Chinese Academy of Sciences (CAS), a grant from the Chinese Academy of Science (KSCX2-YW-N-029), and a start-up grant from Kunming Institute of Botany, CAS to L.-Z. Gao. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.