Transcriptome sequencing of Pinus kesiya var. langbianensis and comparative analysis in the Pinus phylogeny

BMC Genomics. 2018 Oct 3;19(1):725. doi: 10.1186/s12864-018-5127-6.

Abstract

Background: Pines are widely distributed in the Northern Hemisphere and have a long evolutionary history. The availability of transcriptome data has facilitated comparative transcriptomics for studying the evolutionary patterns associated with the different geographical distributions of species in the Pinus phylogeny.

Results: The transcriptome of Pinus kesiya var. langbianensis was sequenced using the Illumina HiSeq 2000 platform, and a total of 68,881 unigenes were assembled by Trinity. Transcriptome sequences of another 12 conifer species were downloaded from public databases. All of the pairwise orthologues were identified by comparative transcriptome analysis in 13 conifer species, from which the rate of diversification was calculated and a phylogenetic tree inferred. All of the fast-evolving positive selection sequences were identified, and some salt-, drought-, and abscisic acid-resistance genes were discovered.

Conclusions: mRNA sequences of P. kesiya var. langbianensis were obtained by transcriptome sequencing, and a large number of simple sequence repeat and short nucleotide polymorphism loci were detected. These data can be used in molecular marker-assisted selected in pine breeding. Divergence times were estimated in the 13 conifer species using comparative transcriptomic analysis. A number of positive selection genes were found to be related to environmental factors. Salt- and abscisic acid-related genes exhibited different selection patterns between coastal and inland Pinus. Our findings help elucidate speciation patterns in the Pinus lineage.

Keywords: Comparative transcriptomics; Pinus kesiya var. langbianensis; Pinus phylogeny; Transcriptome sequencing.

Publication types

  • Comparative Study

MeSH terms

  • Environment
  • Evolution, Molecular
  • Gene Expression Profiling*
  • Geography
  • Microsatellite Repeats / genetics
  • Phylogeny*
  • Pinus / genetics*
  • Polymorphism, Single Nucleotide
  • Sequence Analysis*
  • Sequence Homology, Nucleic Acid