De novo assembly of maritime pine transcriptome: implications for forest breeding and biotechnology

Plant Biotechnol J. 2014 Apr;12(3):286-99. doi: 10.1111/pbi.12136. Epub 2013 Nov 21.

Abstract

Maritime pine (Pinus pinasterAit.) is a widely distributed conifer species in Southwestern Europe and one of the most advanced models for conifer research. In the current work, comprehensive characterization of the maritime pine transcriptome was performed using a combination of two different next-generation sequencing platforms, 454 and Illumina. De novo assembly of the transcriptome provided a catalogue of 26 020 unique transcripts in maritime pine trees and a collection of 9641 full-length cDNAs. Quality of the transcriptome assembly was validated by RT-PCR amplification of selected transcripts for structural and regulatory genes. Transcription factors and enzyme-encoding transcripts were annotated. Furthermore, the available sequencing data permitted the identification of polymorphisms and the establishment of robust single nucleotide polymorphism (SNP) and simple-sequence repeat (SSR) databases for genotyping applications and integration of translational genomics in maritime pine breeding programmes. All our data are freely available at SustainpineDB, the P. pinaster expressional database. Results reported here on the maritime pine transcriptome represent a valuable resource for future basic and applied studies on this ecological and economically important pine species.

Keywords: conifers; full-length cDNA; next-generation sequencing; single nucleotide polymorphism; transcription factors; transcriptome sequencing.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Biotechnology*
  • Breeding
  • DNA, Complementary / genetics
  • Databases, Genetic
  • Genome Size
  • Genome, Plant / genetics*
  • Genotype
  • High-Throughput Nucleotide Sequencing / methods*
  • Microsatellite Repeats / genetics
  • Molecular Sequence Annotation
  • Multigene Family
  • Pinus / genetics*
  • Polymorphism, Single Nucleotide*
  • RNA, Plant / genetics
  • Sequence Analysis, DNA
  • Transcription Factors / genetics
  • Transcriptome*
  • Trees

Substances

  • DNA, Complementary
  • RNA, Plant
  • Transcription Factors