De novo assembly and characterisation of the field pea transcriptome using RNA-Seq

BMC Genomics. 2015 Aug 16;16(1):611. doi: 10.1186/s12864-015-1815-7.

Abstract

Background: Field pea (Pisum sativum L.) is a cool-season grain legume that is cultivated world-wide for both human consumption and stock-feed purposes. Enhancement of genetic and genomic resources for field pea will permit improved understanding of the control of traits relevant to crop productivity and quality. Advances in second-generation sequencing and associated bioinformatics analysis now provide unprecedented opportunities for the development of such resources. The objective of this study was to perform transcriptome sequencing and characterisation from two genotypes of field pea that differ in terms of seed and plant morphological characteristics.

Results: Transcriptome sequencing was performed with RNA templates from multiple tissues of the field pea genotypes Kaspa and Parafield. Tissue samples were collected at various growth stages, and a total of 23 cDNA libraries were sequenced using Illumina high-throughput sequencing platforms. A total of 407 and 352 million paired-end reads from the Kaspa and Parafield transcriptomes, respectively, were assembled into 129,282 and 149,272 contigs, which were filtered on the basis of known gene annotations, presence of open reading frames (ORFs), reciprocal matches and degree of coverage. Totals of 126,335 contigs from Kaspa and 145,730 from Parafield were subsequently selected as the reference set. Reciprocal sequence analysis revealed that c. 87% of contigs were expressed in both cultivars, while a small proportion were unique to each genotype. Reads from different libraries were aligned to the genotype-specific assemblies in order to identify and characterise expression of contigs on a tissue-specific basis. Of these contigs, 87% were expressed in more than one tissue, while others showed distinct expression patterns in specific tissues, providing unique transcriptome signatures.
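
The contig counts reported above imply how much of each assembly survived filtering. The following minimal Python sketch works through that arithmetic. It uses only the four contig totals stated in the Results; the names `ASSEMBLED`, `REFERENCE`, `retention` and `summary` are illustrative assumptions and are not taken from the study's own pipeline.

```python
from typing import Dict, Tuple

# Contig counts as stated in the Results paragraph of this abstract.
ASSEMBLED: Dict[str, int] = {"Kaspa": 129_282, "Parafield": 149_272}
REFERENCE: Dict[str, int] = {"Kaspa": 126_335, "Parafield": 145_730}


def retention(assembled: int, retained: int) -> Tuple[int, float]:
    """Return (contigs removed by filtering, percentage of contigs retained)."""
    removed = assembled - retained
    pct_retained = 100.0 * retained / assembled
    return removed, pct_retained


# Hypothetical helper structure: genotype -> (removed, percentage retained).
summary = {g: retention(ASSEMBLED[g], REFERENCE[g]) for g in ASSEMBLED}

for genotype, (removed, pct) in summary.items():
    print(f"{genotype}: {removed} contigs removed, {pct:.2f}% retained")
```

On these figures, filtering removed 2,947 contigs from Kaspa and 3,542 from Parafield, so roughly 97.7% and 97.6% of the assembled contigs, respectively, were carried into the reference set.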

Conclusion: This study provided a comprehensive, assembled and annotated transcriptome set for field pea that can be used for development of genetic markers, in order to assess genetic diversity, construct linkage maps, perform trait-dissection and implement whole-genome selection strategies in varietal improvement programs, as well as to identify target genes for genetic modification approaches on the basis of annotation and expression analysis. In addition, the reference field pea transcriptome will prove highly valuable for comparative genomics studies and construction of a finalised genome sequence.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Databases, Nucleic Acid
  • Gene Expression Profiling / methods*
  • Genotype
  • Molecular Sequence Data
  • Organ Specificity
  • Pisum sativum / genetics*
  • Pisum sativum / physiology
  • RNA, Plant / analysis*
  • Sequence Analysis, RNA / methods*

Substances

  • RNA, Plant

Associated data

  • BioProject/PRJNA277074
  • BioProject/PRJNA277076
  • GENBANK/GCKA00000000
  • GENBANK/GCMF00000000
  • GENBANK/GCMG00000000
  • GENBANK/GCMH00000000
  • GENBANK/GCMI00000000
  • GENBANK/GCMJ00000000
  • GENBANK/GCMK00000000
  • GENBANK/GCML00000000
  • GENBANK/GCMM00000000
  • GENBANK/GCMN00000000
  • GENBANK/GCMO00000000
  • GENBANK/GCMP00000000
  • GENBANK/GCMQ00000000
  • SRA/SRR1910794
  • SRA/SRR1910804
  • SRA/SRR1910805
  • SRA/SRR1910806
  • SRA/SRR1910807
  • SRA/SRR1910808
  • SRA/SRR1910809
  • SRA/SRR1910810
  • SRA/SRR1910811
  • SRA/SRR1910812
  • SRA/SRR1910813
  • SRA/SRR1910814
  • SRA/SRR1910815
  • SRA/SRR1910816
  • SRA/SRR1910817
  • SRA/SRR1910818
  • SRA/SRR1910819
  • SRA/SRR1910820
  • SRA/SRR1910821
  • SRA/SRR1910822
  • SRA/SRR1910823
  • SRA/SRR1910824
  • SRA/SRR1910825
  • SRA/SRR1910826
  • SRA/SRR1913075
  • SRA/SRR1913256
  • SRA/SRR1913731