Construction and evaluation of normalized cDNA libraries enriched with full-length sequences for rapid discovery of new genes from Sisal (Agave sisalana Perr.) different developmental stages

Int J Mol Sci. 2012 Oct 12;13(10):13150-68. doi: 10.3390/ijms131013150.

Abstract

To provide a resource of sisal-specific expressed sequence data and facilitate this powerful approach in new gene research, the preparation of normalized cDNA libraries enriched with full-length sequences is necessary. Four libraries were produced with RNA pooled from Agave sisalana multiple tissues to increase efficiency of normalization and maximize the number of independent genes by SMART™ method and the duplex-specific nuclease (DSN). This procedure kept the proportion of full-length cDNAs in the subtracted/normalized libraries and dramatically enhanced the discovery of new genes. Sequencing of 3875 cDNA clones of libraries revealed 3320 unigenes with an average insert length about 1.2 kb, indicating that the non-redundancy of libraries was about 85.7%. These unigene functions were predicted by comparing their sequences to functional domain databases and extensively annotated with Gene Ontology (GO) terms. Comparative analysis of sisal unigenes and other plant genomes revealed that four putative MADS-box genes and knotted-like homeobox (knox) gene were obtained from a total of 1162 full-length transcripts. Furthermore, real-time PCR showed that the characteristics of their transcripts mainly depended on the tight expression regulation of a number of genes during the leaf and flower development. Analysis of individual library sequence data indicated that the pooled-tissue approach was highly effective in discovering new genes and preparing libraries for efficient deep sequencing.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Agave / genetics*
  • Agave / growth & development
  • Amino Acid Sequence
  • Cloning, Molecular
  • Computational Biology
  • Gene Expression Regulation, Developmental
  • Gene Library
  • Genes, Plant*
  • Molecular Sequence Data
  • Phylogeny
  • Plant Proteins / chemistry
  • Plant Proteins / classification
  • Sequence Alignment
  • Sequence Analysis, DNA

Substances

  • Plant Proteins