Sequencing, de novo assembly, annotation and SSR and SNP detection of sabaigrass (Eulaliopsis binata) transcriptome

Genomics. 2013 Jul;102(1):57-62. doi: 10.1016/j.ygeno.2013.02.014. Epub 2013 Apr 15.

Abstract

Eulaliopsis binata is one of the best fiber grasses owing to its high fiber quality and production. Large-scale transcriptome sequencing of E. binata was first performed using mixed leaf samples from 20 wild clusters. A total of 26,438,832 clean reads were generated and assembled into 59,134 isogenes with an average length of 845 bp. BLAST searches against the NCBI non-redundant (nr) protein, KEGG and GO databases classified these isogenes into functional categories to aid understanding of gene functions and regulation pathways. Only 15.0% of the assembled isogenes were similar to known proteins, and 24.4% had no hits in the nr protein database. The full set of isogenes and 5306 highly expressed isogenes were compared by BLASTx against the MAIZEWALL, Cell Wall Navigator and PlantTFDB databases. A total of 6681 simple sequence repeats (SSRs) and 147,177 single nucleotide polymorphisms (SNPs) were detected in the isogenes, and 5723 pairs of SSR primers were designed.
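The abstract does not name the SSR detection tool or its thresholds. As an illustration only, perfect SSRs in an assembled sequence could be located with a regular-expression scan along the following lines. The function name `find_ssrs` and the minimum repeat counts in `MIN_REPEATS` are assumptions made for this sketch, not values taken from the study.

```python
import re

# Hypothetical minimum repeat counts per motif length (an assumption for this
# sketch; the abstract does not state the thresholds used in the study).
MIN_REPEATS = {2: 6, 3: 5, 4: 4, 5: 4, 6: 4}


def find_ssrs(seq, min_repeats=MIN_REPEATS):
    """Return sorted (start, motif, repeat_count) tuples for perfect SSRs in seq."""
    seq = seq.upper()
    hits = []
    for motif_len, min_rep in sorted(min_repeats.items()):
        # ([ACGT]{k}) captures one motif; \1{n,} requires at least n more copies.
        pattern = re.compile(r"([ACGT]{%d})\1{%d,}" % (motif_len, min_rep - 1))
        for match in pattern.finditer(seq):
            motif = match.group(1)
            # Skip motifs that are themselves repeats of a shorter unit,
            # e.g. "AA" or "ATAT", so each locus is reported at its true period.
            is_compound = any(
                motif == motif[:d] * (motif_len // d)
                for d in range(1, motif_len)
                if motif_len % d == 0
            )
            if is_compound:
                continue
            repeat_count = len(match.group(0)) // motif_len
            hits.append((match.start(), motif, repeat_count))
    return sorted(hits)


if __name__ == "__main__":
    # Toy sequence: an (AT)7 repeat followed by an (AGC)5 repeat.
    example = "GGC" + "AT" * 7 + "CCG" + "AGC" * 5 + "T"
    for start, motif, count in find_ssrs(example):
        print(start, motif, count)
```

Start positions are 0-based. A scan of this kind reports only perfect repeats; compound or interrupted SSRs, and mononucleotide runs, would need additional rules.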

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Databases, Protein
  • Gene Expression Profiling*
  • Microsatellite Repeats / genetics*
  • Molecular Sequence Annotation*
  • Plant Proteins
  • Poaceae / genetics*
  • Polymorphism, Single Nucleotide
  • Sequence Analysis, DNA

Substances

  • Plant Proteins