Analysis of transcriptome data in the red flour beetle, Tribolium castaneum

Insect Biochem Mol Biol. 2008 Apr;38(4):380-6. doi: 10.1016/j.ibmb.2007.09.008. Epub 2007 Sep 29.

Abstract

The whole genome sequence of Tribolium castaneum, a worldwide coleopteran pest of stored products, has recently been determined. In order to facilitate accurate annotation and detailed functional analysis of this genome, we have compiled and analyzed all available expressed sequence tag (EST) data. The raw data consist of 61,228 ESTs, including 10,704 obtained from NCBI and an additional 50,524 derived from 32,544 clones generated in our laboratories. These sequences were amassed from cDNA libraries representing six different tissues or stages, namely: whole embryos, whole larvae, larval hindguts and Malpighian tubules, larval fat bodies and carcasses, adult ovaries, and adult heads. Assembly of the 61,228 sequences collapsed into 12,269 clusters (groups of overlapping ESTs representing single genes), of which 10,134 mapped onto 6,463 (39%) of the 16,422 GLEAN gene models (i.e. official Tribolium gene list). Approximately 1,600 clusters (13% of the total) lack corresponding GLEAN models, despite high matches to the genome, suggesting that a considerable number of transcribed sequences were missed by the gene prediction programs or were removed by GLEAN. We conservatively estimate that the current EST set represents more than 7,500 transcription units.

Publication types

  • Research Support, N.I.H., Extramural
  • Research Support, Non-U.S. Gov't
  • Research Support, U.S. Gov't, Non-P.H.S.

MeSH terms

  • Animals
  • Databases, Nucleic Acid
  • Expressed Sequence Tags*
  • Gene Expression Profiling*
  • Gene Library
  • Tribolium / genetics*
  • Tribolium / metabolism