Aligning the proteome and genome of the silkworm, Bombyx mori

Funct Integr Genomics. 2009 Nov;9(4):447-54. doi: 10.1007/s10142-009-0127-x. Epub 2009 Jun 16.

Abstract

A technology of mass spectrometry (MS) was used in this study for the large-scale proteomic identification and verification of protein-encoding genes present in the silkworm (Bombyx mori) genome. Peptide sequences identified by MS were compared with those from an open reading frame (ORF) library of the B. mori genome and a cDNA library, to validate the coding attributes of ORFs. Two databases were created. The first was based on a 9x draft sequence of the silkworm genome and contained 14,632 putative proteins. The second was based on a B. mori pupal cDNA library containing 3,187 putative proteins of at least 30 amino acid residues in length. A total of 81,000 peptide sequences with a threshold score of 60% were generated by the MS/MS analysis, and 55,400 of these were chosen for a sequence alignment. By searching these two databases, 6,649 and 250 proteins were matched, which accounted for approximately 45.4% and 7.8% of the peptide sequences and putative proteins, respectively. Further analyses carried out by several bioinformatic tools suggested that the matches included proteins with predicted transmembrane domains (1,393) and preproteins with a signal peptide (976). These results provide a fundamental understanding of the expression and function of silkworm proteins.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Amino Acid Sequence
  • Animals
  • Base Sequence
  • Bombyx* / chemistry
  • Bombyx* / genetics
  • Databases, Genetic
  • Genome*
  • Insect Proteins / analysis
  • Insect Proteins / genetics
  • Mass Spectrometry / methods
  • Molecular Sequence Data
  • Open Reading Frames
  • Proteome*

Substances

  • Insect Proteins
  • Proteome