Constrained de novo sequencing of conotoxins

J Proteome Res. 2012 Aug 3;11(8):4191-200. doi: 10.1021/pr300312h. Epub 2012 Jul 2.

Abstract

De novo peptide sequencing by mass spectrometry (MS) can determine the amino acid sequence of an unknown peptide without reference to a protein database. MS-based de novo sequencing assumes special importance in focused studies of families of biologically active peptides and proteins, such as hormones, toxins, and antibodies, for which amino acid sequences may be difficult to obtain through genomic methods. These protein families often exhibit sequence homology or characteristic amino acid content; yet, current de novo sequencing approaches do not take advantage of this prior knowledge and, hence, search an unnecessarily large space of possible sequences. Here, we describe an algorithm for de novo sequencing that incorporates sequence constraints into the core graph algorithm and thereby reduces the search space by many orders of magnitude. We demonstrate our algorithm in a study of cysteine-rich toxins from two cone snail species (Conus textile and Conus stercusmuscarum) and report 13 de novo and about 60 total toxins.

Publication types

  • Research Support, American Recovery and Reinvestment Act
  • Research Support, N.I.H., Extramural

MeSH terms

  • Algorithms
  • Amino Acid Sequence
  • Amino Acid Substitution
  • Animals
  • Conotoxins / chemistry*
  • Conotoxins / genetics
  • Conus Snail / chemistry*
  • Conus Snail / genetics
  • Molecular Sequence Data
  • Mutation
  • Sequence Analysis, Protein*
  • Tandem Mass Spectrometry

Substances

  • Conotoxins