Classification of bacterial species from proteomic data using combinatorial approaches incorporating artificial neural networks, cluster analysis and principal components analysis

Bioinformatics. 2005 May 15;21(10):2191-9. doi: 10.1093/bioinformatics/bti368. Epub 2005 Mar 3.

Abstract

Motivation: Robust computer algorithms are required to interpret the vast amounts of proteomic data currently being produced and to generate generalized models which are applicable to 'real world' scenarios. One such scenario is the classification of bacterial species. These vary immensely, some remaining remarkably stable whereas others are extremely labile showing rapid mutation and change. Such variation makes clinical diagnosis difficult and pathogens may be easily misidentified.

Results: We applied artificial neural networks (Neuroshell 2) in parallel with cluster analysis and principal components analysis to surface enhanced laser desorption/ionization (SELDI)-TOF mass spectrometry data with the aim of accurately identifying the bacterium Neisseria meningitidis from species within this genus and other closely related taxa. A subset of ions were identified that allowed for the consistent identification of species, classifying >97% of a separate validation subset of samples into their respective groups.

Availability: Neuroshell 2 is commercially available from Ward Systems.

MeSH terms

  • Algorithms*
  • Bacterial Proteins / analysis*
  • Bacterial Proteins / metabolism
  • Biomarkers / analysis
  • Biomarkers / metabolism
  • Cluster Analysis
  • Gene Expression Profiling / methods*
  • Mass Spectrometry / methods
  • Models, Biological
  • Neisseria meningitidis / classification*
  • Neisseria meningitidis / isolation & purification
  • Neisseria meningitidis / metabolism*
  • Neural Networks, Computer*
  • Pattern Recognition, Automated / methods
  • Principal Component Analysis
  • Proteome / analysis*
  • Proteome / metabolism
  • Species Specificity

Substances

  • Bacterial Proteins
  • Biomarkers
  • Proteome