Gene function classification using NCI-60 cell line gene expression profiles

Comput Biol Chem. 2005 Dec;29(6):412-9. doi: 10.1016/j.compbiolchem.2005.09.003. Epub 2005 Nov 9.

Abstract

Gene expression patterns from NCI's panel of 60 cell lines were used to train a Neural Network model for classifying genes to pathways. The model assigns probabilities to each gene for each of the 21 modeled pathways assigned by the Kyoto Encyclopedia of Genes and Genomes. Cross-validation of the model showed that 10 of the 21 pathways exhibited good performance in statistical significance and accuracy. The model was designed to output gene probabilities that could be screened for higher probabilities resulting in higher confidence in classification though yielding fewer genes per pathway. The model was deployed on 5798 genes and our approach allowed us to ascertain the most relevant genes above an estimated background. Eight pathways were identified with both good cross-validation and significant numbers above background, TCA Cycle, Oxidative Phosphorylation, Porphyrin Biosynthesis, Ribosome, Polymerases, Proteasome, Cell Cycle, and Cell Adhesion. Gene Ontology (GO) annotation was used for additional validation of gene classification results. A total of 551 GO annotated genes and 468 unannotated genes were classified to the 8 pathways. The primary and secondary classifications of genes revealed known pathway relationships and provide the potential for discovering new pathway relationships.

Publication types

  • Research Support, Non-U.S. Gov't
  • Validation Study

MeSH terms

  • Cell Line
  • Gene Expression Profiling*
  • Neural Networks, Computer
  • Oligonucleotide Array Sequence Analysis