OPMSP: A Computational Method Integrating Protein Interaction and Sequence Information for the Identification of Novel Putative Oncogenes

Protein Pept Lett. 2016;23(12):1081-1094. doi: 10.2174/0929866523666161021165506.

Abstract

Oncogenes are genes that have the potential to cause cancer. Oncogene research can provide insight into the occurrence and development of cancer, thereby helping to prevent cancer and to design effective treatments. This study proposes a network method called the oncogene prediction method based on shortest path algorithm (OPMSP) for the identification of novel oncogenes in a large protein network built using protein-protein interaction data. Novel putative genes were extracted from the shortest paths connecting any two known oncogenes. Then, they were filtered by a randomization test, and the linkages among them and known oncogenes were measured by protein interaction and sequence data. Thirty-seven new putative oncogenes were identified by this method. The enrichment analysis of the 37 putative oncogenes indicated that they are highly associated with several biological processes related to the initiation, progression and metastasis of tumors. Six of these genes-ESR1, CDK9, SEPT2, HOXA10, LMX1B, and NR2C2-are extensively discussed. Several lines of evidence indicate that they may be novel oncogenes.

MeSH terms

  • Algorithms
  • Amino Acid Sequence
  • Computational Biology / methods*
  • Cyclin-Dependent Kinase 9 / genetics
  • Estrogen Receptor alpha / genetics
  • Homeobox A10 Proteins
  • Homeodomain Proteins / genetics
  • LIM-Homeodomain Proteins / genetics
  • Neoplasms / genetics*
  • Oncogenes / genetics*
  • Protein Interaction Maps / genetics*
  • Receptors, Steroid / genetics
  • Receptors, Thyroid Hormone / genetics
  • Septins / genetics
  • Transcription Factors / genetics

Substances

  • ESR1 protein, human
  • Estrogen Receptor alpha
  • Homeobox A10 Proteins
  • Homeodomain Proteins
  • LIM homeobox transcription factor 1 beta
  • LIM-Homeodomain Proteins
  • NR2C2 protein, human
  • Receptors, Steroid
  • Receptors, Thyroid Hormone
  • Transcription Factors
  • HOXA10 protein, human
  • CDK9 protein, human
  • Cyclin-Dependent Kinase 9
  • SEPTIN2 protein, human
  • Septins