Network-based method for mining novel HPV infection related genes using random walk with restart algorithm

Biochim Biophys Acta Mol Basis Dis. 2018 Jun;1864(6 Pt B):2376-2383. doi: 10.1016/j.bbadis.2017.11.021. Epub 2017 Nov 29.

Abstract

The human papillomavirus (HPV), a common virus that infects the reproductive tract, may lead to malignant changes within the infection area in certain cases and is directly associated with such cancers as cervical cancer, anal cancer, and vaginal cancer. Identification of novel HPV infection related genes can lead to a better understanding of the specific signal pathways and cellular processes related to HPV infection, providing information for the development of more efficient therapies. In this study, several novel HPV infection related genes were predicted by a computation method based on the known genes involved in HPV infection from HPVbase. This method applied the algorithm of random walk with restart (RWR) to a protein-protein interaction (PPI) network. The candidate genes were further filtered by the permutation and association tests. These steps eliminated genes occupying special positions in the PPI network and selected key genes with strong associations to known HPV infection related genes based on the interaction confidence and functional similarity obtained from published databases, such as STRING, gene ontology (GO) terms and KEGG pathways. Our study identified 104 novel HPV infection related genes, a number of which were confirmed to relate to the infection processes and complications of HPV infection, as reported in the literature. These results demonstrate the reliability of our method in identifying HPV infection related genes. This article is part of a Special Issue entitled: Accelerating Precision Medicine through Genetic and Genomic Big Data Analysis edited by Yudong Cai & Tao Huang.

Keywords: GO terms; Human papillomavirus; KEGG pathways; Protein-protein interaction; Random walk with restart algorithm.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Algorithms*
  • Data Mining / methods*
  • Databases, Genetic*
  • Gene Regulatory Networks*
  • Humans
  • Papillomaviridae* / genetics
  • Papillomaviridae* / metabolism
  • Papillomavirus Infections* / genetics
  • Papillomavirus Infections* / metabolism