Network-based characterization and prediction of human DNA repair genes and pathways

Sci Rep. 2017 Apr 3:8:45714. doi: 10.1038/srep45714.

Abstract

Network biology is a useful strategy to understand cell's functional organization. In this study, for the first time, we successfully introduced network approaches to study properties of human DNA repair genes. Compared with non-DNA repair genes, we found distinguishing features for DNA repair genes: (i) they tend to have higher degrees; (ii) they tend to be located at global network center; (iii) they tend to interact directly with each other. Based on these features, we developed the first algorithm to predict new DNA repair genes. We tested several machine-learning models and found that support vector machine with kernel function of radial basis function (RBF) achieve the best performance, with precision = 0.74 and area under curve (AUC) = 0.96. In the end, we applied the algorithm to predict new DNA repair genes and got 32 new candidates. Literature supporting four of the predictions was found. We believe the network approaches introduced here might open a new avenue to understand DNA repair genes and pathways. The suggested algorithm and the predicted genes might be helpful for scientists in the field.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Algorithms*
  • DNA Repair Enzymes / genetics*
  • DNA Repair*
  • Data Mining / methods*
  • Databases, Factual
  • Gene Regulatory Networks*
  • Humans
  • Machine Learning
  • Protein Interaction Maps
  • Support Vector Machine

Substances

  • DNA Repair Enzymes