driveR: a novel method for prioritizing cancer driver genes using somatic genomics data

BMC Bioinformatics. 2021 May 24;22(1):263. doi: 10.1186/s12859-021-04203-7.

Abstract

Background: Cancer develops due to "driver" alterations. Numerous approaches exist for predicting cancer drivers from cohort-scale genomics data. However, methods for personalized analysis of driver genes are underdeveloped. In this study, we developed a novel personalized/batch analysis approach for driver gene prioritization utilizing somatic genomics data, called driveR.

Results: Combining genomics information and prior biological knowledge, driveR accurately prioritizes cancer driver genes via a multi-task learning model. Testing on 28 different datasets, this study demonstrates that driveR performs adequately, achieving a median AUC of 0.684 (range 0.651-0.861) on the 28 batch analysis test datasets, and a median AUC of 0.773 (range 0-1) on the 5157 personalized analysis test samples. Moreover, it outperforms existing approaches, achieving a significantly higher median AUC than all of MutSigCV (Wilcoxon rank-sum test p < 0.001), DriverNet (p < 0.001), OncodriveFML (p < 0.001) and MutPanning (p < 0.001) on batch analysis test datasets, and a significantly higher median AUC than DawnRank (p < 0.001) and PRODIGY (p < 0.001) on personalized analysis datasets.

Conclusions: This study demonstrates that the proposed method is an accurate and easy-to-utilize approach for prioritizing driver genes in cancer genomes in personalized or batch analyses. driveR is available on CRAN: https://cran.r-project.org/package=driveR .

Keywords: Cancer; Driver gene; Prioritization; Somatic.

MeSH terms

  • Genomics
  • Humans
  • Mutation
  • Neoplasms* / genetics
  • Oncogenes*