Identifying statistically significant combinatorial markers for survival analysis

BMC Med Genomics. 2018 Apr 20;11(Suppl 2):31. doi: 10.1186/s12920-018-0346-x.

Abstract

Background: Survival analysis methods have been widely applied in different areas of health and medicine, spanning over varying events of interest and target diseases. They can be utilized to provide relationships between the survival time of individuals and factors of interest, rendering them useful in searching for biomarkers in diseases such as cancer. However, some disease progression can be very unpredictable because the conventional approaches have failed to consider multiple-marker interactions. An exponential increase in the number of candidate markers requires large correction factor in the multiple-testing correction and hide the significance.

Methods: We address the issue of testing marker combinations that affect survival by adapting the recently developed Limitless Arity Multiple-testing Procedure (LAMP), a p-value correction technique for statistical tests for combination of markers. LAMP cannot handle survival data statistics, and hence we extended LAMP for the log-rank test, making it more appropriate for clinical data, with newly introduced theoretical lower bound of the p-value.

Results: We applied the proposed method to gene combination detection for cancer and obtained gene interactions with statistically significant log-rank p-values. Gene combinations with orders of up to 32 genes were detected by our algorithm, and effects of some genes in these combinations are also supported by existing literature.

Conclusion: The novel approach for detecting prognostic markers presented here can identify statistically significant markers with no limitations on the order of interaction. Furthermore, it can be applied to different types of genomic data, provided that binarization is possible.

Keywords: Gene marker; Log-rank test; Multiple testing; Prognosis; Survival analysis.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Biomarkers, Tumor / genetics*
  • Breast Neoplasms / genetics
  • Breast Neoplasms / pathology
  • Computational Biology / methods*
  • Cystadenocarcinoma, Serous / genetics
  • Cystadenocarcinoma, Serous / pathology
  • Female
  • Humans
  • Neoplasm Invasiveness
  • Ovarian Neoplasms / genetics
  • Ovarian Neoplasms / pathology
  • Statistics as Topic*
  • Survival Analysis

Substances

  • Biomarkers, Tumor