Classification algorithm for high-dimensional protein markers in time-course data

Stat Med. 2020 Dec 10;39(28):4201-4217. doi: 10.1002/sim.8720. Epub 2020 Aug 25.

Abstract

Identification of biomarkers is an emerging area in oncology. In this article, we develop an efficient statistical procedure for the classification of protein markers according to their effect on cancer progression. A high-dimensional time-course dataset of protein markers for 80 patients motivates us for developing the model. The threshold value is formulated as a level of a marker having maximum impact on cancer progression. The classification algorithm technique for high-dimensional time-course data is developed and the algorithm is validated by comparing random components using both proportional hazard and accelerated failure time frailty models. The study elucidates the application of two separate joint modeling techniques using auto regressive-type model and mixed effect model for time-course data and proportional hazard model for survival data with proper utilization of Bayesian methodology. Also, a prognostic score is developed on the basis of few selected genes with application on patients. This study facilitates to identify relevant biomarkers from a set of markers.

Keywords: Bayesian; auto-regression; classification; frailty; joint modeling.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Algorithms*
  • Bayes Theorem
  • Biomarkers
  • Humans
  • Medical Oncology*
  • Proportional Hazards Models

Substances

  • Biomarkers