Binary Classification for Failure Risk Assessment

Ali Foroughi Pour; Ian Loveless; Grzegorz Rempala; Maciej Pietrzak

doi:10.1007/978-1-0716-0849-4_6

Binary Classification for Failure Risk Assessment

Methods Mol Biol. 2021:2194:77-105. doi: 10.1007/978-1-0716-0849-4_6.

Authors

Ali Foroughi Pour^{1

2}, Ian Loveless³, Grzegorz Rempala^{2

3}, Maciej Pietrzak⁴

Affiliations

¹ Department of Electrical and Computer Engineering, The Ohio State University, Columbus, OH, USA.
² Department of Mathematics, The Ohio State University, Columbus, OH, USA.
³ College of Public Health, The Ohio State University, Columbus, OH, USA.
⁴ Department of Biomedical Informatics, The Ohio State University, Columbus, OH, USA. pietrzak.20@osu.edu.

PMID: 32926363
DOI: 10.1007/978-1-0716-0849-4_6

Abstract

Survival analysis is tremendously powerful, and is a popular methodology for analyzing time to event models in bioinformatics. Furthermore, several of its extensions can simultaneously perform variable selection in conjunction with model estimation. While this flexibility is extremely desirable, under certain scenarios, binary class variable selection and classification methods might provide more reliable risk estimates. Synthetic simulations and real data case studies suggest that when (1) randomly censored points comprise only a small portion of data, (2) biological markers are weak, (3) it is desired to compute risk across predetermined time intervals, and (4) the assumptions of the competing time to event models are violated, binary class models tend to perform superior. In practice, it might be prudent to test both model families to guarantee adequate analysis. Here we describe the pipeline of binary class feature selection and classification for time to event risk assessment.

Keywords: Classification; Risk assessment; Survival analysis; Variable selection.

MeSH terms

Algorithms
Analysis of Variance
Biostatistics / methods*
Computational Biology / methods*
Computer Simulation
Data Interpretation, Statistical
Discriminant Analysis
Humans
Linear Models
Neoplasms / mortality*
Prognosis
Risk Assessment / methods
Support Vector Machine
Survival Analysis