Improvement of predictive models of risk of disease progression in chronic hepatitis C by incorporating longitudinal data

Hepatology. 2015 Jun;61(6):1832-41. doi: 10.1002/hep.27750. Epub 2015 Mar 20.

Abstract

Existing predictive models of risk of disease progression in chronic hepatitis C have limited accuracy. The aim of this study was to improve upon existing models by applying novel statistical methods that incorporate longitudinal data. Patients in the Hepatitis C Antiviral Long-term Treatment Against Cirrhosis trial were analyzed. Outcomes of interest were (1) fibrosis progression (increase of two or more Ishak stages) and (2) liver-related clinical outcomes (liver-related death, hepatic decompensation, hepatocellular carcinoma, liver transplant, or increase in Child-Turcotte-Pugh score to ≥7). Predictors included longitudinal clinical, laboratory, and histologic data. Models were constructed using logistic regression and two machine learning methods (random forest and boosting) to predict an outcome in the next 12 months. The control arm was used as the training data set (n = 349 clinical, n = 184 fibrosis) and the interferon arm, for internal validation. The area under the receiver operating characteristic curve for longitudinal models of fibrosis progression was 0.78 (95% confidence interval [CI] 0.74-0.83) using logistic regression, 0.79 (95% CI 0.77-0.81) using random forest, and 0.79 (95% CI 0.77-0.82) using boosting. The area under the receiver operating characteristic curve for longitudinal models of clinical progression was 0.79 (95% CI 0.77-0.82) using logistic regression, 0.86 (95% CI 0.85-0.87) using random forest, and 0.84 (95% CI 0.82-0.86) using boosting. Longitudinal models outperformed baseline models for both outcomes (P < 0.0001). Longitudinal machine learning models had negative predictive values of 94% for both outcomes.

Conclusions: Prediction models that incorporate longitudinal data can capture nonlinear disease progression in chronic hepatitis C and thus outperform baseline models. Machine learning methods can capture complex relationships between predictors and outcomes, yielding more accurate predictions; our models can help target costly therapies to patients with the most urgent need, guide the intensity of clinical monitoring required, and provide prognostic information to patients.

Publication types

  • Clinical Trial
  • Research Support, N.I.H., Extramural
  • Research Support, U.S. Gov't, Non-P.H.S.

MeSH terms

  • Artificial Intelligence
  • Disease Progression*
  • Female
  • Fibrosis
  • Hepatitis C, Chronic / pathology*
  • Humans
  • Liver / pathology*
  • Male
  • Middle Aged
  • Models, Biological*
  • Risk Assessment
  • Statistics as Topic