Prediction of hepatic inflammation in chronic hepatitis B patients with a random forest-backward feature elimination algorithm

World J Gastroenterol. 2021 Jun 7;27(21):2910-2920. doi: 10.3748/wjg.v27.i21.2910.

Abstract

Background: Persistent liver inflammatory damage is the main risk factor for developing liver fibrosis, cirrhosis, and even hepatocellular carcinoma in chronic hepatitis B (CHB) patients. Thus, accurate prediction of the degree of liver inflammation is a high priority and a growing medical need.

Aim: To build an effective and robust non-invasive model for predicting hepatitis B-related hepatic inflammation.

Methods: A total of 650 treatment-naïve CHB patients (402 HBeAg-positive and 248 HBeAg-negative) who underwent liver biopsy were enrolled in this study. Histological inflammation was graded using the Ishak scoring system. Serum quantitative hepatitis B core antibody (qAnti-HBc) levels and 21 immune-related inflammatory factors were measured using a chemiluminescent microparticle immunoassay. A backward feature elimination (BFE) algorithm based on random forest (RF) was used to select optimal features and construct a combined model. The diagnostic performance of the model and of individual variables was evaluated by the area under the receiver operating characteristic curve (AUROC) and compared using the DeLong test.
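The feature selection step described in Methods can be illustrated with a minimal sketch. It assumes scikit-learn, uses synthetic data rather than the study cohort, and takes impurity-based importance as the elimination criterion and 5-fold cross-validated AUROC as the tracking metric. The abstract does not state these details, so the function name `rf_backward_elimination`, its parameters, and the feature names are hypothetical.

```python
import numpy as np
from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier
from sklearn.model_selection import cross_val_score


def rf_backward_elimination(X, y, names, n_keep=4, seed=0):
    """Drop the least important feature one at a time until n_keep remain.

    Returns the surviving feature names and a history of
    (feature_count, mean cross-validated AUROC) pairs.
    """
    kept = list(range(X.shape[1]))
    history = []
    while True:
        rf = RandomForestClassifier(n_estimators=50, random_state=seed)
        # Score the current feature subset with 5-fold cross-validated AUROC.
        auc = cross_val_score(rf, X[:, kept], y, cv=5, scoring="roc_auc").mean()
        history.append((len(kept), float(auc)))
        if len(kept) <= n_keep:
            break
        # Refit on all samples and remove the feature with the lowest importance.
        rf.fit(X[:, kept], y)
        weakest = int(np.argmin(rf.feature_importances_))
        kept.pop(weakest)
    return [names[i] for i in kept], history


# Synthetic stand-in data (NOT the study cohort): 300 samples, 10 candidate features.
X, y = make_classification(
    n_samples=300, n_features=10, n_informative=4, n_redundant=0, random_state=0
)
names = [f"feature_{i}" for i in range(X.shape[1])]
selected, history = rf_backward_elimination(X, y, names)
print(selected)
print(history)
```

In practice the stopping point would usually be chosen by inspecting how the cross-validated AUROC changes as features are removed, rather than fixed in advance as it is here. The DeLong test used for comparing AUROCs is not part of scikit-learn and is not shown.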

Results: Four features were selected by the RF-BFE method to predict moderate-to-severe inflammation in CHB patients: qAnti-HBc, ALT, AST, and CXCL11. Spearman's correlation analysis indicated that serum qAnti-HBc, ALT, AST, and CXCL11 levels were positively correlated with the histology activity index (HAI) score. These features were combined into a novel model, named the I-3A index. In all CHB patients, the AUROC of the I-3A index [0.822; 95% confidence interval (CI): 0.790-0.851] was significantly higher than that of qAnti-HBc alone (0.760; 95%CI: 0.724-0.792; P < 0.0001). An I-3A index cutoff value of 0.41 yielded a sensitivity of 69.17%, a specificity of 81.44%, and an accuracy of 73.8%. The I-3A index also showed significantly improved diagnostic performance for predicting moderate-to-severe inflammation in HBeAg-positive and HBeAg-negative CHB patients (AUROC 0.829, 95%CI: 0.789-0.865 and 0.810, 95%CI: 0.755-0.857, respectively).
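To make the cutoff-based metrics concrete, the sketch below shows how sensitivity, specificity, and accuracy follow from applying a threshold to an index score. The scores and labels are made-up toy values, not study data, and the convention that a score at or above the cutoff counts as positive is an assumption, since the abstract does not state it. The function name `metrics_at_cutoff` is hypothetical.

```python
def metrics_at_cutoff(scores, labels, cutoff):
    """Return (sensitivity, specificity, accuracy) for a score threshold.

    labels: 1 = moderate-to-severe inflammation, 0 = otherwise.
    A score >= cutoff is treated as a positive prediction (assumed convention).
    """
    tp = fp = tn = fn = 0
    for score, label in zip(scores, labels):
        predicted_positive = score >= cutoff
        if label == 1 and predicted_positive:
            tp += 1
        elif label == 1:
            fn += 1
        elif predicted_positive:
            fp += 1
        else:
            tn += 1
    sensitivity = tp / (tp + fn)
    specificity = tn / (tn + fp)
    accuracy = (tp + tn) / (tp + tn + fp + fn)
    return sensitivity, specificity, accuracy


# Toy values for illustration only.
scores = [0.9, 0.6, 0.3, 0.5, 0.2, 0.1]
labels = [1, 1, 1, 0, 0, 0]
sens, spec, acc = metrics_at_cutoff(scores, labels, cutoff=0.41)
print(f"sensitivity={sens:.2f} specificity={spec:.2f} accuracy={acc:.2f}")
```

Accuracy is a prevalence-weighted average of sensitivity and specificity, which is why it falls between the two reported values.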

Conclusion: The I-3A index, built from features selected by the RF-BFE algorithm, can effectively predict moderate-to-severe liver inflammation in CHB patients.

Keywords: CXCL11; Diagnostic efficiency; Hepatic inflammation; Machine learning; Quantitative hepatitis B core antibody.

MeSH terms

  • Alanine Transaminase
  • Algorithms
  • Biomarkers
  • Hepatitis B e Antigens
  • Hepatitis B virus / genetics
  • Hepatitis B, Chronic* / complications
  • Hepatitis B, Chronic* / diagnosis
  • Humans
  • Inflammation
  • Liver Cirrhosis / diagnosis
  • ROC Curve

Substances

  • Biomarkers
  • Hepatitis B e Antigens
  • Alanine Transaminase