A K-nearest Neighbor Model to Predict Early Recurrence of Hepatocellular Carcinoma After Resection

J Clin Transl Hepatol. 2022 Aug 28;10(4):600-607. doi: 10.14218/JCTH.2021.00348. Epub 2022 Jan 4.

Abstract

Background and aims: Patients with hepatocellular carcinoma (HCC) surgically resected are at risk of recurrence; however, the risk factors of recurrence remain poorly understood. This study intended to establish a novel machine learning model based on clinical data for predicting early recurrence of HCC after resection.

Methods: A total of 220 HCC patients who underwent resection were enrolled. Classification machine learning models were developed to predict HCC recurrence. The standard deviation, recall, and precision of the model were used to assess the model's accuracy and identify efficiency of the model.

Results: Recurrent HCC developed in 89 (40.45%) patients at a median time of 14 months from primary resection. In principal component analysis, tumor size, tumor grade differentiation, portal vein tumor thrombus, alpha-fetoprotein, protein induced by vitamin K absence or antagonist-II (PIVKA-II), aspartate aminotransferase, platelet count, white blood cell count, and HBsAg were positive prognostic factors of HCC recurrence and were included in the preoperative model. After comparing different machine learning methods, including logistic regression, decision tree, naïve Bayes, deep neural networks, and k-nearest neighbor (K-NN), we choose the K-NN model as the optimal prediction model. The accuracy, recall, precision of the K-NN model were 70.6%, 51.9%, 70.1%, respectively. The standard deviation was 0.020.

Conclusions: The K-NN classification algorithm model performed better than the other classification models. Estimation of the recurrence rate of early HCC can help to allocate treatment, eventually achieving safe oncological outcomes.

Keywords: Hepatocellular carcinoma; Machine learning; Prognostic model; Recurrence; Surgical resection.