Comparison of Machine Learning Models and the Fatty Liver Index in Predicting Lean Fatty Liver

Diagnostics (Basel). 2023 Apr 13;13(8):1407. doi: 10.3390/diagnostics13081407.

Abstract

The reported prevalence of non-alcoholic fatty liver disease in studies of lean individuals ranges from 7.6% to 19.3%. The aim of the study was to develop machine-learning models for the prediction of fatty liver disease in lean individuals. The present retrospective study included 12,191 lean subjects with a body mass index < 23 kg/m2 who had undergone a health checkup from January 2009 to January 2019. Participants were divided into a training (70%, 8533 subjects) and a testing group (30%, 3568 subjects). A total of 27 clinical features were analyzed, except for medical history and history of alcohol or tobacco consumption. Among the 12,191 lean individuals included in the present study, 741 (6.1%) had fatty liver. The machine learning model comprising a two-class neural network using 10 features had the highest area under the receiver operating characteristic curve (AUROC) value (0.885) among all other algorithms. When applied to the testing group, we found the two-class neural network exhibited a slightly higher AUROC value for predicting fatty liver (0.868, 0.841-0.894) compared to the fatty liver index (FLI; 0.852, 0.824-0.81). In conclusion, the two-class neural network had greater predictive value for fatty liver than the FLI in lean individuals.

Keywords: fatty liver index; lean fatty liver; machine learning model.