Comparison of Machine Learning Models and the Fatty Liver Index in Predicting Lean Fatty Liver

Pei-Yuan Su; Yang-Yuan Chen; Chun-Yu Lin; Wei-Wen Su; Siou-Ping Huang; Hsu-Heng Yen

doi:10.3390/diagnostics13081407

Comparison of Machine Learning Models and the Fatty Liver Index in Predicting Lean Fatty Liver

Diagnostics (Basel). 2023 Apr 13;13(8):1407. doi: 10.3390/diagnostics13081407.

Authors

Pei-Yuan Su^{1

2}, Yang-Yuan Chen^{1

3}, Chun-Yu Lin⁴, Wei-Wen Su¹, Siou-Ping Huang¹, Hsu-Heng Yen^{1

2

5

6

7}

Affiliations

¹ Department of Internal Medicine, Division of Gastroenterology, Changhua Christian Hospital, Changhua 500, Taiwan.
² College of Medicine, National Chung Hsing University, Taichung 400, Taiwan.
³ Department of Hospitality Management, MingDao University, Changhua 500, Taiwan.
⁴ Department of Family Medicine, Yumin Hospital, Nantou 540, Taiwan.
⁵ General Education Center, Chienkuo Technology University, Changhua 500, Taiwan.
⁶ Department of Electrical Engineering, Chung Yuan Christian University, Taoyuan 320, Taiwan.
⁷ Artificial Intelligence Development Center, Changhua Christian Hospital, Changhua 500, Taiwan.

Abstract

The reported prevalence of non-alcoholic fatty liver disease in studies of lean individuals ranges from 7.6% to 19.3%. The aim of the study was to develop machine-learning models for the prediction of fatty liver disease in lean individuals. The present retrospective study included 12,191 lean subjects with a body mass index < 23 kg/m² who had undergone a health checkup from January 2009 to January 2019. Participants were divided into a training (70%, 8533 subjects) and a testing group (30%, 3568 subjects). A total of 27 clinical features were analyzed, except for medical history and history of alcohol or tobacco consumption. Among the 12,191 lean individuals included in the present study, 741 (6.1%) had fatty liver. The machine learning model comprising a two-class neural network using 10 features had the highest area under the receiver operating characteristic curve (AUROC) value (0.885) among all other algorithms. When applied to the testing group, we found the two-class neural network exhibited a slightly higher AUROC value for predicting fatty liver (0.868, 0.841-0.894) compared to the fatty liver index (FLI; 0.852, 0.824-0.81). In conclusion, the two-class neural network had greater predictive value for fatty liver than the FLI in lean individuals.

Keywords: fatty liver index; lean fatty liver; machine learning model.

Grants and funding

111-CCH-IRP-107/Changhua Christian Hospital