Plasma Metabolomics and Machine Learning-Driven Novel Diagnostic Signature for Non-Alcoholic Steatohepatitis

Biomedicines. 2022 Jul 11;10(7):1669. doi: 10.3390/biomedicines10071669.

Abstract

We performed targeted metabolomics with machine learning (ML)-based interpretation to identify metabolites that distinguish the progression of nonalcoholic fatty liver disease (NAFLD) in a cohort. Plasma metabolomics analysis was conducted in healthy control subjects (n = 25) and patients with NAFL (n = 42) and nonalcoholic steatohepatitis (NASH, n = 19) by gas chromatography-tandem mass spectrometry (MS/MS) and liquid chromatography-MS/MS as well as RNA sequencing (RNA-seq) analyses on liver tissues from patients with varying stages of NAFLD (n = 12). The resulting metabolomic data were subjected to routine statistical and ML-based analyses and multi-omics interpretation with RNA-seq data. We found 6 metabolites that were significantly altered in NAFLD among 79 detected metabolites. Random-forest and multinomial logistic regression analyses showed that eight metabolites (glutamic acid, cis-aconitic acid, aspartic acid, isocitric acid, α-ketoglutaric acid, oxaloacetic acid, myristoleic acid, and tyrosine) could distinguish the three groups. Then, the recursive partitioning and regression tree algorithm selected three metabolites (glutamic acid, isocitric acid, and aspartic acid) from these eight metabolites. With these three metabolites, we formulated an equation, the MetaNASH score that distinguished NASH with excellent performance. In addition, metabolic map construction and correlation assays integrating metabolomics data into the transcriptome datasets of the liver showed correlations between the concentration of plasma metabolites and the expression of enzymes governing metabolism and specific alterations of these correlations in NASH. Therefore, these findings will be useful for evaluation of altered metabolism in NASH and understanding of pathophysiologic implications from metabolite profiles in relation to NAFLD progression.

Keywords: biomarkers; machine learning; metabolomics; nonalcoholic fatty liver disease; nonalcoholic steatohepatitis.

Grants and funding

This study was supported by grants from the Korea Health Technology R&D Project through the Korea Health Industry Development Institute (KHIDI), which is funded by the Ministry of Health & Welfare, Korea (HI14C1135 to D.H.L); the Basic Science Research Program through the National Research Foundation of Korea (NRF) funded by the Ministry of Education (NFR-2019R1I1A2A02062305 to D.H.L); NRF grant funded by the Korean government (MSIT) (No. NRF-2021R1A5A2030333 to D.H.L; No. 2015R1A4A1041219 and 2018R1D1A1B07041045 to M-J.P; and No. 2020R1A2C2010964 and 2021R1A5A8029876 to D.R.); and the Gachon University Gil Medical Center (FRD2021-03 to D.H.L).