An Interpretable Screening Approach Derived Through XGBoost Regression for the Discovery of Hypolipidemic Contributors in Chinese Hawthorn Leaf and its Counterfeit Malus Doumeri Leaf

Plant Foods Hum Nutr. 2024 Mar;79(1):209-218. doi: 10.1007/s11130-024-01148-z. Epub 2024 Feb 10.

Abstract

The active ingredient group is a prominent feature reflecting the inherent characteristics of plant-based functional foods. Chinese hawthorn leaf (CHL), a tea substitute possessing intrinsic nutritional properties in anti-hyperlipidemia, was first found to be adulterated with Malus doumeri leaf (MDL) owing to similar commercial labels. In this context, the above-mentioned two contrasting species were explored through phytochemical profiling and activity assessment. The amelioration effect of CHL on free fatty acids-elicited lipid deposition in HepG2 cells was significantly better than that of MDL. Molecular networking-based metabolic profiles identified 68 and 67 components in CHL and MDL, with 33 shared components. Extreme gradient boosting (XGBoost) algorithm with outstanding performance was selected to screen candidate components contributing to hypolipidemic activity, and the output was later interpreted by Shapley additive explanations (SHAP) method. Twelve and eight components were separately screened as hyperlipidemic inhibitors in CHL and MDL, while only four constituents were shared. The bioactivity evaluation of selected ingredients and combinations further confirmed their anti-hyperlipidemia capacity. These findings emphasized the feasibility of filtering bioactivity-related compounds using interpretable machine learning approaches and illustrated that related species may contain different hypolipidemic contributors, even if shared constituents existed.

Keywords: Chinese hawthorn leaf; Extreme gradient boosting; Hypolipidemic contributors; Malus doumeri leaf; Shapley additive explanations.

MeSH terms

  • China
  • Crataegus*
  • Functional Food
  • Malus*
  • Plant Leaves