A Machine Learning Approach in Analyzing Bioaccumulation of Heavy Metals in Turbot Tissues

Molecules. 2020 Oct 14;25(20):4696. doi: 10.3390/molecules25204696.

Abstract

Metals are considered to be one of the most hazardous substances due to their potential for accumulation, magnification, persistence, and wide distribution in water, sediments, and aquatic organisms. Demersal fish species, such as turbot (Psetta maxima maeotica), are accepted by the scientific communities as suitable bioindicators of heavy metal pollution in the aquatic environment. The present study uses a machine learning approach, which is based on multiple linear and non-linear models, in order to effectively estimate the concentrations of heavy metals in both turbot muscle and liver tissues. For multiple linear regression (MLR) models, the stepwise method was used, while non-linear models were developed by applying random forest (RF) algorithm. The models were based on data that were provided from scientific literature, attributed to 11 heavy metals (As, Ca, Cd, Cu, Fe, K, Mg, Mn, Na, Ni, Zn) from both muscle and liver tissues of turbot exemplars. Significant MLR models were recorded for Ca, Fe, Mg, and Na in muscle tissue and K, Cu, Zn, and Na in turbot liver tissue. The non-linear tree-based RF prediction models (over 70% prediction accuracy) were identified for As, Cd, Cu, K, Mg, and Zn in muscle tissue and As, Ca, Cd, Mg, and Fe in turbot liver tissue. Both machine learning MLR and non-linear tree-based RF prediction models were identified to be suitable for predicting the heavy metal concentration from both turbot muscle and liver tissues. The models can be used for improving the knowledge and economic efficiency of linked heavy metals food safety and environment pollution studies.

Keywords: heavy metals; machine learning; prediction models; random forest; turbot.

MeSH terms

  • Animals
  • Bioaccumulation
  • Environmental Biomarkers
  • Environmental Monitoring / methods
  • Europe
  • Flatfishes*
  • Linear Models
  • Liver / drug effects
  • Liver / metabolism
  • Machine Learning*
  • Metals, Heavy / analysis*
  • Metals, Heavy / pharmacokinetics*
  • Muscle, Skeletal / drug effects
  • Muscle, Skeletal / metabolism
  • Nonlinear Dynamics
  • Water Pollutants, Chemical / analysis*
  • Water Pollutants, Chemical / pharmacokinetics

Substances

  • Environmental Biomarkers
  • Metals, Heavy
  • Water Pollutants, Chemical