Exploration of machine learning algorithms for predicting the changes in abundance of antibiotic resistance genes in anaerobic digestion

Sci Total Environ. 2022 Sep 15:839:156211. doi: 10.1016/j.scitotenv.2022.156211. Epub 2022 May 24.

Abstract

The land application of digestate from anaerobic digestion (AD) is considered a significant route for transmitting antibiotic resistance genes (ARGs) and mobile genetic elements (MGEs) to ecosystems. To date, efforts towards understanding complex non-linear interactions between AD operating parameters with ARG/MGE abundances rely on experimental investigations due to a lack of mechanistic models. Herein, three different machine learning (ML) algorithms, Random Forest (RF), eXtreme Gradient Boosting (XGBoost), and Artificial Neural Network (ANN), were compared for their predictive capacities in simulating ARG/MGE abundance changes during AD. The models were trained and cross-validated using experimental data collected from 33 published literature. The comparison of model performance using coefficients of determination (R2) and root mean squared errors (RMSE) indicated that ANN was more reliable than RF and XGBoost. The mode of operation (batch/semi-continuous), co-digestion of food waste and sewage sludge, and residence time were identified as the three most critical features in predicting ARG/MGE abundance changes. Moreover, the trained ANN model could simulate non-linear interactions between operational parameters and ARG/MGE abundance changes that could be interpreted intuitively based on existing knowledge. Overall, this study demonstrates that machine learning can enable a reliable predictive model that can provide a holistic optimization tool for mitigating the ARG/MGE transmission potential of AD.

Keywords: Anaerobic digestion; Antibiotic resistance genes; Machine learning; Mobile genetic elements.

MeSH terms

  • Algorithms
  • Anaerobiosis
  • Anti-Bacterial Agents* / pharmacology
  • Drug Resistance, Microbial / genetics
  • Ecosystem
  • Food
  • Genes, Bacterial
  • Machine Learning
  • Refuse Disposal*
  • Sewage

Substances

  • Anti-Bacterial Agents
  • Sewage