Enhancing predictive performance for spectroscopic studies in wildlife science through a multi-model approach: A case study for species classification of live amphibians

PLoS Comput Biol. 2024 Feb 14;20(2):e1011876. doi: 10.1371/journal.pcbi.1011876. eCollection 2024 Feb.

Abstract

Near infrared spectroscopy coupled with predictive modeling is a growing field of study for addressing questions in wildlife science aimed at improving management strategies and conservation outcomes for managed and threatened fauna. To date, the majority of spectroscopic studies in wildlife and fisheries applied chemometrics and predictive modeling with a single-algorithm approach. By contrast, multi-model approaches are used routinely for analyzing spectroscopic datasets across many major industries (e.g., medicine, agriculture) to maximize predictive outcomes for real-world applications. In this study, we conducted a benchmark modeling exercise to compare the performance of several machine learning algorithms in a multi-class problem utilizing a multivariate spectroscopic dataset obtained from live animals. Spectra obtained from live individuals representing eleven amphibian species were classified according to taxonomic designation. Seven modeling techniques were applied to generate prediction models, which varied significantly (p < 0.05) with regard to mean classification accuracy (e.g., support vector machine: 95.8 ± 0.8% vs. K-nearest neighbors: 89.3 ± 1.0%). Through the use of a multi-algorithm approach, candidate algorithms can be identified and applied to more effectively model complex spectroscopic data collected for wildlife sciences. Other key considerations in the predictive modeling workflow that serve to optimize spectroscopic model performance (e.g., variable selection and cross-validation procedures) are also discussed.

MeSH terms

  • Algorithms*
  • Animals
  • Animals, Wild*
  • Humans
  • Machine Learning
  • Spectroscopy, Near-Infrared
  • Support Vector Machine

Grants and funding

This project was supported by the U.S. Department of Agriculture, Agricultural Research Service, Biophotonics project #6066-31000-015-00D (CKK) under National Institute of Food and Agriculture Hatch project accession numbers W3173 and W4173, funding partial salary for LDC; an Institute of Museum and Library Services National Leadership Grant on Amphibian Sustainable Collections Management grant #MG-30-17-0052-17 (CKK, AJK); the Association of Zoos and Aquariums and the Disney Conservation Grant Funds CGF16-1396 and CGF-19-1618 (AJK, CKK); and the U.S. Forest Service International Programs co-operative agreement, grant #18-DG-11132762-248, on 'Coupled Forest-Human Ecosystems: Connecting Forest Management with Conservation Initiatives' (AJK, CKK), funding partial salary for LDC. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.