The importance of making testable predictions: A cautionary tale

PLoS One. 2020 Dec 8;15(12):e0236541. doi: 10.1371/journal.pone.0236541. eCollection 2020.

Abstract

We found a startling correlation (Pearson ρ > 0.97) between a single event in daily sea surface temperatures each spring, and peak fish egg abundance measurements the following summer, in 7 years of approximately weekly fish egg abundance data collected at Scripps Pier in La Jolla California. Even more surprising was that this event-based result persisted despite the large and variable number of fish species involved (up to 46), and the large and variable time interval between trigger and response (up to ~3 months). To mitigate potential over-fitting, we made an out-of-sample prediction beyond the publication process for the peak summer egg abundance observed at Scripps Pier in 2020 (available on bioRxiv). During peer-review, the prediction failed, and while it would be tempting to explain this away as a result of the record-breaking toxic algal bloom that occurred during the spring (9x higher concentration of dinoflagellates than ever previously recorded), a re-examination of our methodology revealed a potential source of over-fitting that had not been evaluated for robustness. This cautionary tale highlights the importance of testable true out-of-sample predictions of future values that cannot (even accidentally) be used in model fitting, and that can therefore catch model assumptions that may otherwise escape notice. We believe that this example can benefit the current push towards ecology as a predictive science and support the notion that predictions should live and die in the public domain, along with the models that made them.

Publication types

  • Research Support, Non-U.S. Gov't
  • Research Support, U.S. Gov't, Non-P.H.S.

MeSH terms

  • Animals
  • Dinoflagellida / growth & development
  • Ecology
  • Eggs
  • Environmental Monitoring
  • Fishes / growth & development*
  • Seasons
  • Temperature

Grants and funding

This work was supported by DoD-Strategic Environmental Research and Development Program 15 RC-2509 (GS), NSF DEB-1655203 (GS), NSF ABI-1667584 (GS), DOI USDI-NPS P20AC00527 (GS), the Scripps Institution of Oceanography Postdoctoral Fellowship (TL), the McQuown Fund and the McQuown Chair in Natural Sciences, University of California, San Diego (GS). Fish egg collection and identification was supported in part by the Richard Grand Foundation and the California Ocean Protection Council R/OPCSFAQ-12 (RB).