Practical advice on variable selection and reporting using Akaike information criterion

Proc Biol Sci. 2023 Sep 27;290(2007):20231261. doi: 10.1098/rspb.2023.1261. Epub 2023 Sep 27.

Abstract

The various debates around model selection paradigms are important, but in lieu of a consensus, there is a demonstrable need for a deeper appreciation of existing approaches, at least among the end-users of statistics and model selection tools. In the ecological literature, the Akaike information criterion (AIC) dominates model selection practices, and while it is a relatively straightforward concept, there exists what we perceive to be some common misunderstandings around its application. Two specific questions arise with surprising regularity among colleagues and students when interpreting and reporting AIC model tables. The first is related to the issue of 'pretending' variables, and specifically a muddled understanding of what this means. The second is related to p-values and what constitutes statistical support when using AIC. There exists a wealth of technical literature describing AIC and the relationship between p-values and AIC differences. Here, we complement this technical treatment and use simulation to develop some intuition around these important concepts. In doing so we aim to promote better statistical practices when it comes to using, interpreting and reporting models selected when using AIC.

Keywords: ecology; information criterion; model selection; p-value; variable selection.

MeSH terms

  • Computer Simulation
  • Consensus
  • Humans
  • Intuition*
  • Students*

Associated data

  • figshare/10.6084/m9.figshare.c.6834928