Introducing zoid: A mixture model and R package for modeling proportional data with zeros and ones in ecology

Ecology. 2022 Nov;103(11):e3804. doi: 10.1002/ecy.3804. Epub 2022 Aug 17.

Abstract

Many ecological data sets are proportional, representing mixtures of constituent elements such as species, populations, or strains. Analyses of proportional data are challenged by categories with zero observations (zeros), all observations (ones), and overdispersion. In lieu of ad hoc data adjustments, we describe and evaluate a zero-and-one inflated Dirichlet regression model, with its corresponding R package (zoid), capable of handling observed data x $$ x $$ consisting of three possible categories: zeros, proportions, or ones. Instead of fitting the model to observations of single biological units (e.g., individual organisms) within a sample, we sum proportional contributions across units and estimate mixture proportions using one aggregated observation per sample. Optional estimation of overdispersion and covariate influences expand model applications. We evaluate model performance, as implemented in Stan, using simulations and two ecological case studies. We show that zoid successfully estimates mixture proportions using simulated data with varying sample sizes and is robust to overdispersion and covariate structure. In empirical case studies, we estimate the composition of a mixed-stock Chinook salmon (Oncorhynchus tshawytscha) fishery and analyze the stomach contents of Atlantic cod (Gadus morhua). Our implementation of the model as an R package facilitates its application to varied ecological data sets composed of proportional observations.

Keywords: diet composition; mixed-stock fishery; mixture model; overdispersion; proportional data; zero-and-one inflated Dirichlet regression.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Animals
  • Fisheries
  • Models, Statistical*
  • Research Design
  • Salmon
  • Software*