A machine learning approach for classifying and quantifying acoustic diversity

Sara C Keen; Karan J Odom; Michael S Webster; Gregory M Kohn; Timothy F Wright; Marcelo Araya-Salas

doi:10.1111/2041-210x.13599

A machine learning approach for classifying and quantifying acoustic diversity

Methods Ecol Evol. 2021 Jul;12(7):1213-1225. doi: 10.1111/2041-210x.13599. Epub 2021 Mar 25.

Authors

Sara C Keen^{1

2

3}, Karan J Odom³, Michael S Webster^{2

3}, Gregory M Kohn⁴, Timothy F Wright⁵, Marcelo Araya-Salas⁶

Affiliations

¹ Center for Conservation Bioacoustics, Cornell Lab of Ornithology, Cornell University, Ithaca, NY, 14850, USA.
² Department of Neurobiology and Behavior, Cornell University, Ithaca, NY, 14850, USA.
³ Cornell Lab of Ornithology, Cornell University, Ithaca, NY, 14850, USA.
⁴ Department of Psychology, University of North Florida, Jacksonville, FL, 32224, USA.
⁵ Department of Biology, New Mexico State University, Las Cruces, NM 88003, USA.
⁶ Sede del Sur, Universidad de Costa Rica, Golfito, 60701, Costa Rica.

Abstract

1. Assessing diversity of discretely varying behavior is a classical ethological problem. In particular, the challenge of calculating an individuals' or species' vocal repertoire size is often an important step in ecological and behavioral studies, but a reproducible and broadly applicable method for accomplishing this task is not currently available. 2. We offer a generalizable method to automate the calculation and quantification of acoustic diversity using an unsupervised random forest framework. We tested our method using natural and synthetic datasets of known repertoire sizes that exhibit standardized variation in common acoustic features as well as in recording quality. We tested two approaches to estimate acoustic diversity using the output from unsupervised random forest analyses: (i) cluster analysis to estimate the number of discrete acoustic signals (e.g., repertoire size) and (ii) an estimation of acoustic area in acoustic feature space, as a proxy for repertoire size. 3. We find that our unsupervised analyses classify acoustic structure with high accuracy. Specifically, both approaches accurately estimate element diversity when repertoire size is small to intermediate (5-20 unique elements). However, for larger datasets (20-100 unique elements), we find that calculating the size of the area occupied in acoustic space is a more reliable proxy for estimating repertoire size. 4. We conclude that our implementation of unsupervised random forest analysis offers a generalizable tool that researchers can apply to classify acoustic structure of diverse datasets. Additionally, output from these analyses can be used to compare the distribution and diversity of signals in acoustic space, creating opportunities to quantify and compare the amount of acoustic variation among individuals, populations, or species in a standardized way. We provide R code and examples to aid researchers interested in using these techniques.

Keywords: Acoustic diversity; acoustic space; classification; data augmentation; random forest; repertoire size; unsupervised machine learning; vocal signals.

Grants and funding

SC1 GM112582/GM/NIGMS NIH HHS/United States