On the reliability of acoustic annotations and automatic detections of Antarctic blue whale calls under different acoustic conditions

J Acoust Soc Am. 2018 Aug;144(2):740. doi: 10.1121/1.5049803.

Abstract

Evaluation of the performance of computer-based algorithms for automatically detecting mammalian vocalizations often relies on comparing detector outputs against a reference data set, generally obtained by manual annotation of acoustic recordings. To explore the reproducibility of such annotations, inter- and intra-analyst variability in manual annotations of Antarctic blue whale (ABW) Z-calls is investigated for two analysts working on acoustic data from two ocean basins that represent different scenarios of call abundance and background noise. The manual annotations exhibit strong inter- and intra-analyst variability, with less than 50% agreement between analysts. This variability is mainly caused by the difficulty of reliably and reproducibly distinguishing single calls within an ABW chorus composed of overlapping distant calls. Furthermore, the performance of two automated detectors, one based on spectrogram correlation and the other on a subspace-detection strategy, is evaluated by comparing detector outputs to a "conservative" manually annotated reference data set comprising only the events annotated by both analysts. This study highlights the need for a standardized approach to manual annotation and automatic detection, including a quantitative description of their performance, to improve the comparability of acoustic data, which is particularly relevant in the context of collaborative approaches to collecting and analyzing large passive acoustic data sets.
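Reporting inter-analyst agreement presupposes a rule for deciding when two analysts' annotations refer to the same call. The Python sketch below shows one plausible such rule, not necessarily the one used in the paper: events whose start times fall within a tolerance are paired, and agreement is reported relative to the union of events claimed by either analyst. The annotation times and the 2 s tolerance are hypothetical values chosen for the example.

```python
def match_annotations(times_a, times_b, tol=2.0):
    """Greedily pair events from two lists of call start times
    (seconds) that lie within `tol` seconds of each other; each
    event can be matched at most once."""
    matches = []
    used_b = set()
    for ta in times_a:
        # Find the closest unused event in B within the tolerance.
        best, best_dt = None, tol
        for j, tb in enumerate(times_b):
            if j in used_b:
                continue
            dt = abs(ta - tb)
            if dt <= best_dt:
                best, best_dt = j, dt
        if best is not None:
            used_b.add(best)
            matches.append((ta, times_b[best]))
    return matches


# Hypothetical annotation start times (s) from two analysts.
analyst_a = [12.0, 45.3, 78.1, 102.6, 130.0]
analyst_b = [12.5, 46.0, 101.9, 150.2]

matched = match_annotations(analyst_a, analyst_b)
# Agreement relative to all unique events claimed by either analyst.
n_union = len(analyst_a) + len(analyst_b) - len(matched)
print(f"matched {len(matched)} events; agreement = {len(matched) / n_union:.0%}")
```

The matched pairs also illustrate how a "conservative" reference set could be formed: only the events present in both analysts' annotations are retained.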

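For the detection side, the code below is a rough, self-contained sketch of the general spectrogram-correlation strategy, not the authors' implementation: a zero-mean time-frequency kernel is slid along the spectrogram of a synthetic recording, and robustly thresholded correlation peaks are reported as detections. The sampling rate, the 25-27 Hz band (loosely motivated by the roughly 26 Hz tonal unit of the Z-call), the durations, and the threshold are all assumed for illustration.

```python
import numpy as np
from scipy.signal import spectrogram, correlate2d

fs = 250  # Hz; assumed sampling rate of a low-frequency recorder

# Synthetic 2-minute recording with two 8 s tonal "calls" near 26 Hz;
# amplitudes and onset times are illustrative.
rng = np.random.default_rng(0)
t = np.arange(0, 120, 1 / fs)
x = 0.05 * rng.standard_normal(t.size)
for onset in (20.0, 70.0):
    seg = (t >= onset) & (t < onset + 8.0)
    x[seg] += 0.2 * np.sin(2 * np.pi * 26.0 * t[seg])

f, frames, S = spectrogram(x, fs=fs, nperseg=512, noverlap=384)
S = 10 * np.log10(S + 1e-12)
S = (S - S.mean()) / S.std()  # standardize the log spectrogram

# Time-frequency kernel: flat in the 25-27 Hz band for the expected
# call duration, zero-mean so featureless noise scores near zero.
hop = frames[1] - frames[0]             # time step between frames (s)
n_t = int(round(8.0 / hop))             # kernel length in frames
band = ((f >= 25.0) & (f <= 27.0)).astype(float)
kernel = np.tile(band[:, None], (1, n_t))
kernel -= kernel.mean()
kernel /= np.linalg.norm(kernel)

# Slide the kernel along the time axis (it spans all frequencies).
score = correlate2d(S, kernel, mode="valid")[0]

# Robust threshold: median + 8 * median absolute deviation, so the
# call peaks themselves do not inflate the noise estimate.
med = np.median(score)
mad = np.median(np.abs(score - med))
above = score > med + 8 * mad

# Report each rising edge of the above-threshold mask as one detection.
onsets = np.flatnonzero(np.diff(above.astype(int)) == 1) + 1
print("detected call onsets (s):", frames[onsets].round(1))
```

In a real analysis, the kernel would be derived from measured Z-call contours and the threshold calibrated against data; the resulting detections would then be scored against the manually annotated reference set, as done in the study.
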
Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Acoustics / instrumentation*
  • Animals
  • Balaenoptera / physiology*
  • Noise / adverse effects
  • Reference Standards
  • Reproducibility of Results
  • Signal-To-Noise Ratio
  • Vocalization, Animal*