Automated segmentation of linear time-frequency representations of marine-mammal sounds

J Acoust Soc Am. 2013 Sep;134(3):2546-55. doi: 10.1121/1.4816579.

Abstract

Many marine mammals produce highly nonlinear frequency modulations. Determining the time-frequency support of these sounds offers various applications, which include recognition, localization, and density estimation. This study introduces a low parameterized automated spectrogram segmentation method that is based on a theoretical probabilistic framework. In the first step, the background noise in the spectrogram is fitted with a Chi-squared distribution and thresholded using a Neyman-Pearson approach. In the second step, the number of false detections in time-frequency regions is modeled as a binomial distribution, and then through a Neyman-Pearson strategy, the time-frequency bins are gathered into regions of interest. The proposed method is validated on real data of large sequences of whistles from common dolphins, collected in the Bay of Biscay (France). The proposed method is also compared with two alternative approaches: the first is smoothing and thresholding of the spectrogram; the second is thresholding of the spectrogram followed by the use of morphological operators to gather the time-frequency bins and to remove false positives. This method is shown to increase the probability of detection for the same probability of false alarms.

Publication types

  • Comparative Study
  • Research Support, Non-U.S. Gov't
  • Validation Study

MeSH terms

  • Acoustics*
  • Algorithms
  • Animals
  • Chi-Square Distribution
  • Common Dolphins / physiology*
  • Common Dolphins / psychology
  • Environmental Monitoring / methods*
  • France
  • Linear Models*
  • Marine Biology / methods*
  • Oceans and Seas
  • Pattern Recognition, Automated*
  • Reproducibility of Results
  • Signal Processing, Computer-Assisted
  • Sound Spectrography
  • Time Factors
  • Vocalization, Animal*