Generalized perceptual linear prediction features for animal vocalization analysis

Patrick J Clemins; Michael T Johnson

doi:10.1121/1.2203596

Generalized perceptual linear prediction features for animal vocalization analysis

J Acoust Soc Am. 2006 Jul;120(1):527-34. doi: 10.1121/1.2203596.

Authors

Patrick J Clemins¹, Michael T Johnson

Affiliation

¹ Speech and Signal Processing Laboratory, Marquette University, P.O. Box 1881, Milwaukee, Wisconsin 53233-1881, USA. patrick.clemins@marquette.edu

PMID: 16875249
DOI: 10.1121/1.2203596

Abstract

A new feature extraction model, generalized perceptual linear prediction (gPLP), is developed to calculate a set of perceptually relevant features for digital signal analysis of animal vocalizations. The gPLP model is a generalized adaptation of the perceptual linear prediction model, popular in human speech processing, which incorporates perceptual information such as frequency warping and equal loudness normalization into the feature extraction process. Since such perceptual information is available for a number of animal species, this new approach integrates that information into a generalized model to extract perceptually relevant features for a particular species. To illustrate, qualitative and quantitative comparisons are made between the species-specific model, generalized perceptual linear prediction (gPLP), and the original PLP model using a set of vocalizations collected from captive African elephants (Loxodonta africana) and wild beluga whales (Delphinapterus leucas). The models that incorporate perceptional information outperform the original human-based models in both visualization and classification tasks.

MeSH terms

Acoustics*
Analysis of Variance
Animals
Auditory Perception / physiology
Elephants / physiology*
Humans
Linear Models
Models, Biological*
Sound Spectrography
Vocalization, Animal / physiology*