Improved binary dragonfly optimization algorithm and wavelet packet based non-linear features for infant cry classification

Comput Methods Programs Biomed. 2018 Mar:155:39-51. doi: 10.1016/j.cmpb.2017.11.021. Epub 2017 Nov 28.

Abstract

Background and objective: Infant cry signal carries several levels of information about the reason for crying (hunger, pain, sleepiness and discomfort) or the pathological status (asphyxia, deaf, jaundice, premature condition and autism, etc.) of an infant and therefore suited for early diagnosis. In this work, combination of wavelet packet based features and Improved Binary Dragonfly Optimization based feature selection method was proposed to classify the different types of infant cry signals.

Methods: Cry signals from 2 different databases were utilized. First database contains 507 cry samples of normal (N), 340 cry samples of asphyxia (A), 879 cry samples of deaf (D), 350 cry samples of hungry (H) and 192 cry samples of pain (P). Second database contains 513 cry samples of jaundice (J), 531 samples of premature (Prem) and 45 samples of normal (N). Wavelet packet transform based energy and non-linear entropies (496 features), Linear Predictive Coding (LPC) based cepstral features (56 features), Mel-frequency Cepstral Coefficients (MFCCs) were extracted (16 features). The combined feature set consists of 568 features. To overcome the curse of dimensionality issue, improved binary dragonfly optimization algorithm (IBDFO) was proposed to select the most salient attributes or features. Finally, Extreme Learning Machine (ELM) kernel classifier was used to classify the different types of infant cry signals using all the features and highly informative features as well.

Results: Several experiments of two-class and multi-class classification of cry signals were conducted. In binary or two-class experiments, maximum accuracy of 90.18% for H Vs P, 100% for A Vs N, 100% for D Vs N and 97.61% J Vs Prem was achieved using the features selected (only 204 features out of 568) by IBDFO. For the classification of multiple cry signals (multi-class problem), the selected features could differentiate between three classes (N, A & D) with the accuracy of 100% and seven classes with the accuracy of 97.62%.

Conclusion: The experimental results indicated that the proposed combination of feature extraction and selection method offers suitable classification accuracy and may be employed to detect the subtle changes in the cry signals.

Keywords: Feature extraction; Feature selection; Infant cry signal; Optimization and classification.

MeSH terms

  • Algorithms*
  • Asphyxia / physiopathology
  • Crying*
  • Databases, Factual
  • Deafness / physiopathology
  • Humans
  • Hunger
  • Infant
  • Jaundice / physiopathology
  • Machine Learning
  • Neural Networks, Computer
  • Nonlinear Dynamics
  • Pain / physiopathology
  • Reproducibility of Results
  • Signal Processing, Computer-Assisted
  • Wavelet Analysis*