Prediction of menarcheal status of girls using voice features

Comput Biol Med. 2018 Sep 1:100:296-304. doi: 10.1016/j.compbiomed.2017.11.005. Epub 2017 Nov 8.

Abstract

A method for evaluating the menarcheal status of girls on the basis of their voice features is presented in the paper. The registration procedure consists of voice recording and measuring 20 anthropological features. The input feature vector is a combination of voice and anthropometric parameters, counting 220 features. The optimal set of parameters was selected using five different methods: Method A - stepwise regression (first forward, then backward regression) performed on features with statistically different means/medians; Method B - stepwise regression (forward and backward) on all features, with age; Method C - stepwise regression as in B; including age, Method D - all features with statistically different means/medians, Method E - all features excluding age. For classification purposes three methods were employed: random forest (RF), support vector machine (SVM) and linear discriminant analysis (LDA) classifier. They were tested with 10-fold cross validation. The classification accuracy for RF using only voice features is higher than using only anthropometric data: 86.86% vs. 81.02% respectively. For the other two classifiers, the results do not show as large a difference: 80.60% vs. 82.80% for SVM and 80.66% vs. 82.34% for LDA. The advantage of voice features is more noticeable with sensitivity: 91.92% vs. 83.06% for RF. The obtained results suggest that the presented method can be used for automatic recognition of girls' menarcheal status using voice signal.

Keywords: Post-menarche; Pre-menarche; Puberty; Voice analysis.

MeSH terms

  • Adolescent
  • Algorithms*
  • Female
  • Humans
  • Menarche / physiology*
  • Signal Processing, Computer-Assisted*
  • Support Vector Machine*
  • Voice / physiology*