Identification of Smith-Magenis syndrome cases through an experimental evaluation of machine learning methods

Front Comput Neurosci. 2024 Mar 22:18:1357607. doi: 10.3389/fncom.2024.1357607. eCollection 2024.

Abstract

This research work introduces a novel, nonintrusive method for the automatic identification of Smith-Magenis syndrome, traditionally studied through genetic markers. The method utilizes cepstral peak prominence and various machine learning techniques, relying on a single metric computed by the research group. The performance of these techniques is evaluated across two case studies, each employing a unique data preprocessing approach. A proprietary data "windowing" technique is also developed to derive a more representative dataset. To address class imbalance in the dataset, the synthetic minority oversampling technique (SMOTE) is applied for data augmentation. The application of these preprocessing techniques has yielded promising results from a limited initial dataset. The study concludes that the k-nearest neighbors and linear discriminant analysis perform best, and that cepstral peak prominence is a promising measure for identifying Smith-Magenis syndrome.

Keywords: Smith–Magenis syndrome; acoustics; cepstral peak prominence; children; machine learning.

Grants and funding

The author(s) declare that financial support was received for the research, authorship, and/or publication of this article. This work was supported by the University Rey Juan Carlos, under grant 2023/00004/039-M3002.