Harnessing Machine Learning in Vocal Arts Medicine: A Random Forest Application for "Fach" Classification in Opera

Diagnostics (Basel). 2023 Sep 6;13(18):2870. doi: 10.3390/diagnostics13182870.

Abstract

Vocal arts medicine provides care and prevention strategies for professional voice disorders in performing artists. The issue of correct "Fach" determination depending on the presence of a lyric or dramatic voice structure is of crucial importance for opera singers, as chronic overuse often leads to vocal fold damage. To avoid phonomicrosurgery or prevent a premature career end, our aim is to offer singers an improved, objective fach counseling using digital sound analyses and machine learning procedures. For this purpose, a large database of 2004 sound samples from professional opera singers was compiled. Building on this dataset, we employed a classic ensemble learning method, namely the Random Forest algorithm, to construct an efficient fach classifier. This model was trained to learn from features embedded within the sound samples, subsequently enabling voice classification as either lyric or dramatic. As a result, the developed system can decide with an accuracy of about 80% in most examined voice types whether a sound sample has a lyric or dramatic character. To advance diagnostic tools and health in vocal arts medicine and singing voice pedagogy, further machine learning methods will be applied to find the best and most efficient classification method based on artificial intelligence approaches.

Keywords: digital sound analysis; dramatic voice structure; lyric voice structure; machine learning; opera singer; random forest; vocal arts medicine; voice classification; voice disorder prevention; voice timbre parameter.

Grants and funding

This research received no external funding.