Correlating Perceptual Voice Quality in Adductor Spasmodic Dysphonia With Computer Vision Assessment of Glottal Geometry Dynamics

Quinn A Peterson; Teng Fei; Lauren E Sy; Laura L O Froeschke; Abie H Mendelsohn; Gerald S Berke; David A Peterson

doi:10.1044/2022_JSLHR-22-00053

Correlating Perceptual Voice Quality in Adductor Spasmodic Dysphonia With Computer Vision Assessment of Glottal Geometry Dynamics

J Speech Lang Hear Res. 2022 Oct 17;65(10):3695-3708. doi: 10.1044/2022_JSLHR-22-00053. Epub 2022 Sep 21.

Authors

Quinn A Peterson¹, Teng Fei², Lauren E Sy², Laura L O Froeschke³, Abie H Mendelsohn⁴, Gerald S Berke⁴, David A Peterson⁵

Affiliations

¹ Department of Computer Science and Software Engineering, California Polytechnic State University, San Luis Obispo.
² Department of Cognitive Science, University of California, San Diego, La Jolla.
³ Department of Communication Sciences and Disorders, Elmhurst University, IL.
⁴ Department of Head and Neck Surgery, David Geffen School of Medicine, University of California, Los Angeles.
⁵ Institute for Neural Computation, University of California, San Diego, La Jolla.

Abstract

Purpose: This study examined the relationship between voice quality and glottal geometry dynamics in patients with adductor spasmodic dysphonia (ADSD).

Method: An objective computer vision and machine learning system was developed to extract glottal geometry dynamics from nasolaryngoscopic video recordings for 78 patients with ADSD. General regression models were used to examine the relationship between overall voice quality and 15 variables that capture glottal geometry dynamics derived from the computer vision system. Two experts in ADSD independently rated voice quality for two separate voice tasks for every patient, yielding four different voice quality rating models.

Results: All four of the regression models exhibited positive correlations with clinical assessments of voice quality (R ²s = .30-.34, Spearman rho = .55-.61, all with p < .001). Seven to 10 variables were included in each model. There was high overlap in the variables included between the four models, and the sign of the correlation with voice quality was consistent for each variable across all four regression models.

Conclusion: We found specific glottal geometry dynamics that correspond to voice quality in ADSD.

Publication types

Research Support, N.I.H., Extramural
Research Support, U.S. Gov't, Non-P.H.S.
Research Support, Non-U.S. Gov't

MeSH terms

Computers
Dysphonia* / diagnosis
Glottis
Humans
Voice Quality
Voice*

Abstract

Publication types

MeSH terms

Grants and funding