Quantitative assessment of videolaryngostroboscopic images in patients with glottic pathologies

Logoped Phoniatr Vocol. 2017 Jul;42(2):73-83. doi: 10.3109/14015439.2016.1174293. Epub 2016 May 2.

Abstract

Introduction: Digital imaging techniques enable exploration of novel visualization modalities of the vocal folds during phonation and definition of parameters, facilitating more precise diagnosis of voice disorders.

Aim: Application of computer vision algorithms for analysis of videolaryngostroboscopic (VLS) images aimed at qualitative and quantitative description of phonatory vibrations.

Materials and methods: VLS examinations were conducted for 45 females, including 15 subjects with vocal nodules, 15 subjects with glottal incompetence, and 15 normophonic females. The recorded VLS images were preprocessed, the glottis area was segmented out, and the glottal cycles were identified. The glottovibrograms were built, and then the glottal area waveforms (GAW) were quantitatively described by computing the following parameters: open quotient (OQ), closing quotient (CQ), speed quotient (SQ), minimal relative glottal area (MRGA), and a new parameter termed closure difference index (CDI).

Results: Profiles of the glottal widths assessed along the glottal length differentiated the study groups (P < 0.001). Moreover, it was shown that the OQ, CQ, CDI, and MRGA indices can be considered as viable parameters for quantifying kinematics of the vocal folds for normophonic subjects and patients with diagnosed vocal nodules and glottal incompetence (P < 0.001).

Conclusions: Computer image processing and analysis methods applied to videolaryngostroboscopic images allow for their quantitative assessment. Computation of the size-related and time-related parameters characterizing glottic pathologies is of interest for evidence-based voice diagnostics. Results of the performed ROC curve analysis suggest that the evaluated parameters can distinguish patients with voice disorders from normophonic subjects.

Keywords: Glottal pathologies; glottovibrograms; quantitative analysis; software for processing images of larynx; videolaryngostroboscopy; voice disorders.

MeSH terms

  • Adult
  • Algorithms
  • Area Under Curve
  • Biomechanical Phenomena
  • Case-Control Studies
  • Female
  • Humans
  • Image Interpretation, Computer-Assisted
  • Laryngeal Diseases / diagnostic imaging*
  • Laryngeal Diseases / pathology
  • Laryngeal Diseases / physiopathology
  • Laryngoscopy / methods*
  • Middle Aged
  • Phonation
  • Predictive Value of Tests
  • ROC Curve
  • Stroboscopy*
  • Time Factors
  • Vibration
  • Video Recording*
  • Vocal Cords / diagnostic imaging*
  • Vocal Cords / pathology
  • Vocal Cords / physiopathology
  • Voice Disorders / diagnostic imaging*
  • Voice Disorders / pathology
  • Voice Disorders / physiopathology
  • Voice Quality
  • Young Adult