Estimating vocal tract geometry from acoustic impedance using deep neural network

JASA Express Lett. 2022 Feb;2(3):034801. doi: 10.1121/10.0009599.

Abstract

A data-driven approach using artificial neural networks is proposed to address the classic inverse area function problem, i.e., to determine the vocal tract geometry (modelled as a tube of nonuniform cylindrical cross-sections) from the vocal tract acoustic impedance spectrum. The predicted cylindrical radii and the actual radii were found to have high correlation in the three- and four-cylinder model (Pearson coefficient (ρ) and Lin concordance coefficient (ρc) exceeded 95%); however, for the six-cylinder model, the correlation was low (ρ around 75% and ρc around 69%). Upon standardizing the impedance value, the correlation improved significantly for all cases (ρ and ρc exceeded 90%).

MeSH terms

  • Acoustics*
  • Electric Impedance
  • Neural Networks, Computer*