Identification of digital voice biomarkers for cognitive health

Honghuang Lin; Cody Karjadi; Ting F A Ang; Joshi Prajakta; Chelsea McManus; Tuka W Alhanai; James Glass; Rhoda Au

doi:10.37349/emed.2020.00028

Identification of digital voice biomarkers for cognitive health

Explor Med. 2020:1:406-417. doi: 10.37349/emed.2020.00028. Epub 2020 Dec 31.

Authors

Honghuang Lin^{1

2}, Cody Karjadi^{2

3}, Ting F A Ang^{2

3

4

5}, Joshi Prajakta^{2

3}, Chelsea McManus^{2

3}, Tuka W Alhanai⁶, James Glass⁷, Rhoda Au^{2

3

4

5

8}

Affiliations

¹ Section of Computational Biomedicine, Department of Medicine, Boston University School of Medicine, Boston, MA 02118, USA.
² The Framingham Heart Study, Boston University School of Medicine, Boston, MA 02118, USA.
³ Department of Anatomy and Neurobiology, Boston University School of Medicine, Boston, MA 02118, USA.
⁴ Department of Epidemiology, Boston University School of Public Health, Boston, MA 02118, USA.
⁵ Slone Epidemiology Center, Boston University School of Medicine, Boston, MA 02118, USA.
⁶ Department of Electrical and Computer Engineering, New York University Abu Dhabi, Abu Dhabi, UAE.
⁷ Computer Science and Artificial Intelligence Laboratory, Massachusetts Institute of Technology, Cambridge, MA 02139, USA.
⁸ Department of Neurology, Boston University School of Medicine, Boston, MA 02118, USA.

Abstract

Aim: Human voice contains rich information. Few longitudinal studies have been conducted to investigate the potential of voice to monitor cognitive health. The objective of this study is to identify voice biomarkers that are predictive of future dementia.

Methods: Participants were recruited from the Framingham Heart Study. The vocal responses to neuropsychological tests were recorded, which were then diarized to identify participant voice segments. Acoustic features were extracted with the OpenSMILE toolkit (v2.1). The association of each acoustic feature with incident dementia was assessed by Cox proportional hazards models.

Results: Our study included 6, 528 voice recordings from 4, 849 participants (mean age 63 ± 15 years old, 54.6% women). The majority of participants (71.2%) had one voice recording, 23.9% had two voice recordings, and the remaining participants (4.9%) had three or more voice recordings. Although all asymptomatic at the time of examination, participants who developed dementia tended to have shorter segments than those who were dementia free (P < 0.001). Additionally, 14 acoustic features were significantly associated with dementia after adjusting for multiple testing (P < 0.05/48 = 1 × 10^-3). The most significant acoustic feature was jitterDDP_sma_de (P = 7.9 × 10^-7), which represents the differential frame-to-frame Jitter. A voice based linear classifier was also built that was capable of predicting incident dementia with area under curve of 0.812.

Conclusions: Multiple acoustic and linguistic features are identified that are associated with incident dementia among asymptomatic participants, which could be used to build better prediction models for passive cognitive health monitoring.

Keywords: Digital voice; acoustic features; dementia; epidemiology; prediction.

Abstract

Grants and funding