Towards spoken clinical-question answering: evaluating and adapting automatic speech-recognition systems for spoken clinical questions

J Am Med Inform Assoc. 2011 Sep-Oct;18(5):625-30. doi: 10.1136/amiajnl-2010-000071. Epub 2011 Jun 24.

Abstract

Objective: To evaluate existing automatic speech-recognition (ASR) systems to measure their performance in interpreting spoken clinical questions and to adapt one ASR system to improve its performance on this task.

Design and measurements: The authors evaluated two well-known ASR systems on spoken clinical questions: Nuance Dragon (both the generic and medical versions: Nuance Gen and Nuance Med) and SRI Decipher (the generic version, SRI Gen). They also explored language-model adaptation, using more than 4000 clinical questions to improve the SRI system's performance, and profile training to improve the performance of the Nuance Med system. Results were reported using the NIST-standard word error rate (WER), and error patterns were further analyzed at the semantic level.

Results: The Nuance Gen and Nuance Med systems yielded WERs of 68.1% and 67.4%, respectively. The SRI Gen system performed better, attaining a WER of 41.5%. After language-model domain adaptation, the SRI system's performance improved by 36% relative, to a final WER of 26.7%.
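The WER metric and the relative-improvement figure behind these results can be sketched as follows. This is a minimal illustrative implementation of word-level WER via Levenshtein edit distance, not the NIST scoring tool (sclite) the study would have used; the function name `wer` is ours:

```python
def wer(reference: str, hypothesis: str) -> float:
    """Word error rate: word-level edit distance divided by reference length.

    Counts substitutions, insertions, and deletions against the
    reference transcript, per the standard WER definition.
    """
    ref, hyp = reference.split(), hypothesis.split()
    # dp[i][j] = edit distance between ref[:i] and hyp[:j]
    dp = [[0] * (len(hyp) + 1) for _ in range(len(ref) + 1)]
    for i in range(len(ref) + 1):
        dp[i][0] = i  # deleting all i reference words
    for j in range(len(hyp) + 1):
        dp[0][j] = j  # inserting all j hypothesis words
    for i in range(1, len(ref) + 1):
        for j in range(1, len(hyp) + 1):
            substitution = dp[i - 1][j - 1] + (ref[i - 1] != hyp[j - 1])
            deletion = dp[i - 1][j] + 1
            insertion = dp[i][j - 1] + 1
            dp[i][j] = min(substitution, deletion, insertion)
    return dp[len(ref)][len(hyp)] / len(ref)

# One inserted word against a 4-word reference gives WER = 1/4.
print(wer("the patient has fever", "the patient has a fever"))

# The abstract's "improved 36%" is a relative reduction in WER:
relative_gain = (41.5 - 26.7) / 41.5
print(round(relative_gain, 3))  # roughly 0.36
```

Note that WER can exceed 100% when the hypothesis contains many insertions, which is one reason the unadapted systems' errors on out-of-domain clinical speech can look so severe.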

Conclusion: Without modification, two well-known ASR systems perform poorly in interpreting spoken clinical questions. With simple domain adaptation, one of the ASR systems improved significantly on the clinical-question task, indicating the importance of developing domain- and genre-specific ASR systems.

Publication types

  • Comparative Study
  • Research Support, N.I.H., Extramural
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Data Mining*
  • Decision Support Systems, Clinical*
  • Humans
  • Natural Language Processing*
  • Semantics
  • Software Design
  • Speech Recognition Software*
  • Technology Assessment, Biomedical
  • Unified Medical Language System