Aci-bench: a Novel Ambient Clinical Intelligence Dataset for Benchmarking Automatic Visit Note Generation

Wen-Wai Yim; Yujuan Fu; Asma Ben Abacha; Neal Snider; Thomas Lin; Meliha Yetisgen

doi:10.1038/s41597-023-02487-3

Aci-bench: a Novel Ambient Clinical Intelligence Dataset for Benchmarking Automatic Visit Note Generation

Sci Data. 2023 Sep 6;10(1):586. doi: 10.1038/s41597-023-02487-3.

Authors

Wen-Wai Yim¹, Yujuan Fu², Asma Ben Abacha³, Neal Snider⁴, Thomas Lin³, Meliha Yetisgen²

Affiliations

¹ Microsoft, Health AI, Redmond, 98052, USA. yimwenwai@microsoft.com.
² University of Washington, Biomedical and Health Informatics, Seattle, 98109, USA.
³ Microsoft, Health AI, Redmond, 98052, USA.
⁴ Nuance Communications, Healthcare R&D, Burlington, 01803, USA.

Abstract

Recent immense breakthroughs in generative models such as in GPT4 have precipitated re-imagined ubiquitous usage of these models in all applications. One area that can benefit by improvements in artificial intelligence (AI) is healthcare. The note generation task from doctor-patient encounters, and its associated electronic medical record documentation, is one of the most arduous time-consuming tasks for physicians. It is also a natural prime potential beneficiary to advances in generative models. However with such advances, benchmarking is more critical than ever. Whether studying model weaknesses or developing new evaluation metrics, shared open datasets are an imperative part of understanding the current state-of-the-art. Unfortunately as clinic encounter conversations are not routinely recorded and are difficult to ethically share due to patient confidentiality, there are no sufficiently large clinic dialogue-note datasets to benchmark this task. Here we present the Ambient Clinical Intelligence Benchmark (ACI-BENCH) corpus, the largest dataset to date tackling the problem of AI-assisted note generation from visit dialogue. We also present the benchmark performances of several common state-of-the-art approaches.

Publication types

Dataset

MeSH terms

Artificial Intelligence*
Benchmarking*
Electronic Health Records
Health Facilities*
Humans