AdaDiag: Adversarial Domain Adaptation of Diagnostic Prediction with Clinical Event Sequences

J Biomed Inform. 2022 Oct:134:104168. doi: 10.1016/j.jbi.2022.104168. Epub 2022 Aug 17.

Abstract

Early detection of heart failure (HF) can provide patients with the opportunity for more timely intervention and better disease management, as well as efficient use of healthcare resources. Recent machine learning (ML) methods have shown promising performance on diagnostic prediction using temporal sequences from electronic health records (EHRs). In practice, however, these models may not generalize to other populations due to dataset shift. Shifts in datasets can be attributed to a range of factors such as variations in demographics, data management methods, and healthcare delivery patterns. In this paper, we use unsupervised adversarial domain adaptation methods to adaptively reduce the impact of dataset shift on cross-institutional transfer performance. The proposed framework is validated on a next-visit HF onset prediction task using a BERT-style Transformer-based language model pre-trained with a masked language modeling (MLM) task. Our model empirically demonstrates superior prediction performance relative to non-adversarial baselines in both transfer directions on two different clinical event sequence data sources.

Keywords: Clinical event sequence modeling; Domain adaptation; Heart failure; Transformers.

Publication types

  • Research Support, N.I.H., Extramural

MeSH terms

  • Electronic Health Records
  • Heart Failure* / diagnosis
  • Humans
  • Information Storage and Retrieval
  • Language
  • Machine Learning
  • Neural Networks, Computer*