Using deep learning and electronic health records to detect Noonan syndrome in pediatric patients

Zeyu Yang; Amy Shikany; Yizhao Ni; Ge Zhang; K Nicole Weaver; Jing Chen

doi:10.1016/j.gim.2022.08.002

Using deep learning and electronic health records to detect Noonan syndrome in pediatric patients

Genet Med. 2022 Nov;24(11):2329-2337. doi: 10.1016/j.gim.2022.08.002. Epub 2022 Sep 13.

Authors

Zeyu Yang¹, Amy Shikany², Yizhao Ni¹, Ge Zhang³, K Nicole Weaver⁴, Jing Chen⁵

Affiliations

¹ Division of Biomedical Informatics, Cincinnati Children's Hospital Medical Center, Cincinnati Children's Research Foundation, Cincinnati, OH; Department of Pediatrics, University of Cincinnati College of Medicine, Cincinnati, OH.
² Heart Institute, Cincinnati Children's Hospital Medical Center, Cincinnati, OH.
³ Department of Pediatrics, University of Cincinnati College of Medicine, Cincinnati, OH; Division of Human Genetics, Cincinnati Children's Hospital Medical Center, Cincinnati, OH.
⁴ Department of Pediatrics, University of Cincinnati College of Medicine, Cincinnati, OH; Heart Institute, Cincinnati Children's Hospital Medical Center, Cincinnati, OH; Division of Human Genetics, Cincinnati Children's Hospital Medical Center, Cincinnati, OH.
⁵ Division of Biomedical Informatics, Cincinnati Children's Hospital Medical Center, Cincinnati Children's Research Foundation, Cincinnati, OH; Department of Pediatrics, University of Cincinnati College of Medicine, Cincinnati, OH. Electronic address: jing.chen2@cchmc.org.

PMID: 36098741
DOI: 10.1016/j.gim.2022.08.002

Abstract

Purpose: The variable expressivity and multisystem features of Noonan syndrome (NS) make it difficult for patients to obtain a timely diagnosis. Genetic testing can confirm a diagnosis, but underdiagnosis is prevalent owing to a lack of recognition and referral for testing. Our study investigated the utility of using electronic health records (EHRs) to identify patients at high risk of NS.

Methods: Using diagnosis texts extracted from Cincinnati Children's Hospital's EHR database, we constructed deep learning models from 162 NS cases and 16,200 putative controls. Performance was evaluated on 2 independent test sets, one containing patients with NS who were previously diagnosed and the other containing patients with undiagnosed NS.

Results: Our novel method performed significantly better than the previous method, with the convolutional neural network model achieving the highest area under the precision-recall curve in both test sets (diagnosed: 0.43, undiagnosed: 0.16).

Conclusion: The results suggested the validity of using text-based deep learning methods to analyze EHR and showed the value of this approach as a potential tool to identify patients with features of rare diseases. Given the paucity of medical geneticists, this has the potential to reduce disease underdiagnosis by prioritizing patients who will benefit most from a genetics referral.

Keywords: Deep learning; Diagnostics; Electronic health records; Mendelian disorder; Noonan syndrome.

Publication types

Research Support, Non-U.S. Gov't

MeSH terms

Child
Databases, Factual
Deep Learning*
Electronic Health Records
Genetic Testing
Humans
Noonan Syndrome* / diagnosis
Noonan Syndrome* / genetics