Artificial Intelligence Algorithms and Natural Language Processing for the Recognition of Syncope Patients on Emergency Department Medical Records

Franca Dipaola; Mauro Gatti; Veronica Pacetti; Anna Giulia Bottaccioli; Dana Shiffer; Maura Minonzio; Roberto Menè; Alessandro Giaj Levra; Monica Solbiati; Giorgio Costantino; Marco Anastasio; Elena Sini; Franca Barbic; Enrico Brunetta; Raffaello Furlan

doi:10.3390/jcm8101677

J Clin Med. 2019 Oct 14;8(10):1677. doi: 10.3390/jcm8101677.

Authors

Franca Dipaola¹,², Mauro Gatti³, Veronica Pacetti⁴, Anna Giulia Bottaccioli⁵, Dana Shiffer⁶,⁷, Maura Minonzio⁸,⁹, Roberto Menè¹⁰, Alessandro Giaj Levra¹¹, Monica Solbiati¹², Giorgio Costantino¹³, Marco Anastasio¹⁴, Elena Sini¹⁵, Franca Barbic¹⁶,¹⁷, Enrico Brunetta¹⁸,¹⁹, Raffaello Furlan²⁰,²¹

Affiliations

¹ Internal Medicine, Humanitas Clinical and Research Center- IRCCS, 20089 Rozzano, Milan, Italy. franca.dipaola@humanitas.it.
² Department of Biomedical Sciences, Humanitas University, 20090 Pieve Emanuele, Milan, Italy. franca.dipaola@humanitas.it.
³ IBM Italy, 20090 Segrate, Milan, Italy. MAURO_GATTI@it.ibm.com.
⁴ Centro Trombosi e Malattie Emorragiche, Humanitas Clinical and Research Center- IRCCS, 20089 Rozzano, Milan, Italy. v_pacetti@hotmail.com.
⁵ Faculty of Psychology, "Vita-Salute San Raffaele" University, 20132 Milan, Italy. annagiulia.bottaccioli@gmail.com.
⁶ Internal Medicine, Humanitas Clinical and Research Center- IRCCS, 20089 Rozzano, Milan, Italy. dana.shiffer@humanitas.it.
⁷ Department of Biomedical Sciences, Humanitas University, 20090 Pieve Emanuele, Milan, Italy. dana.shiffer@humanitas.it.
⁸ Internal Medicine, Humanitas Clinical and Research Center- IRCCS, 20089 Rozzano, Milan, Italy. maura.minonzio@humanitas.it.
⁹ Department of Biomedical Sciences, Humanitas University, 20090 Pieve Emanuele, Milan, Italy. maura.minonzio@humanitas.it.
¹⁰ IBM Italy, 20090 Segrate, Milan, Italy. meneroberto@gmail.com.
¹¹ Internal Medicine, Humanitas Clinical and Research Center- IRCCS, 20089 Rozzano, Milan, Italy. alessandro.giajlevra@st.hunimed.eu.
¹² Pronto Soccorso e Medicina D'Urgenza, Fondazione IRCCS Ca' Granda Ospedale Maggiore Policlinico, Università degli Studi di Milano, 20122 Milan, Italy. monica.solbiati@gmail.com.
¹³ Pronto Soccorso e Medicina D'Urgenza, Fondazione IRCCS Ca' Granda Ospedale Maggiore Policlinico, Università degli Studi di Milano, 20122 Milan, Italy. giorgic2@gmail.com.
¹⁴ ICT Department, Humanitas Clinical and Research Center- IRCCS, 20089 Rozzano, Milan, Italy. marco.anastasio@humanitas.it.
¹⁵ GVM Care & Research, 48124 Ravenna, Italy. esini@gvmnet.it.
¹⁶ Internal Medicine, Humanitas Clinical and Research Center- IRCCS, 20089 Rozzano, Milan, Italy. franca.barbic@humanitas.it.
¹⁷ Department of Biomedical Sciences, Humanitas University, 20090 Pieve Emanuele, Milan, Italy. franca.barbic@humanitas.it.
¹⁸ Internal Medicine, Humanitas Clinical and Research Center- IRCCS, 20089 Rozzano, Milan, Italy. enrico.brunetta@humanitas.it.
¹⁹ Department of Biomedical Sciences, Humanitas University, 20090 Pieve Emanuele, Milan, Italy. enrico.brunetta@humanitas.it.
²⁰ Internal Medicine, Humanitas Clinical and Research Center- IRCCS, 20089 Rozzano, Milan, Italy. raffaello.furlan@hunimed.eu.
²¹ Department of Biomedical Sciences, Humanitas University, 20090 Pieve Emanuele, Milan, Italy. raffaello.furlan@hunimed.eu.

Abstract

Background: Enrollment of large cohorts of syncope patients from administrative data is crucial for proper risk stratification but is limited by the enormous amount of time required for manual review of medical records.

Aim: To develop a Natural Language Processing (NLP) algorithm to automatically identify syncope from Emergency Department (ED) electronic medical records (EMRs).

Methods: De-identified EMRs of all consecutive patients evaluated at the Humanitas Research Hospital ED from 1 December 2013 to 31 March 2014 and from 1 December 2015 to 31 March 2016 were manually annotated to identify syncope. Records were combined into a single dataset and classified. The performance of multiple combinations of NLP feature selectors and classifiers was tested. Primary outcomes: accuracy, sensitivity, specificity, positive predictive value, negative predictive value, and F3 score of the NLP algorithms.
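The primary outcomes listed above can all be derived from the four cells of a binary confusion matrix. The following is a minimal sketch, not the authors' code: it assumes the F3 score is the standard F-beta score with beta = 3 (which weights sensitivity more heavily than positive predictive value), and the names `Confusion` and `classification_metrics` as well as the example counts are hypothetical.

```python
from dataclasses import dataclass


@dataclass
class Confusion:
    """Cells of a binary confusion matrix (syncope = positive class)."""
    tp: int  # syncope records flagged as syncope
    fp: int  # non-syncope records flagged as syncope
    tn: int  # non-syncope records flagged as non-syncope
    fn: int  # syncope records missed by the classifier


def _ratio(numerator: float, denominator: float) -> float:
    """Return numerator / denominator, or 0.0 when the denominator is zero."""
    return numerator / denominator if denominator else 0.0


def classification_metrics(c: Confusion, beta: float = 3.0) -> dict:
    """Compute the outcome measures named in the Methods from raw counts."""
    sensitivity = _ratio(c.tp, c.tp + c.fn)
    specificity = _ratio(c.tn, c.tn + c.fp)
    ppv = _ratio(c.tp, c.tp + c.fp)
    npv = _ratio(c.tn, c.tn + c.fn)
    accuracy = _ratio(c.tp + c.tn, c.tp + c.fp + c.tn + c.fn)
    b2 = beta ** 2
    # Standard F-beta definition (assumed here; the abstract does not spell it out).
    f_beta = _ratio((1 + b2) * ppv * sensitivity, b2 * ppv + sensitivity)
    return {
        "accuracy": accuracy,
        "sensitivity": sensitivity,
        "specificity": specificity,
        "ppv": ppv,
        "npv": npv,
        "f_beta": f_beta,
    }


# Hypothetical counts for illustration only; they are not taken from the study.
example = Confusion(tp=90, fp=100, tn=9800, fn=10)
scores = classification_metrics(example)
```

With beta = 3, a missed syncope record (false negative) costs the score far more than a false alarm, which suits a screening step whose output is later reviewed manually.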

Results: A total of 15,098 and 15,222 records from the 2013 and 2015 datasets, respectively, were analyzed. Syncope was present in 571 records. The Normalized Gini Index feature selector combined with a Support Vector Machines classifier obtained the best F3 value (84.0%), with 92.2% sensitivity and 47.4% positive predictive value. A 96% reduction in analysis time was computed compared with manual EMR review.
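As a rough consistency check, the reported sensitivity and positive predictive value can be plugged into the standard F-beta formula. This assumes that F3 means F-beta with beta = 3, that sensitivity plays the role of recall R, and that positive predictive value plays the role of precision P; the abstract does not state the definition explicitly.

```latex
F_\beta = \frac{(1+\beta^2)\,P\,R}{\beta^2 P + R},
\qquad
F_3 = \frac{10 \times 0.474 \times 0.922}{9 \times 0.474 + 0.922}
    = \frac{4.370}{5.188} \approx 0.842
```

The result, about 84.2%, is close to the reported 84.0%. The small gap may reflect rounding of the inputs or the way scores were averaged across validation runs, which the abstract does not specify.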

Conclusions: This artificial intelligence algorithm enabled the automatic identification of a large population of syncope patients using EMRs.

Keywords: Emergency Department; artificial intelligence; electronic medical records; natural language processing; syncope.