Data-driven knowledge acquisition, validation, and transformation into HL7 Arden Syntax

Maqbool Hussain; Muhammad Afzal; Taqdir Ali; Rahman Ali; Wajahat Ali Khan; Arif Jamshed; Sungyoung Lee; Byeong Ho Kang; Khalid Latif

doi:10.1016/j.artmed.2015.09.008

Data-driven knowledge acquisition, validation, and transformation into HL7 Arden Syntax

Artif Intell Med. 2018 Nov:92:51-70. doi: 10.1016/j.artmed.2015.09.008. Epub 2015 Oct 28.

Authors

Maqbool Hussain¹, Muhammad Afzal², Taqdir Ali³, Rahman Ali⁴, Wajahat Ali Khan⁵, Arif Jamshed⁶, Sungyoung Lee⁷, Byeong Ho Kang⁸, Khalid Latif⁹

Affiliations

¹ Department of Computer Engineering, Kyung Hee University, Seocheon-dong, Giheung-gu, Yongin-si 446-701, Gyeonggi-do, Republic of Korea. Electronic address: maqbool.hussain@oslab.khu.ac.kr.
² Department of Computer Engineering, Kyung Hee University, Seocheon-dong, Giheung-gu, Yongin-si 446-701, Gyeonggi-do, Republic of Korea. Electronic address: muhammad.afzal@oslab.khu.ac.kr.
³ Department of Computer Engineering, Kyung Hee University, Seocheon-dong, Giheung-gu, Yongin-si 446-701, Gyeonggi-do, Republic of Korea. Electronic address: taqdir.ali@oslab.khu.ac.kr.
⁴ Department of Computer Engineering, Kyung Hee University, Seocheon-dong, Giheung-gu, Yongin-si 446-701, Gyeonggi-do, Republic of Korea. Electronic address: rahmanali@oslab.khu.ac.kr.
⁵ Department of Computer Engineering, Kyung Hee University, Seocheon-dong, Giheung-gu, Yongin-si 446-701, Gyeonggi-do, Republic of Korea. Electronic address: wajahat.alikhan@oslab.khu.ac.kr.
⁶ Department of Radiation Oncology, Shaukat Khanum Memorial Cancer Hospital and Research Centre, 7A Block R-3, M.A. Johar Town, Lahore 54782, Pakistan. Electronic address: arifj@skm.org.pk.
⁷ Department of Computer Engineering, Kyung Hee University, Seocheon-dong, Giheung-gu, Yongin-si 446-701, Gyeonggi-do, Republic of Korea. Electronic address: sylee@oslab.khu.ac.kr.
⁸ Computing and Information Systems, University of Tasmania, Hobart 7001, Tasmania, Australia. Electronic address: byeong.Kang@utas.edu.au.
⁹ Department of Computer Science, COMSATS Institute of Information Technology, Park Road, Islamabad 45550, Pakistan. Electronic address: khalid.latif@comsats.edu.pk.

PMID: 26573247
DOI: 10.1016/j.artmed.2015.09.008

Abstract

Objective: The objective of this study is to help a team of physicians and knowledge engineers acquire clinical knowledge from existing practices datasets for treatment of head and neck cancer, to validate the knowledge against published guidelines, to create refined rules, and to incorporate these rules into clinical workflow for clinical decision support.

Methods and materials: A team of physicians (clinical domain experts) and knowledge engineers adapt an approach for modeling existing treatment practices into final executable clinical models. For initial work, the oral cavity is selected as the candidate target area for the creation of rules covering a treatment plan for cancer. The final executable model is presented in HL7 Arden Syntax, which helps the clinical knowledge be shared among organizations. We use a data-driven knowledge acquisition approach based on analysis of real patient datasets to generate a predictive model (PM). The PM is converted into a refined-clinical knowledge model (R-CKM), which follows a rigorous validation process. The validation process uses a clinical knowledge model (CKM), which provides the basis for defining underlying validation criteria. The R-CKM is converted into a set of medical logic modules (MLMs) and is evaluated using real patient data from a hospital information system.

Results: We selected the oral cavity as the intended site for derivation of all related clinical rules for possible associated treatment plans. A team of physicians analyzed the National Comprehensive Cancer Network (NCCN) guidelines for the oral cavity and created a common CKM. Among the decision tree algorithms, chi-squared automatic interaction detection (CHAID) was applied to a refined dataset of 1229 patients to generate the PM. The PM was tested on a disjoint dataset of 739 patients, which gives 59.0% accuracy. Using a rigorous validation process, the R-CKM was created from the PM as the final model, after conforming to the CKM. The R-CKM was converted into four candidate MLMs, and was used to evaluate real data from 739 patients, yielding efficient performance with 53.0% accuracy.

Conclusion: Data-driven knowledge acquisition and validation against published guidelines were used to help a team of physicians and knowledge engineers create executable clinical knowledge. The advantages of the R-CKM are twofold: it reflects real practices and conforms to standard guidelines, while providing optimal accuracy comparable to that of a PM. The proposed approach yields better insight into the steps of knowledge acquisition and enhances collaboration efforts of the team of physicians and knowledge engineers.

Keywords: Clinical decision support systems; Clinical guidelines; HL7 Arden Syntax; Knowledge acquisition; Knowledge validation; Prediction models.

Publication types

Research Support, Non-U.S. Gov't

MeSH terms

Algorithms
Artificial Intelligence*
Decision Support Systems, Clinical / organization & administration*
Expert Systems*
Head and Neck Neoplasms / therapy*
Humans
Information Systems / organization & administration*
Information Systems / standards
Medical Informatics
Practice Guidelines as Topic
Programming Languages
Workflow