Comprehensible knowledge model creation for cancer treatment decision making

Comput Biol Med. 2017 Mar 1:82:119-129. doi: 10.1016/j.compbiomed.2017.01.010. Epub 2017 Jan 29.

Abstract

Background: A wealth of clinical data exists in clinical documents in the form of electronic health records (EHRs). This data can be used for developing knowledge-based recommendation systems that can assist clinicians in clinical decision making and education. One of the big hurdles in developing such systems is the lack of automated mechanisms for knowledge acquisition to enable and educate clinicians in informed decision making.

Materials and methods: An automated knowledge acquisition methodology with a comprehensible knowledge model for cancer treatment (CKM-CT) is proposed. With the CKM-CT, clinical data are acquired automatically from documents. Quality of data is ensured by correcting errors and transforming various formats into a standard data format. Data preprocessing involves dimensionality reduction and missing value imputation. Predictive algorithm selection is performed on the basis of the ranking score of the weighted sum model. The knowledge builder prepares knowledge for knowledge-based services: clinical decisions and education support.

Results: Data is acquired from 13,788 head and neck cancer (HNC) documents for 3447 patients, including 1526 patients of the oral cavity site. In the data quality task, 160 staging values are corrected. In the preprocessing task, 20 attributes and 106 records are eliminated from the dataset. The Classification and Regression Trees (CRT) algorithm is selected and provides 69.0% classification accuracy in predicting HNC treatment plans, consisting of 11 decision paths that yield 11 decision rules.

Conclusion: Our proposed methodology, CKM-CT, is helpful to find hidden knowledge in clinical documents. In CKM-CT, the prediction models are developed to assist and educate clinicians for informed decision making. The proposed methodology is generalizable to apply to data of other domains such as breast cancer with a similar objective to assist clinicians in decision making and education.

Keywords: Algorithm selection; Decision support; Education support; Knowledge acquisition; Prediction model.

MeSH terms

  • Algorithms
  • Clinical Decision-Making / methods
  • Data Accuracy
  • Data Mining / methods*
  • Decision Support Systems, Clinical / organization & administration*
  • Decision Support Techniques*
  • Electronic Health Records / organization & administration*
  • Humans
  • Knowledge Bases*
  • Neoplasms / diagnosis*
  • Neoplasms / therapy*
  • Reproducibility of Results
  • Sensitivity and Specificity