Diagnostic Support for Selected Paediatric Pulmonary Diseases Using Answer-Pattern Recognition in Questionnaires Based on Combined Data Mining Applications--A Monocentric Observational Pilot Study

PLoS One. 2015 Aug 12;10(8):e0135180. doi: 10.1371/journal.pone.0135180. eCollection 2015.

Abstract

Background: Clinical symptoms in children with pulmonary diseases are frequently non-specific. Rare diseases such as primary ciliary dyskinesia (PCD), cystic fibrosis (CF) or protracted bacterial bronchitis (PBB) can be easily missed at the general practitioner (GP).

Objective: To develop and test a questionnaire-based and data mining-supported tool providing diagnostic support for selected pulmonary diseases.

Methods: First, interviews with parents of affected children were conducted and analysed. These parental observations during the pre-diagnostic time formed the basis for a new questionnaire addressing the parents' view on the disease. Secondly, parents with a sick child (e.g. PCD, PBB) answered the questionnaire and a data base was set up. Finally, a computer program consisting of eight different classifiers (support vector machine (SVM), artificial neural network (ANN), fuzzy rule-based, random forest, logistic regression, linear discriminant analysis, naive Bayes and nearest neighbour) and an ensemble classifier was developed and trained to categorise any given new questionnaire and suggest a diagnosis. For estimating the diagnostic accuracy, we applied ten-fold stratified cross validation.

Results: All questionnaires of patients suffering from CF, asthma (AS), PCD, acute bronchitis (AB) and the healthy control group were correctly diagnosed by the fusion algorithm. For the pneumonia (PM) group 19/21 (90.5%) and for the PBB group 17/18 (94.4%) correct diagnoses could be reached. The program detected the correct diagnoses with an overall sensitivity of 98.8%. Receiver operating characteristics (ROC) analyses confirmed the accuracy of this diagnostic tool. Case studies highlighted the applicability of the tool in the daily work of a GP.

Conclusion: For children with symptoms of pulmonary diseases a questionnaire-based diagnostic support tool using data mining techniques exhibited good results in arriving at diagnostic suggestions. In the hands of a doctor, this tool could be of value in arousing awareness for rare pulmonary diseases such as PCD or CF.

Publication types

  • Observational Study

MeSH terms

  • Child
  • Child, Preschool
  • Data Interpretation, Statistical
  • Data Mining
  • Female
  • Humans
  • Infant
  • Lung Diseases / diagnosis*
  • Male
  • Pilot Projects
  • Surveys and Questionnaires*

Grants and funding

The authors received no funding for this work. One of the authors [WL] received salaries through Improved Medical Diagnostics (IMD) Ptd Ltd., Singapore. The specific role of this author is articulated in the ‘author contributions’ section.