Identifying patients in target customer segments using a two-stage clustering-classification approach: a hospital-based assessment

Comput Biol Med. 2012 Feb;42(2):213-21. doi: 10.1016/j.compbiomed.2011.11.010. Epub 2011 Dec 16.

Abstract

Identifying patients in a Target Customer Segment (TCS) is important to determine the demand for, and to appropriately allocate resources for, health care services. The purpose of this study is to propose a two-stage clustering-classification model through (1) initially integrating the RFM attribute and K-means algorithm for clustering the TCS patients and (2) then integrating the global discretization method and the rough set theory for classifying hospitalized departments and optimizing health care services. To assess the performance of the proposed model, a dataset was used from a representative hospital (termed Hospital-A) that was extracted from a database from an empirical study in Taiwan comprised of 183,947 samples that were characterized by 44 attributes during 2008. The proposed model was compared with three techniques, Decision Tree, Naive Bayes, and Multilayer Perceptron, and the empirical results showed significant promise of its accuracy. The generated knowledge-based rules provide useful information to maximize resource utilization and support the development of a strategy for decision-making in hospitals. From the findings, 75 patients in the TCS, three hospital departments, and specific diagnostic items were discovered in the data for Hospital-A. A potential determinant for gender differences was found, and the age attribute was not significant to the hospital departments.

MeSH terms

  • Algorithms*
  • Bayes Theorem
  • Cluster Analysis*
  • Computational Biology
  • Decision Trees
  • Delivery of Health Care*
  • Hospitals / statistics & numerical data*
  • Humans
  • Patients / statistics & numerical data*