Designing and evaluating a clustering system for organizing and integrating patient drug outcomes in personal health messages

AMIA Annu Symp Proc. 2012:2012:417-26. Epub 2012 Nov 3.

Abstract

Patient outcomes to drugs vary, but physicians currently have little data about individual responses. We designed a comprehensive system to organize and integrate patient outcomes utilizing semantic analysis, which groups large collections of personal comments into a series of topics. A prototype implementation was built to extract situational evidences by filtering and digesting user comments provided by patients. Our methods do not require extensive training or dictionaries, while categorizing comments based on expert opinions from standard source, or patient-specified categories. This system has been tested with sample health messages from our unique dataset from Yahoo! Groups, containing 12M personal messages from 27K public groups in Health and Wellness. We have performed an extensive evaluation of the clustering results with medical students. Evaluated results show high quality of labeled clustering, promising an effective automatic system for discovering patient outcomes from large volumes of health information.

Publication types

  • Evaluation Study
  • Research Support, U.S. Gov't, Non-P.H.S.

MeSH terms

  • Adverse Drug Reaction Reporting Systems
  • Cluster Analysis
  • Data Mining
  • Drug Therapy*
  • Electronic Data Processing
  • Health Education
  • Humans
  • Mathematical Concepts
  • Outcome Assessment, Health Care / methods*
  • PubMed
  • Support Vector Machine
  • Terminology as Topic*