Machine Learning for Knowledge Extraction from PHR Big Data

Stud Health Technol Inform. 2014:202:36-9.

Abstract

Cloud computing, Internet of Things (IoT) and NoSQL database technologies can support a new generation of cloud-based PHR services that contain heterogeneous (unstructured, semi-structured and structured) patient data (health, social and lifestyle) from various sources, including data transmitted automatically from Internet-connected devices in the patient's living space (e.g. medical devices connected to patients receiving home care). The patient data stored in such PHR systems constitute big data whose analysis with appropriate machine learning algorithms is expected to improve diagnosis and treatment accuracy, to cut healthcare costs and, hence, to improve the overall quality and efficiency of the healthcare provided. This paper describes a health data analytics engine that uses machine learning algorithms to analyze cloud-based PHR big health data, with the aim of extracting knowledge that supports better healthcare delivery in disease diagnosis and prognosis. The engine comprises data preparation, model generation and data analysis modules and runs on the cloud, taking advantage of the map/reduce paradigm provided by Apache Hadoop.
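The map/reduce paradigm mentioned in the abstract can be illustrated with a minimal, single-process sketch in plain Python. This is not the engine described in the paper and does not use Apache Hadoop; the record fields (`patient_id`, `type`, `value`) and the function names (`map_record`, `shuffle`, `reduce_values`, `run_job`) are hypothetical, chosen only to show how per-record map output might be grouped by key and reduced to a per-measurement average during a data preparation step.

```python
from collections import defaultdict
from itertools import chain

# Hypothetical PHR observations; the field names are illustrative assumptions,
# not a schema taken from the paper.
RECORDS = [
    {"patient_id": "p1", "type": "heart_rate", "value": 70},
    {"patient_id": "p2", "type": "heart_rate", "value": 80},
    {"patient_id": "p1", "type": "glucose", "value": 95},
    {"patient_id": "p2", "type": "glucose", "value": 105},
]


def map_record(record):
    """Map step: emit one (key, (partial_sum, partial_count)) pair per record."""
    yield record["type"], (record["value"], 1)


def shuffle(pairs):
    """Shuffle step: group all mapped values by key."""
    grouped = defaultdict(list)
    for key, value in pairs:
        grouped[key].append(value)
    return grouped


def reduce_values(key, values):
    """Reduce step: combine partial sums and counts into a mean per key."""
    total = sum(v for v, _ in values)
    count = sum(c for _, c in values)
    return key, total / count


def run_job(records):
    """Run map, shuffle and reduce in sequence within one process."""
    pairs = chain.from_iterable(map_record(r) for r in records)
    return dict(reduce_values(k, vs) for k, vs in shuffle(pairs).items())


if __name__ == "__main__":
    print(run_job(RECORDS))
```

In a distributed setting a framework such as Hadoop would run the map and reduce steps on separate nodes and handle the shuffle itself; the sketch keeps all three steps in one process so that the data flow is easy to follow.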

MeSH terms

  • Cloud Computing*
  • Data Mining / methods*
  • Datasets as Topic*
  • Electronic Health Records / organization & administration*
  • Health Records, Personal
  • Knowledge Management
  • Machine Learning*
  • Natural Language Processing*
  • Pattern Recognition, Automated / methods