Big data and machine learning algorithms for health-care delivery

Lancet Oncol. 2019 May;20(5):e262-e273. doi: 10.1016/S1470-2045(19)30149-4.

Abstract

Analysis of big data by machine learning offers considerable advantages for assimilation and evaluation of large amounts of complex health-care data. However, to use machine learning tools effectively in health care, several limitations must be addressed and key issues considered, such as their clinical implementation and the ethics of their use in health-care delivery. Compared with traditional biostatistical methods, machine learning is more flexible and scalable, which makes it deployable for many tasks, such as risk stratification, diagnosis and classification, and survival prediction. Another advantage of machine learning algorithms is the ability to analyse diverse data types (eg, demographic data, laboratory findings, imaging data, and doctors' free-text notes) and incorporate them into predictions for disease risk, diagnosis, prognosis, and appropriate treatments. Despite these advantages, the application of machine learning in health-care delivery also presents unique challenges that require data pre-processing, model training, and refinement of the system with respect to the actual clinical problem. Also crucial are ethical considerations, which include medico-legal implications, doctors' understanding of machine learning tools, and data privacy and security. In this Review, we discuss some of the benefits and challenges of big data and machine learning in health care.

Publication types

  • Review

MeSH terms

  • Big Data*
  • Data Mining*
  • Delivery of Health Care, Integrated*
  • Diagnosis, Computer-Assisted
  • Health Services Research
  • Humans
  • Machine Learning*
  • Medical Oncology*
  • Neoplasms* / diagnosis
  • Neoplasms* / epidemiology
  • Neoplasms* / therapy
  • Neural Networks, Computer*
  • Therapy, Computer-Assisted