Erroneous data: The Achilles' heel of AI and personalized medicine

Front Digit Health. 2022 Jul 22:4:862095. doi: 10.3389/fdgth.2022.862095. eCollection 2022.

Abstract

This paper reviews dilemmas and implications of erroneous data for clinical implementation of AI. It is well-known that if erroneous and biased data are used to train AI, there is a risk of systematic error. However, even perfectly trained AI applications can produce faulty outputs if fed with erroneous inputs. To counter such problems, we suggest 3 steps: (1) AI should focus on data of the highest quality, in essence paraclinical data and digital images, (2) patients should be granted simple access to the input data that feed the AI, and granted a right to request changes to erroneous data, and (3) automated high-throughput methods for error-correction should be implemented in domains with faulty data when possible. Also, we conclude that erroneous data is a reality even for highly reputable Danish data sources, and thus, legal framework for the correction of errors is universally needed.

Keywords: AI; artificial intelligence; data quality; deep learning; machine learning (ML); personalized medicine.

Publication types

  • Review