Erroneous data: The Achilles' heel of AI and personalized medicine

Thomas Birk Kristiansen; Kent Kristensen; Jakob Uffelmann; Ivan Brandslund

doi:10.3389/fdgth.2022.862095

Front Digit Health. 2022 Jul 22;4:862095. doi: 10.3389/fdgth.2022.862095. eCollection 2022.

Authors

Thomas Birk Kristiansen¹, Kent Kristensen², Jakob Uffelmann³,⁴, Ivan Brandslund⁵

Affiliations

¹ Ishøjcentrets Læger, Ishøj, Denmark.
² Institute of Law, University of Southern Denmark, Odense, Denmark.
³ Public Danish E-Health Portal (Sundhed.dk), Copenhagen, Denmark.
⁴ Sundhed.dk International Foundation, Copenhagen, Denmark.
⁵ Department of Medical Science and Artificial Intelligence, Institute of Regional Health Research, University Hospital of Southern Denmark Sygehus Lillebælt (SLB), University of Southern Denmark, Odense, Denmark.

Abstract

This paper reviews dilemmas and implications of erroneous data for clinical implementation of AI. It is well known that if erroneous and biased data are used to train AI, there is a risk of systematic error. However, even perfectly trained AI applications can produce faulty outputs if fed with erroneous inputs. To counter such problems, we suggest three steps: (1) AI should focus on data of the highest quality, in essence paraclinical data and digital images, (2) patients should be granted simple access to the input data that feed the AI, and granted a right to request changes to erroneous data, and (3) automated high-throughput methods for error correction should be implemented in domains with faulty data when possible. Also, we conclude that erroneous data are a reality even for highly reputable Danish data sources, and thus a legal framework for the correction of errors is universally needed.

Keywords: AI; artificial intelligence; data quality; deep learning; machine learning (ML); personalized medicine.

Publication types

Review