Exploring the potential of artificial intelligence in improving skin lesion diagnosis in primary care

Anna Escalé-Besa; Oriol Yélamos; Josep Vidal-Alaball; Aïna Fuster-Casanovas; Queralt Miró Catalina; Alexander Börve; Ricardo Ander-Egg Aguilar; Xavier Fustà-Novell; Xavier Cubiró; Mireia Esquius Rafat; Cristina López-Sanchez; Francesc X Marin-Gomez

doi:10.1038/s41598-023-31340-1

Exploring the potential of artificial intelligence in improving skin lesion diagnosis in primary care

Sci Rep. 2023 Mar 15;13(1):4293. doi: 10.1038/s41598-023-31340-1.

Authors

Anna Escalé-Besa^{1

2}, Oriol Yélamos^{3

4}, Josep Vidal-Alaball^{5

6

7}, Aïna Fuster-Casanovas^{2

8}, Queralt Miró Catalina^{2

8}, Alexander Börve^{9

10}, Ricardo Ander-Egg Aguilar⁹, Xavier Fustà-Novell¹¹, Xavier Cubiró¹², Mireia Esquius Rafat¹¹, Cristina López-Sanchez^{3

4}, Francesc X Marin-Gomez^{2

13}

Affiliations

¹ Centre d'Atenció Primària Navàs-Balsareny, Institut Català de la Salut, Navàs, Spain.
² Health Promotion in Rural Areas Research Group, Gerència Territorial de la Catalunya Central, Institut Català de la Salut, Sant Fruitós de Bages, Spain.
³ Dermatology Department, Hospital de la Santa Creu i Sant Pau, Universitat Autònoma de Barcelona, Barcelona, Spain.
⁴ Dermatology Associate Research Group, Institut d'Investigació Biomèdica Sant Pau (IIB SANT PAU), Barcelona, Spain.
⁵ Health Promotion in Rural Areas Research Group, Gerència Territorial de la Catalunya Central, Institut Català de la Salut, Sant Fruitós de Bages, Spain. jvidal.cc.ics@gencat.cat.
⁶ Unitat de Suport a la Recerca de la Catalunya Central, Fundació Institut Universitari per a la Recerca a l'Atenció Primària de Salut Jordi Gol i Gurina, Sant Fruitós de Bages, Spain. jvidal.cc.ics@gencat.cat.
⁷ Factulty of Medicine, University of Vic-Central University of Catalonia, Vic, Spain. jvidal.cc.ics@gencat.cat.
⁸ Unitat de Suport a la Recerca de la Catalunya Central, Fundació Institut Universitari per a la Recerca a l'Atenció Primària de Salut Jordi Gol i Gurina, Sant Fruitós de Bages, Spain.
⁹ iDoc24 Inc, San Francisco, CA, USA.
¹⁰ Institute of Clinical Sciences, University of Gothenburg, Sahlgrenska, Gothenburg, Sweden.
¹¹ Fundació Althaia de Manresa, Manresa, Spain.
¹² Servei de Dermatologia, Hospital Universitari Mollet, Mollet del Vallès, Barcelona, Spain.
¹³ Servei d'Atenció Primària Osona, Gerència Territorial de la Catalunya Central, Institut Català de La Salut, Vic, Spain.

Abstract

Dermatological conditions are a relevant health problem. Machine learning (ML) models are increasingly being applied to dermatology as a diagnostic decision support tool using image analysis, especially for skin cancer detection and disease classification. The objective of this study was to perform a prospective validation of an image analysis ML model, which is capable of screening 44 skin diseases, comparing its diagnostic accuracy with that of General Practitioners (GPs) and teledermatology (TD) dermatologists in a real-life setting. Prospective, diagnostic accuracy study including 100 consecutive patients with a skin problem who visited a participating GP in central Catalonia, Spain, between June 2021 and October 2021. The skin issue was first assessed by the GPs. Then an anonymised skin disease picture was taken and uploaded to the ML application, which returned a list with the Top-5 possible diagnosis in order of probability. The same image was then sent to a dermatologist via TD for diagnosis, as per clinical practice. The GPs Top-3, ML model's Top-5 and dermatologist's Top-3 assessments were compared to calculate the accuracy, sensitivity, specificity and diagnostic accuracy of the ML models. The overall Top-1 accuracy of the ML model (39%) was lower than that of GPs (64%) and dermatologists (72%). When the analysis was limited to the diagnoses on which the algorithm had been explicitly trained (n = 82), the balanced Top-1 accuracy of the ML model increased (48%) and in the Top-3 (75%) was comparable to the GPs Top-3 accuracy (76%). The Top-5 accuracy of the ML model (89%) was comparable to the dermatologist Top-3 accuracy (90%). For the different diseases, the sensitivity of the model (Top-3 87% and Top-5 96%) is higher than that of the clinicians (Top-3 GPs 76% and Top-3 dermatologists 84%) only in the benign tumour pathology group, being on the other hand the most prevalent category (n = 53). About the satisfaction of professionals, 92% of the GPs considered it as a useful diagnostic support tool (DST) for the differential diagnosis and in 60% of the cases as an aid in the final diagnosis of the skin lesion. The overall diagnostic accuracy of the model in this study, under real-life conditions, is lower than that of both GPs and dermatologists. This result aligns with the findings of few existing prospective studies conducted under real-life conditions. The outcomes emphasize the significance of involving clinicians in the training of the model and the capability of ML models to assist GPs, particularly in differential diagnosis. Nevertheless, external testing in real-life conditions is crucial for data validation and regulation of these AI diagnostic models before they can be used in primary care.

Publication types

Research Support, Non-U.S. Gov't

MeSH terms

Artificial Intelligence
Humans
Primary Health Care
Prospective Studies
Skin Diseases* / diagnosis
Skin Neoplasms* / diagnosis
Skin Neoplasms* / pathology