Evaluation of an artificial intelligence-based decision support for detection of cutaneous melanoma in primary care - a prospective, real-life, clinical trial

Br J Dermatol. 2024 Jan 17:ljae021. doi: 10.1093/bjd/ljae021. Online ahead of print.

Abstract

Background: Use of artificial intelligence, or machine learning, to assess dermoscopic images of skin lesions to detect melanoma has in several retrospective studies shown high levels of diagnostic accuracy on par with, or even outperforming, experienced dermatologists. However, the enthusiasm around these algorithms has not yet been matched by prospective clinical trials performed in authentic clinical settings. In several European countries, including Sweden, the initial clinical assessment of suspected skin cancer is principally conducted in the primary health care setting by primary care physicians; with or without access to teledermoscopic support from dermatology clinics.

Objective: To determine the diagnostic performance of an artificial intelligence-based clinical decision support tool for cutaneous melanoma detection, operated by a smartphone application (app), when used prospectively by primary care physicians to assess skin lesions of concern due to some degree of melanoma suspicion.

Methods: This prospective, multicentre, clinical trial was conducted at 36 primary care centres in Sweden. The physicians used the smartphone app on skin lesions of concern by photographing them dermoscopically, which resulted in a dichotomous decision support text regarding evidence for melanoma. Regardless of the app outcome, all lesions underwent standard diagnostic procedure, by surgical excision or referral to dermatologist. After completed investigation, lesion diagnoses were collected from the patients' medical records and compared to app outcome and other lesion data.

Results: In total, 253 lesions of concern in 228 patients were included, of which 21 proved to be melanomas, with 11 thin invasive melanomas and 10 melanomas in situ. The app's accuracy (95% confidence interval) in identifying melanomas was reflected in an area under the receiver operating characteristic (AUROC) curve of 0.960 (0.928-0.980), corresponding to a maximum sensitivity and specificity of 95.2% and 84.5%, respectively. For invasive melanomas alone, the AUROC was 0.988 (0.965-0.997), corresponding to a maximum sensitivity and specificity of 100% and 92.6%, respectively.

Conclusions: The clinical decision support tool evaluated in this investigation showed high diagnostic accuracy when used prospectively on primary care patients, which could add significant clinical value for primary care physicians in assessing skin lesions to detect melanoma. ClinicalTrials.gov Identifier: NCT05172232.

Associated data

  • ClinicalTrials.gov/NCT05172232