The accuracy of artificial intelligence used for non-melanoma skin cancer diagnoses: a meta-analysis

Kuang Ming Kuo; Paul C Talley; Chao-Sheng Chang

doi:10.1186/s12911-023-02229-w

The accuracy of artificial intelligence used for non-melanoma skin cancer diagnoses: a meta-analysis

BMC Med Inform Decis Mak. 2023 Jul 28;23(1):138. doi: 10.1186/s12911-023-02229-w.

Authors

Kuang Ming Kuo¹, Paul C Talley², Chao-Sheng Chang^{3

4}

Affiliations

¹ Department of Business Management, National United University, No.1, Miaoli, 360301, Lienda, Taiwan, Republic of China.
² Department of Applied English, I-Shou University, No. 1, Sec. 1, Syuecheng Rd., Dashu District, 84001, Kaohsiung City, Taiwan, Republic of China.
³ Department of Occupational Therapy, I-Shou University, No. 1, Yida Rd., Yanchao District, 82445, Kaohsiung City, Taiwan, Republic of China. zincfinger522@yahoo.com.tw.
⁴ Department of Emergency Medicine, E-Da Hospital, I-Shou University, Kaohsiung, Taiwan, Republic of China. zincfinger522@yahoo.com.tw.

Abstract

Background: With rising incidence of skin cancer and relatively increased mortality rates, an improved diagnosis of such a potentially fatal disease is of vital importance. Although frequently curable, it nevertheless places a considerable burden upon healthcare systems. Among the various types of skin cancers, non-melanoma skin cancer is most prevalent. Despite such prevalence and its associated cost, scant proof concerning the diagnostic accuracy via Artificial Intelligence (AI) for non-melanoma skin cancer exists. This study meta-analyzes the diagnostic test accuracy of AI used to diagnose non-melanoma forms of skin cancer, and it identifies potential covariates that account for heterogeneity between extant studies.

Methods: Various electronic databases (Scopus, PubMed, ScienceDirect, SpringerLink, and Dimensions) were examined to discern eligible studies beginning from March 2022. Those AI studies predictive of non-melanoma skin cancer were included. Summary estimates of sensitivity, specificity, and area under receiver operating characteristic curves were used to evaluate diagnostic accuracy. The revised Quality Assessment of Diagnostic Studies served to assess any risk of bias.

Results: A literature search produced 39 eligible articles for meta-analysis. The summary sensitivity, specificity, and area under receiver operating characteristic curve of AI for diagnosing non-melanoma skin cancer was 0.78, 0.98, & 0.97, respectively. Skin cancer typology, data sources, cross validation, ensemble models, types of techniques, pre-trained models, and image augmentation became significant covariates accounting for heterogeneity in terms of both sensitivity and/or specificity.

Conclusions: Meta-analysis results revealed that AI is predictive of non-melanoma with an acceptable performance, but sensitivity may become improved. Further, ensemble models and pre-trained models are employable to improve true positive rating.

Keywords: Artificial intelligence; Cross validation; Diagnostic test accuracy; Ensemble models; Meta-analysis; Non-melanoma skin cancers; Pre-trained models.

Publication types

Meta-Analysis
Research Support, Non-U.S. Gov't

MeSH terms

Artificial Intelligence*
Humans
Physical Examination / methods
ROC Curve
Sensitivity and Specificity
Skin Neoplasms* / diagnosis