PAD-UFES-20: A skin lesion dataset composed of patient data and clinical images collected from smartphones

Data Brief. 2020 Aug 25:32:106221. doi: 10.1016/j.dib.2020.106221. eCollection 2020 Oct.

Abstract

Over the past few years, different Computer-Aided Diagnosis (CAD) systems have been proposed to tackle skin lesion analysis. Most of these systems work only for dermoscopy images since there is a strong lack of public clinical images archive available to evaluate the aforementioned CAD systems. To fill this gap, we release a skin lesion benchmark composed of clinical images collected from smartphone devices and a set of patient clinical data containing up to 21 features. The dataset consists of 1373 patients, 1641 skin lesions, and 2298 images for six different diagnostics: three skin diseases and three skin cancers. In total, 58.4% of the skin lesions are biopsy-proven, including 100% of the skin cancers. By releasing this benchmark, we aim to support future research and the development of new tools to assist clinicians to detect skin cancer.

Keywords: Cancer research; Clinical data; Computer-Aided Diagnosis (CAD); Skin cancer; Skin lesion.