A curated collection of tissue microarray images and clinical outcome data of prostate cancer patients

Sci Data. 2017 Mar 14:4:170014. doi: 10.1038/sdata.2017.14.

Abstract

Microscopy image data of human cancers provide detailed phenotypes of spatially and morphologically intact tissues at single-cell resolution, thus complementing large-scale molecular analyses, e.g., next generation sequencing or proteomic profiling. Here we describe a high-resolution tissue microarray (TMA) image dataset from a cohort of 71 prostate tissue samples, which was hybridized with bright-field dual colour chromogenic and silver in situ hybridization probes for the tumour suppressor gene PTEN. These tissue samples were digitized and supplemented with expert annotations, clinical information, statistical models of PTEN genetic status, and computer source codes. For validation, we constructed an additional TMA dataset for 424 prostate tissues, hybridized with FISH probes for PTEN, and performed survival analysis on a subset of 339 radical prostatectomy specimens with overall, disease-specific and recurrence-free survival (maximum 167 months). For application, we further produced 6,036 image patches derived from two whole slides. Our curated collection of prostate cancer data sets provides reuse potential for both biomedical and computational studies.

Publication types

  • Dataset
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Gene Expression Profiling
  • High-Throughput Nucleotide Sequencing
  • Humans
  • Male
  • PTEN Phosphohydrolase / genetics
  • Prostatectomy
  • Prostatic Neoplasms / genetics*
  • Prostatic Neoplasms / surgery
  • Proteomics*

Substances

  • PTEN Phosphohydrolase
  • PTEN protein, human