Dead or alive? Pitfall of survival analysis with TCGA datasets

Cancer Biol Ther. 2021 Dec 2;22(10-12):527-528. doi: 10.1080/15384047.2021.1979845. Epub 2021 Sep 16.

Abstract

We often encounter situations in which data from the TCGA that have been analyzed in papers we read or reviewed cannot be reproduced, even when TCGA datasets are used, especially in survival analyses. Therefore, we attempted to confirm the data source for TCGA survival analysis and found that several websites used to analyze the survival data of TCGA datasets inappropriately handle the survival data, causing differences in statistical analyses. This causes the misinterpretation of results because figures of survival analysis results in several papers are sometimes exactly as generated by these sites, and the results depend on only the tools provided by these sites. We would like to make this situation widely known and raise the problem for scientific soundness.

Keywords: Clinical data; Kaplan–Meier method; TCGA; reproducibility; survival analysis.

MeSH terms

  • Humans
  • Kaplan-Meier Estimate
  • Prognosis*
  • Survival Analysis