How to use the Surveillance, Epidemiology, and End Results (SEER) data: research design and methodology

Mil Med Res. 2023 Oct 31;10(1):50. doi: 10.1186/s40779-023-00488-2.

Abstract

In the United States (US), the Surveillance, Epidemiology, and End Results (SEER) program is the only comprehensive source of population-based information that includes stage of cancer at the time of diagnosis and patient survival data. This program aims to provide a database about cancer incidence and survival for studies of surveillance and the development of analytical and methodological tools in the cancer field. Currently, the SEER program covers approximately half of the total cancer patients in the US. A growing number of clinical studies have applied the SEER database in various aspects. However, the intrinsic features of the SEER database, such as the huge data volume and complexity of data types, have hindered its application. In this review, we provided a systematic overview of the commonly used methodologies and study designs for retrospective epidemiological research in order to illustrate the application of the SEER database. Therefore, the goal of this review is to assist researchers in the selection of appropriate methods and study designs for enhancing the robustness and reliability of clinical studies by mining the SEER database.

Keywords: Big data; Epidemiology; Methodologies; Study design; Surveillance, Epidemiology, and End results (SEER).

Publication types

  • Review
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Humans
  • Neoplasms*
  • Reproducibility of Results
  • Research Design*
  • Retrospective Studies
  • SEER Program
  • United States / epidemiology