Identifying Similarities and Disparities Between DNA Copy Number Changes in Cancer and Matched Blood Samples

Cancer Invest. 2019;37(10):535-545. doi: 10.1080/07357907.2019.1667368. Epub 2019 Oct 10.

Abstract

Background: Non-small cell lung cancer (NSCLC) is the first cause of cancer-related mortality for men and women in the United States. In spite of curative resection in early-stage, patient survival is not optimal and recurrence rate is high. Consequently, early detection and staging is essential to increase the patient's survival.Methods: Copy number (CN) changes in cancer populations have been broadly investigated to identify CN gains and deletions associated with cancer. In contrast, in this research, we quantify the similarities and disparities between cancer and paired peripheral blood samples using maximal information coefficient (MIC). We then detect the spatial locations with substantially high and the spatial locations with very low MICs in each chromosome. These locations can potentially help with early diagnosis, treatment, and prevention of cancer by identifying the similarities and disparities between cancer and healthy tissues.Results: Lung cancer data used in this project contains CN pairs for cancer and blood (non-involved) samples for 63 subjects. MIC was obtained to quantify the relation (linear or nonlinear) between cancer-blood pair samples for 63 subjects at each location for each chromosome. MIC values above a high threshold and MIC values below a low threshold were located. Among them top five (with lowest MIC's and with highest MIC's) were identified for each chromosome. For these identified locations, a high MIC score indicates high similarity between blood (non-involved) and cancer samples, while a low MIC score shows lack of similarity between the two samples.Conclusions: The results showed that a few chromosomes have a large number of MICs exceeding a high threshold. These locations can potentially be used to identify early indicators of NSCLC. In contrast, second group of chromosomes have several locations with small MICs which are potential candidates to develop biomarkers for discriminating cancer from the matched blood sample. Moreover, there is a third group of chromosomes with a large number of MICs exceeding a high threshold and a large set of MICs below a low threshold. These locations can help with both finding early indicators of cancer and developing biomarkers for discriminating cancer from non-involved tissue.

Keywords: DNA copy number; Maximal information coefficient; lung cancer; nonlinear correlation.

MeSH terms

  • Biomarkers, Tumor / genetics
  • Carcinoma, Non-Small-Cell Lung / genetics*
  • Carcinoma, Non-Small-Cell Lung / pathology
  • DNA Copy Number Variations / genetics*
  • Female
  • Humans
  • Lung Neoplasms / genetics*
  • Lung Neoplasms / pathology
  • Male
  • Neoplasm Recurrence, Local / genetics
  • Neoplasm Recurrence, Local / pathology
  • Neoplasm Staging / methods

Substances

  • Biomarkers, Tumor