Genomic data integration in chronic lymphocytic leukemia

J Gene Med. 2017 Jan;19(1-2). doi: 10.1002/jgm.2936.

Abstract

Background: B-cell chronic lymphocytic leukemia (CLL) is a heterogeneous disease and the most common adult leukemia in western countries. IgVH mutational status distinguishes two major types of CLL, each associated with a different prognosis and survival. Sequencing identified NOTCH1 and SF3B1 as the two main recurrent mutations. We described a novel method to clarify how these mutations affect gene expression by finding small-scale signatures that predict the IgVH, NOTCH1 and SF3B1 mutations. We subsequently defined the biological pathways and correlation networks involved in disease development, with the potential goal of identifying new drugable targets.

Methods: We modeled a microarray dataset consisting of 48807 probes derived from 163 samples. The use of Fisher's ratio and fold change combined with feature elimination allowed us to identify the minimum number of genes with the highest predictive mutation power and, subsequently, we applied network and pathway analyses of these genes to identify their biological roles.

Results: The mutational status of the patients was accurately predicted (94-99%) using small-scale gene signatures: 13 genes for IgVH, 60 for NOTCH1 and 22 for SF3B1. LPL plays an important role in the case of the IgVH mutation, whereas MSI2, LTK, TFEC and CNTAP2 are involved in the NOTCH1 mutation, and RPL32 and PLAGL1 are involved in the SF3B1 mutation. Four high discriminatory genes (IGHG1, MYBL1, NRIP1 and RGS1) are common to these three mutations. The IL-4-mediated signaling events pathway appears to be involved as a common mechanism and suggests an important role of the immune response mechanisms and antigen presentation.

Conclusions: This retrospective analysis served to provide a deeper understanding of the effects of the different mutations in CLL disease progression, with the expectation that these findings will be clinically applied in the near future to the development of new drugs.

Keywords: cancer; gene expression; hematologic; leukemia; mathematical modeling; oncology.

MeSH terms

  • Biomarkers, Tumor
  • Computational Biology / methods
  • Databases, Nucleic Acid
  • Gene Regulatory Networks
  • Genetic Association Studies
  • Genetic Predisposition to Disease
  • Genomics* / methods
  • Humans
  • Immunoglobulin Heavy Chains / genetics
  • Leukemia, Lymphocytic, Chronic, B-Cell / diagnosis
  • Leukemia, Lymphocytic, Chronic, B-Cell / genetics*
  • Leukemia, Lymphocytic, Chronic, B-Cell / metabolism
  • Leukemia, Lymphocytic, Chronic, B-Cell / mortality
  • Models, Biological
  • Mutation
  • Oligonucleotide Array Sequence Analysis
  • Phosphoproteins / genetics
  • Principal Component Analysis
  • Prognosis
  • RNA Splicing Factors / genetics
  • Receptors, Notch / genetics
  • Reproducibility of Results
  • Retrospective Studies
  • Signal Transduction

Substances

  • Biomarkers, Tumor
  • Immunoglobulin Heavy Chains
  • Phosphoproteins
  • RNA Splicing Factors
  • Receptors, Notch
  • SF3B1 protein, human